Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 12 January 2023
Sec. Systems Microbiology
This article is part of the Research Topic Untangle the Broad Connections and Tight Interactions Between Human Microbiota and Complex Diseases Through Data-Driven Approaches View all 13 articles

Development and validation of an interpretable radiomic nomogram for severe radiation proctitis prediction in postoperative cervical cancer patients

Chaoyi Wei&#x;Chaoyi Wei1Xinli Xiang&#x;Xinli Xiang2Xiaobo ZhouXiaobo Zhou3Siyan RenSiyan Ren3Qingyu ZhouQingyu Zhou3Wenjun DongWenjun Dong3Haizhen LinHaizhen Lin3Saijun WangSaijun Wang3Yuyue ZhangYuyue Zhang3Hai LinHai Lin1Qingzu HeQingzu He1Yuer LuYuer Lu1Xiaoming JiangXiaoming Jiang1Jianwei ShuaiJianwei Shuai1Xiance Jin,
Xiance Jin4,5*Congying Xie
Congying Xie3*
  • 1Wenzhou Key Laboratory of Biophysics, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, Zhejiang Province, China
  • 2The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province, China
  • 3Medical and Radiation Oncology, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province, China
  • 4Radiotherapy Center, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang Province, China
  • 5School of Basic Medical Sciences, Wenzhou Medical University, Wenzhou, Zhejiang Province, China

Background: Radiation proctitis is a common complication after radiotherapy for cervical cancer. Unlike simple radiation damage to other organs, radiation proctitis is a complex disease closely related to the microbiota. However, analysis of the gut microbiota is time-consuming and expensive. This study aims to mine rectal information using radiomics and incorporate it into a nomogram model for cheap and fast prediction of severe radiation proctitis prediction in postoperative cervical cancer patients.

Methods: The severity of the patient’s radiation proctitis was graded according to the RTOG/EORTC criteria. The toxicity grade of radiation proctitis over or equal to grade 2 was set as the model’s target. A total of 178 patients with cervical cancer were divided into a training set (n = 124) and a validation set (n = 54). Multivariate logistic regression was used to build the radiomic and non-raidomic models.

Results: The radiomics model [AUC=0.6855(0.5174-0.8535)] showed better performance and more net benefit in the validation set than the non-radiomic model [AUC=0.6641(0.4904-0.8378)]. In particular, we applied SHapley Additive exPlanation (SHAP) method for the first time to a radiomics-based logistic regression model to further interpret the radiomic features from case-based and feature-based perspectives. The integrated radiomic model enables the first accurate quantitative assessment of the probability of radiation proctitis in postoperative cervical cancer patients, addressing the limitations of the current qualitative assessment of the plan through dose-volume parameters only.

Conclusion: We successfully developed and validated an integrated radiomic model containing rectal information. SHAP analysis of the model suggests that radiomic features have a supporting role in the quantitative assessment of the probability of radiation proctitis in postoperative cervical cancer patients.

1. Introduction

Cervical cancer is a malignant neoplasm at the junction of the squamous epithelial cells of the vaginal or transitional zone of the cervix and the endocervical canal columnar epithelial cells. Cervical cancer is the fourth most common cancer worldwide (Sung et al., 2021). Radiotherapy is one of the most effective methods for treating pelvic malignancies and has an irreplaceable role in treating cervical cancer at all stages. The main complication of radiotherapy for pelvic malignancies is radiation proctitis (Yeung et al., 2020). Fifty percent of patients with cervical cancer or endometrial cancer who received postoperative intensity-modulated radiotherapy developed acute rectal toxicity, and 5%–10% developed chronic rectal toxicity (Zelefsky et al., 2008; Yeung et al., 2020).

Unlike simple radiation damage to other organs, radiation proctitis is a complex disease closely related to the microbiota. A study using a rectal radiation mouse model showed that radiation affected both host and intestinal microbiota (Gerassy-Vainberg et al., 2018). Radiation therapy could induce local microbial ecological dysbiosis, and the dysbiosis microbiota could exert a direct pro-inflammatory effect on epithelial cells. In another study of 32 female patients with chronic radiation proctitis, differential patterns of dysbiosis were observed after analyzing the gut microbiota of patients with or without hematochezia (Liu et al., 2021). Gut microbiota could offer a set of biomarkers for radiation enteritis prediction, disease activity evaluation, and treatment selection (Wang et al., 2019).

However, the current prediction models of radiation proctitis were focused mainly on clinical and radiotherapy dose features. Several univariate and multivariate analyses showed that features, including tumor size, pathological characteristics, and radiological parameters, were significantly correlated with post-radiotherapy comorbidities in patients undergoing pelvic radiotherapy (Albert et al., 2008; Schmidt et al., 2022). A study by Fiorino et al. showed that rectal function was significantly correlated with treatment volume, PTV margins, radiation therapy dose, hemorrhoids presence, anticoagulant use during follow-up, and relative (%) and absolute (cm3) values of rectal V38Gy and V40Gy correlated with rectal bleeding (Fiorino et al., 2008). A study by Mahal et al. also noted that total radiation dose, dose per fraction, radiotherapy techniques, and treatment volume affected the rectum of patients (Mahal et al., 2014).

Another review also suggested features associated with radiation proctitis, including vascular diseases such as smoking, diabetes, hypertensive diabetes and atherosclerosis, collagen vascular disease, comorbid inflammatory bowel disease, and human immunodeficiency virus infection. Also, the review noted that specific underlying genetic changes could affect patients’ sensitivity to radiation. There was a correlation between genes and higher risks of gastrointestinal and genitourinary tract radiotoxicity after radiotherapy (Shadad et al., 2013).

The above studies have shown a strong correlation between patients’ oncologic features, pathologic features, and radiologic dose and the appearance of radiation proctitis in postoperative radiotherapy for pelvic cancer. However, the conclusions of these studies are inconsistent, and the accuracy of the prediction of radiation proctitis is unsatisfactory.

In recent years, computer-aided diagnosis, especially machine learning methods, has also been used for postoperative radiotherapy side effects and comorbidities prediction in oncology patients. Lee et al. applied machine learning methods such as random forest and bioinformatics to genome-wide data to predict and interpret advanced urogenital toxicity (Lee et al., 2018). They designed more robust predictive models and identified plausible biomarkers and biological processes associated with late urogenital toxicity. A study by Lewis & Kemp et al. showed that the integration of machine learning and genome-scale metabolic modeling identified multi-group biomarkers of radiation resistance (Lewis and Kemp, 2021).

However, it should be noted that the machine learning models above were developed using clinical features only. It ignored the large number of features embedded in computed tomography images (CT) that are imperceptible to the naked eye. Moreover, in the process of treatment plan determination, physicians are more focused on extracting focal area information and lack awareness of pelvic rectal information. Therefore, a comprehensive model is urgently needed to deepen the understanding of patient rectal image information to accurately assess radiotherapy treatment plans and reduce severe complications of radiation proctitis.

Deep learning, as well as dynamical modeling, is demonstrating powerful feature extraction and modeling capabilities in various medical fields (Li et al., 2021; Qian et al., 2021; Chen et al., 2022; Hu et al., 2022; Li Y. et al., 2022; Li X. et al., 2022). In data-driven disease research, a graph neuro network was used to predict the potential associations of disease-related metabolites (Sun et al., 2022). Deep learning can also be used to explore the identification of circRNA-disease associations (Wang et al., 2021) and predict the potential human lncRNA interactions (Zhang et al., 2021; Jingxuan et al., 2022; Wang et al., 2022). In drug metabolism research, deep learning can be used to predict the ability of a compound to permeate across the blood–brain barrier (Tang et al., 2022) and drug response (Kuenzi et al., 2020). At the same time, deep learning is also a popular tool for radiotherapy research. Zhong et al. developed a deep learning-based radiomic nomogram that could predict the prognosis of patients with different treatment regimens (Zhong et al., 2021). Qiang et al. established a prognosis prediction system based on deep learning for locoregionally advanced nasopharyngeal carcinoma (Qiang et al., 2021). Although deep learning has been widely applied in the analysis and prediction of various diseases, the poor interpretability of the deep learning model makes it difficult for clinicians to understand and trust these tools (Huff et al., 2021).

Radiomics can extract biomedical images containing information reflecting the underlying pathophysiology and reveal the relationships through quantitative image analysis (Le et al., 2021; Lam et al., 2022). In previous studies, radiomics has been used to predict postoperative radiotherapy-induced toxicity in prostate cancer patients. Mostafaei et al. showed that models based on CT radiomics, clinical features, and dose-volume parameters could predict radiation toxicity. The combination of imaging and clinical features could improve the performance of radiotoxicity prediction models (Mostafaei et al., 2020). However, no study has been performed to predict postoperative radiotherapy comorbidity in cervical cancer patients using radiomic features. Due to the limited resolution, the information on microbiota is almost impossible to extract directly by radiomics in theory, and no relevant studies have been reported. However, it is feasible that radiomics can indirectly reflect the effect of microbiota on the rectum.

Therefore, this study uses radiomics to extract the rectal information from medical images and improve the model performance and diagnostic accuracy through quantitative image analysis. Moreover, this study creatively introduces SHapley Additive exPlanation (SHAP) values to explore the interpretability of nomogram to improve the clinicians’ understanding of the model and its radiomic features, which facilitates later clinical promotion.

2. Materials and methods

2.1. Patients

The study protocol was reviewed and approved by the Ethics Committee in Clinical Research (ECCR) of the First Affiliated Hospital of Wenzhou Medical University. It was conducted following the Declaration of Helsinki. The Transparent Reporting of Individual Prognosis or Diagnosis Multivariate Predictive Models (TRIPOD) guidelines and the Strengthening Reports of Observational Studies in Epidemiology (STROBE) statement were applied. As this study was a retrospective cohort study, informed consent was waived, and all patient data were anonymized and desensitized.

Patients with cervical cancer from 1st January 2015 to 31st December 2020 in the First Affiliated Hospital of Wenzhou Medical University were collected for this study. These patients received a cervical cancer diagnosis, oncological surgery, and postoperative radiotherapy at the First Affiliated Hospital of Wenzhou Medical University.

The inclusion criteria (Figure 1) include (a) no severe symptoms at the time of diagnosis and good general physical condition; (b) patients with relatively intact organ functions, basically standard hematological system, and no contraindications to treatment; (c) no previous history of other malignant tumor diseases and radiotherapy; (d) postoperative pathological examination results confirming the diagnosis of cervical cancer; (e) postoperative radiotherapy; (f) patients with postoperative treatment; (g) the patient had complete pathology, imaging, and radiation therapy dose information; (h) no intracavitary brachytherapy was performed.

FIGURE 1
www.frontiersin.org

Figure 1. Flow diagram of the study enrolment patients.

The exclusion criteria include (a) no definite postoperative pathological findings; (b) no complete clinicopathological data; (c) no CT scan was performed before postoperative radiotherapy; (d) patient’s pathology, imaging, and radiation treatment dose data are missing; (e) intolerance to radiotherapy or chemotherapy, and treatment plan was not completed due to severe acute toxic reactions during treatment; (f) Have received intracavitary brachytherapy.

The Mann–Whitney U-test and the Chi-square test were used to evaluate the performance of clinical and dose-volume features. Patients were randomly divided into a training set and a validation set.

2.2. Extraction of radiomic features

The entire rectal region on the patient’s CT image was defined as the Regions of Interest (ROI). Using ITK-SNAP software (Yushkevich et al., 2006), a pelvic radiologist with 10 years of experience at the First Affiliated Hospital of Wenzhou Medical University outlined this target region manually. Another radiologist with 20 years of experience reviewed it. Extraneous components other than the rectum, such as peripheral vessels, peripheral tissues, and peripheral organs, were not outlined by radiologists to minimize interfering information. The two radiologists did not know the patient’s information. If the two doctors had the same opinion, the ROI would be included in the imaging data set.

Quantitative radiological features were automatically extracted using a feature extraction platform based on the Python package PyRadiomics (van Griethuysen et al., 2017). After segmentation and reconstruction of the patient CT, each patient extracted ROI was imported into Python in nrrd format. We extracted 1,409 radiomic features, including 8 feature classes used for further analysis and regression modeling. Radiomics features were dependent on the CT hardware, scanning parameters, and contrast agents. The process of generation and selection of radiomic features was illustrated in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2. Workflow of the radiomic model development and model analysis process. The orange arrows in the flow chart represented the processing of the radiomic features. The radiomic features were generated by the PyRadiomics package after outlining the rectal region on the original image. After feature selection by T-test, LASSO and univariate logistic regression, multivariate logistic regression models were developed and visualized as nomograms. The model analysis consists of three parts: model simplification, model evaluation, and model interpretation. By comparing the performance change before and after model simplification, we could measure the importance of the radiomic features. In particular, we applied SHapley Additive exPlanation (SHAP) values for the first time to a radiomics-based logistic regression model to further interpret the radiomic features from case-based and feature-based perspectives.

2.3. Feature selection and model development

The variance equality of radiomics features was assessed by Levene’s test. Independent t-test or Wilcoxon’s test was used for feature selection. After standardizing the radiomic features using the z fraction transform, the high-dimensional imaging features extracted from the ROI were selected by the least absolute shrinkage and selection operator (LASSO) regularization algorithm. We performed univariate logistic regression on all features to screen out the key features significantly associated with the severity of radiation proctitis. The value of p was usually set at p < 0.2, but can also be set at p < 0.05 or p < 0.1. It requires the researcher to adjust the value of p according to the sample size. Due to the limited amount of data in this study, we set p < 0.2 as the threshold. Features with p > 0.2 in the univariate logistic regression were excluded, and features with p < 0.2 were included. Finally, the features left after multiple screenings were introduced into a stepwise logistic regression analysis to build a comprehensive model.

2.4. Model simplification and model evaluation

The critical features remained after multiple screenings were included in the multivariate logistic regression model generated by the stepwise forward and backward methods. Finally, to transform the complex regression equations into simple and visual graphs and make the prediction models’ results more readable, a visual nomogram was constructed based on these features that can be stably present in the unified model. All model evaluations were performed on the unseen validation set. In addition, calibration curves were used to evaluate the model performance of the nomogram.

To further validate the performance of the radiomic features, we built a simplified non-radiomic model by removing the radiomic features. To evaluate the performance of these two models, we assessed the discrimination using the receiver operating characteristic (ROC) analysis. The area under the receiver operating characteristic curve (AUC) was used to assess the predictive discrimination of these two models. In addition, in order to verify the validity of the model from another perspective, a k-Nearest Neighbor (KNN) model was built using the same data as the radiomic nomogram. The root-mean-square error (RMSE) and 10-fold cross-validation were used to select the optimal hyperparameter of the KNN model. We used decision curve analysis (DCA) to assess clinical validity by quantifying the net benefit at each threshold probability. All statistical analyses were performed using R (version 4.2.2), Python (version 3.9.12), and SPSS (version 24.0, IBM). The workflow of the model analysis process after modeling was shown in Figure 2.

2.5. Model interpretation

SHapley Additive exPlanation (SHAP) method is a game-theoretic-based model interpretation method. From a game theory perspective, SHAP treats each feature variable as a player. The predicted outcome obtained by the model is considered as the gain from the cooperation of many players to complete a project. It connects optimal credit allocation with local explanations using the classical Shapley values from game theory and their related extensions (Lundberg and Lee, 2017). We used scikit-learn (Pedregosa et al., 2011) to build the logistic regression model and used the SHAP package to calculate the SHAP values for the logistic regression model and further analyze the SHAP values with the SHAP plot module. The decision process of each patient could be presented by force plot. By overlaying the force plots and sorting the output values, we could see how all patients made their decisions. In addition to analyzing the model from the patient’s perspective, we can also use SHAP to understand the model from the feature’s perspective. SHAP provides bar plots and scatter plots of features to help us understand which feature was most important to the model.

3. Results

3.1. Baseline information of patients

This study included 1,093 patients with cervical cancer who needed to initiate radiotherapy at the First Affiliated Hospital of Wenzhou Medical University between 1st January 2015 and 31st December 2020. After screening and exclusion, a total of 178 patients were finally included in our study. The study included 40 patients (22.5%) with a toxicity grade greater than or equal to grade 2 after radiation therapy and 138 patients (77.5%) less than grade 2 after radiation therapy. The patients were divided into a training set (n = 124) and a validation set (n = 54). Table 1 shows the baseline information of the patients.

TABLE 1
www.frontiersin.org

Table 1. Baseline information of all patients.

3.2. Radiomic features selection and multivariate analysis

We extracted a total of 1,409 radiomic features from the patients’ CTs and selected them using the LASSO algorithm. Multivariate logistic regression analysis was performed on all features selected by LASSO and univariate logistic regression. The results of the multivariate logistic regression are shown in Table 2. Radiotherapy techniques [OR = 0.000 (0.000–0.086), p = 0.005], Maximum rectal dose [OR = 1.006 (1.001–1.011), p = 0.020], Contrast [OR = 0.000 (0.000–0.002), p = 0.046] were independent risk factors for severe radiation proctitis.

TABLE 2
www.frontiersin.org

Table 2. Result of multivariate logistic regression.

3.3. Establishment of nomogram and model evaluation

In order to develop a clinically applicable method to predict the occurrence of radiation proctitis, we constructed a radiomics nomogram. The results of the nomogram were shown in Figure 3A. All model evaluations were performed on the unseen validation set. The calibration curve of the combined radiomics nomogram was shown in Figure 3B. To further validate the performance of the radiomic features, we built a simplified non-radiomic model based only on the clinical feature and dose-volume feature by removing the radiomic feature and comparing its performance with the full radiomic model. The ROC curves for the two nomogram models (Figure 3C) showed that the prediction effect of the radiomic model [AUC = 0.6855 (0.5174–0.8535)] performed better than the non-radiomic model [AUC = 0.6641 (0.4904–0.8378)]. The AUC of radiomic nomogram [AUC = 0.6855 (0.5174–0.8535)] was close to that of the KNN model [AUC = 0.7051 (0.5602–0.85)]. It illustrated the validity of the model from another perspective. The decision curve analysis (DCA; Figure 3D) was used to assess the utility of both prediction models by calculating the net benefit at various probability thresholds. According to the decision curves, the radiomic model showed more benefit in predicting the risk of radiation proctitis than the non-radiomic model. It suggested that radiomic features were supporting features for severe radiation proctitis prediction.

FIGURE 3
www.frontiersin.org

Figure 3. Nomogram for severe radiation proctitis prediction in postoperative cervical cancer patients, calibration of the nomogram, and decision curves in the overall patients. The combined nomogram (A) incorporated clinical, dose-volume, and radiomic features. By accumulating the points for each feature, we could quickly calculate the probability of radiation proctitis. All model evaluations were performed on the validation set. The Calibration curves of the combined radiomics nomogram (B) illustrated the relationship between the observed outcome frequencies and the predicted probabilities. The ROC curves (C) demonstrated the accuracy of the radiomic and non-radiomic models and KNN radiomic model. The DCA curves (D) demonstrated the net benefit of the radiomic and non-radiomic models.

3.4. Model interpretation

3.4.1. Case-based model interpretation

To further understand how decision-making occurred for individual and entire patient populations, we used SHAP to analyze from a case-based perspective. Figure 4A represented the decision process for SHAP values across all patients, with the vertical axis representing the magnitude of the SHAP values. As the graph was ordered by model output, we could clearly see the boundary line between red and blue. Features pushing the prediction higher were shown in red, and those pushing the prediction lower were in blue.

FIGURE 4
www.frontiersin.org

Figure 4. SHAP plots demonstrated SHAP values from a case-based perspective. Sampled by model output, the overall SHAP plot (A) showed the decision process of all patients. The force plot (B) and the waterfall plot (C) demonstrated the proportion and absolute SHAP value of various features in the decision-making process for a single patient.

In addition to the model interpretation for all cases, we could also provide a clearer picture of the decision-making situation for individual patients through the waterfall or force plot. For example, by selecting the patient on the far right of Figure 4A, the decision-making process could be visualized in Figure 4B or Figure 4C. Although the presentation was different, the information in Figures 4B,C was consistent. These two plots indicated the proportion and absolute SHAP value of various features in the decision-making process for that patient. SHAP could provide a quantitative and visual representation of the decision mechanisms of the radiomics model for any patient.

3.4.2. Feature-based model interpretation

We calculated and visualized the SHAP values for each feature in the radiomics model. The beeswarm plot (Figure 5A) demonstrated an overview of the feature contribution of all patients. In the beeswarm plot, features were sorted by the sum of SHAP value magnitudes over all samples, and SHAP values were used to show the distribution of each feature’s impacts. The bar plot shown in Figure 5B demonstrated the mean absolute value of the SHAP values for each feature. The plot showed that radiotherapy techniques and the maximum rectal dose have a high mean value. Since SHAP values represented a feature’s responsibility for a change in the model output, Figures 5A,B indicated that the radiotherapy technique and the maximum rectal dose were essential.

FIGURE 5
www.frontiersin.org

Figure 5. The SHAP plots illustrated the feature-based model interpretation process. (A) The beeswarm plot used SHAP values to show the distribution of each feature’s impacts. (B) The standard bar plot demonstrated the mean absolute value of the SHAP values for each feature. These two plots (C,D) showed the SHAP values in different features. Gray bar plots showed the SHAP values for each feature. Scatter plots showed the SHAP values of the other features most relevant to that feature. Vertical dispersion represented interaction effects between the horizontal and vertical features.

To understand how each feature affected the model’s output, we plotted gray bar plots to show the SHAP values for each feature and scatter plots to show the SHAP values of the other features most relevant to that feature (Figures 5C,D).

4. Discussion

With the advancement of radiotherapy techniques, the postoperative survival rates of cancers such as cervical cancer have increased dramatically (Citrin, 2017). However, complications and side effects caused by postoperative radiotherapy or chemotherapy are difficult to avoid. Radiation proctitis is one of the most common complications of postoperative radiotherapy in patients with pelvic tumors (Rustagi et al., 2015; Qian et al., 2021), with mild diarrhea or mild rectal exudate in mild cases and even intestinal necrosis or bleeding in severe cases, endangering patients’ lives (Citrin, 2017). In clinical practice, doctors currently rely on the dose-volume features of radiotherapy plans to assess the risk of radiation proctitis. However, there is a lot of valuable information in pathology and clinical imaging that is not considered by clinicians. Moreover, the sensitivity of the rectum to radiotherapy radiation also varies significantly between individuals.

To further refine the assessment of radiation proctitis, we selected radiomic features associated with radiation proctitis by univariate regression and the LASSO algorithm. Radiotherapy techniques (OR = 0.000 (0.000–0.086), p = 0.005), Maximum rectal dose (OR = 1.006 (1.001–1.011), p = 0.020), Contrast (OR = 0.000 (0.000–0.002), p = 0.046) were independent risk factors for radiation proctitis. Finally, we developed an integrated prediction model based on clinical and radiomic features [AUC = 0.6855 (0.5174–0.8535)]. Current studies of radiation proctitis had mainly focused on local radiotherapy dose limits rather than comprehensive predictive models (Snyder et al., 2001; Huang et al., 2004). There was only one study using radiomics to build a predictive model for radiation proctitis (Mostafaei et al., 2020). In gastrointestinal toxicities modeling, the AUC of radiomic model of their study was 0.71, which was relatively higher compared with our study. However, the study was conducted based on data from only 64 patients and was only suitable for patients with prostate cancer.

The radiomic features of the model potentially incorporated the effect of microbiota on rectal radiosensitivity. The model without radiomic features showed lower validity, while the model containing both radiomic features and clinical features showed better performance on the ROC curve. The change of net benefit in Figure 3D suggested that radiomic features had played a supporting role in predictive models. And as a measure of the local intensity variation, a larger contrast correlated with a greater disparity in intensity values among neighboring voxels. In our study, the contrast suggested that a lower tissue density compared to the surrounding tissue was associated with higher radiosensitivity.

In most cases, PyRadiomics followed the image biomarker standardization initiative (IBSI)'s definition of features. PyRadiomics development was also involved in the standardization effort by the IBSI team. Still, there were some differences between PyRadiomics and feature extraction as defined in the IBSI documents. Most notably were the differences in gray value discretization (just for the fixed bin size type) and resampling. In summary, the definitions of PyRadiomics and IBSI were slightly different, but did not represent one over the other. Moreover, IBSI was only an initiative, not a standard. For these reasons, IBSI would not significantly impact the reproducibility and validity of this study.

While SHAP was often used to explain features in machine learning algorithms and neural network models (Bang et al., 2021; Park et al., 2022; Shaji et al., 2022; Shi et al., 2022), SHAP analysis of logistic regression models had not yet been mentioned. Although logistic regression algorithms were simpler and more explicit than other machine learning algorithms and neural networks, logistic regression models were more challenging to understand than they may seem. Users could not directly measure the importance of features between continuous and categorical variables through odds ratio (OR) or coefficients (Table 2). In particular, for radiomic models, the significant variation in the magnitude of radiomic features made it more challenging to understand the actual decision-making process of the model through the coefficients and OR values of logistic regression. We wanted to help users better understand each feature’s role in the model. In subsequent clinical treatment, model users can further quantify the contribution of radiomic features in each model output.

To address this issue, we introduced SHAP for the first time to a radiomics-based logistic regression model, which further revealed the model’s decision-making mechanism (Figure 4A). The total contribution of SHAP for each feature included in the model was analyzed (Figure 5B). Radiotherapy techniques and the maximum rectal dose occupied vital positions in the model contribution. Notably, the SHAP value of the radiomic feature was the lowest. It suggested that the radiomic feature was weaker than the clinical feature and dose-volume feature. The SHAP could also analyze correlations between variables (Figures 5C,D). Correlations in SHAP values were observed between the three features. It may suggest an inter-collaborative relationship between variables in the model. However, this can only indicate a correlation between SHAP values, not between the values of the variables. In the subplot of therapy (Figure 5C), we can find that the most relevant variable was the maximum rectal dose. There was a harmful effect of maximum rectal dose in the decision-making process of these VMAT therapy samples. However, no fixed pattern was observed in the subplot of contrast (Figure 5D).

SHAP had a unique role in radiomics-based logistic models as a game-theoretic approach. SHAP helped us understand radiomic features that vary significantly in magnitude. Furthermore, SHAP provided a quantitative and visual representation of the decision mechanisms within the model for each patient.

We recommend that clinicians can reduce the value of the maximum rectal dose by modifying the plan when the model suggests that the current radiotherapy plan has a high probability of radiation proctitis. Clinicians can rely on interpretable models to precisely control the risk of the final plan to an acceptable level. Patients with cervical cancer can reduce unnecessary radiation doses and the incidence of radiation proctitis with the help of the comprehensive model.

5. Conclusion

We successfully developed and validated an integrated radiomic model containing rectal information in this study. The integrated radiomic model enables the accurate quantitative assessment of the probability of radiation proctitis in postoperative cervical cancer patients, addressing the limitations of the current qualitative assessment based on dose-volume parameters only. Based on the model output and SHAP values analysis, we suggest that clinicians can adjust the radiation dose to minimize the occurrence of severe radiation proctitis while not compromising the effectiveness of radiation therapy.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee in Clinical Research (ECCR) of the First Affiliated Hospital of Wenzhou Medical University. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

CW, XX, and CX conceived the project, developed the prediction method, designed and implemented the experiments, analyzed the result, and wrote the manuscript. XZ, SR, and XJin implemented the experiments and analyzed the result. QZ, WD, HaizL, SW, and YZ analyzed the result. HaiL, QH, YL, XJia, and JS contributed to the interpretation of the results. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by Zhejiang Engineering Research Center for Innovation and Application of Intelligent Radiotherapy Technology in the Second Affiliated Hospital of Wenzhou Medical University, Wenzhou key Laboratory of radiotherapy and Translational Research of Cancer (2021100848), and Wenzhou Science and Technology Bureau Y2020733. This work was also supported by the Ministry of Science and Technology of the People’s Republic of China under grant 2021ZD0201900, the National Natural Science Foundation of China under grant numbers 12090052 and 11874310, and Major Projects in Fujian Province under grant 2020Y4001.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

SHAP, SHapley Additive exPlanation; CT, computed tomography; ROI, Regions of Interest; LASSO, the least absolute shrinkage and selection operator regularization; DCA, decision curve analysis; AUC, area under curve; OR, odds ratio; ROC, receiver operating characteristic analysis; ECCR, Ethics Committee in Clinical Research; RTOG, Radiation Therapy Oncology Group; EORTC, European Organization for Research and Treatment of Cancer.

References

Albert, M., Song, J. S., Schultz, D., Cormack, R. A., Tempany, C. M., Haker, S., et al. (2008). Defining the rectal dose constraint for permanent radioactive seed implantation of the prostate. Urol. Oncol. 26, 147–152. doi: 10.1016/j.urolonc.2007.03.026

PubMed Abstract | CrossRef Full Text | Google Scholar

Bang, M., Eom, J., An, C., Kim, S., Park, Y. W., Ahn, S. S., et al. (2021). An interpretable multiparametric radiomics model for the diagnosis of schizophrenia using magnetic resonance imaging of the corpus callosum. Transl. Psychiatry 11:462. doi: 10.1038/s41398-021-01586-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, X., Zhu, R., Zhong, J., Ying, Y., Wang, W., Cao, Y., et al. (2022). Mosaic composition of RIP1–RIP3 signalling hub and its role in regulating cell death. Nat. Cell Biol. 24, 471–482. doi: 10.1038/s41556-022-00854-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Citrin, D. E. (2017). Recent developments in radiotherapy. N. Engl. J. Med. 377, 1065–1075. doi: 10.1056/NEJMra1608986

CrossRef Full Text | Google Scholar

Fiorino, C., Alongi, F., Broggi, S., Cattaneo, G. M., Cozzarini, C., Di Muzio, N., et al. (2008). Physics aspects of prostate tomotherapy: planning optimization and image-guidance issues. Acta Oncol. 47, 1309–1316. doi: 10.1080/02841860802266755

PubMed Abstract | CrossRef Full Text | Google Scholar

Gerassy-Vainberg, S., Blatt, A., Danin-Poleg, Y., Gershovich, K., Sabo, E., Nevelsky, A., et al. (2018). Radiation induces proinflammatory dysbiosis: transmission of inflammatory susceptibility by host cytokine induction. Gut, 67, 97–107. doi: 10.1136/gutjnl-2017-313789

PubMed Abstract | CrossRef Full Text | Google Scholar

Hu, H., Liu, R., Zhao, C., Lu, Y., Xiong, Y., Chen, L., et al. (2022). CITEMO(XMBD): a flexible single-cell multimodal omics analysis framework to reveal the heterogeneity of immune cells. RNA Biol. 19, 290–304. doi: 10.1080/15476286.2022.2027151

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, E. Y., Wang, C. J., Hsu, H. C., Hao, L., Chen, H. C., and Sun, L. M. (2004). Dosimetric factors predicting severe radiation-induced bowel complications in patients with cervical cancer: combined effect of external parametrial dose and cumulative rectal dose. Gynecol. Oncol. 95, 101–108. doi: 10.1016/j.ygyno.2004.06.043

PubMed Abstract | CrossRef Full Text | Google Scholar

Huff, D. T., Weisman, A. J., and Jeraj, R. (2021). Interpretation and visualization techniques for deep learning models in medical imaging. Phys. Med. Biol. 66:04TR01. doi: 10.1088/1361-6560/abcd17

PubMed Abstract | CrossRef Full Text | Google Scholar

Jingxuan, J. S., Shuai, S. C., Qi, Z., and Jianwei, S. (2022). Predicting potential interactions between lncRNAs and proteins via combined graph auto-encoder methods. Brief. Bioinform. bbac527. doi: 10.1093/bib/bbac527

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuenzi, B. M., Park, J., Fong, S. H., Sanchez, K. S., Lee, J., Kreisberg, J. F., et al. (2020). Predicting drug response and synergy using a deep learning model of human cancer cells. Cancer Cell 38, 672–684.e6. doi: 10.1016/j.ccell.2020.09.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Lam, L. H. T., Do, D. T., Diep, D. T. N., Nguyet, D. L. N., Truong, Q. D., Tri, T. T., et al. (2022). Molecular subtype classification of low-grade gliomas using magnetic resonance imaging-based radiomics and machine learning. NMR Biomed. 35:e4792. doi: 10.1002/nbm.4792

PubMed Abstract | CrossRef Full Text | Google Scholar

Le, N. Q. K., Kha, Q. H., Nguyen, V. H., Chen, Y. C., Cheng, S. J., and Chen, C. Y. (2021). Machine learning-based Radiomics signatures for EGFR and KRAS mutations prediction in non-Small-cell lung cancer. Int. J. Mol. Sci. 22:9254. doi: 10.3390/ijms22179254

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, S., Kerns, S., Ostrer, H., Rosenstein, B., Deasy, J. O., and Oh, J. H. (2018). Machine learning on a genome-wide association study to predict late genitourinary toxicity after prostate radiation therapy. Int. J. Radiat. Oncol. Biol. Phys. 101, 128–135. doi: 10.1016/j.ijrobp.2018.01.054

PubMed Abstract | CrossRef Full Text | Google Scholar

Lewis, J. E., and Kemp, M. L. (2021). Integration of machine learning and genome-scale metabolic modeling identifies multi-omics biomarkers for radiation resistance. Nat. Commun. 12:2700. doi: 10.1038/s41467-021-22989-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., He, Q., Guo, H., Zhong, C. Q., Li, X., Li, Y., et al. (2022). MSSort-DIA(XMBD): a deep learning classification tool of the peptide precursors quantified by OpenSWATH. J. Proteomics 259:104542. doi: 10.1016/j.jprot.2022.104542

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., Zhang, P., Yin, Z., Xu, F., Yang, Z. H., Jin, J., et al. (2022). Caspase-1 and Gasdermin D afford the optimal targets with distinct switching strategies in NLRP1b Inflammasome-induced cell death. Research 2022, 9838341–9838317. doi: 10.34133/2022/9838341

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., Zhong, C. Q., Wu, R., Xu, X., Yang, Z. H., Cai, S., et al. (2021). RIP1-dependent linear and nonlinear recruitments of caspase-8 and RIP3 respectively to necrosome specify distinct cell death outcomes. Protein Cell 12, 858–876. doi: 10.1007/s13238-020-00810-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, L., Chen, C., Liu, X., Chen, B., Ding, C., and Liang, J. (2021). Altered Gut Microbiota Associated With Hemorrhage in Chronic Radiation Proctitis. Front Oncol. 11:637265. doi: 10.3389/fonc.2021.637265

PubMed Abstract | CrossRef Full Text | Google Scholar

Lundberg, S. M., and Lee, S.-I. (2017). A unified approach to interpreting model predictions. Adv. Neural Inf. Proces. Syst. 30, 4765–4774. doi: 10.48550/arXiv.1705.07874

CrossRef Full Text | Google Scholar

Mahal, B. A., Ziehr, D. R., Hyatt, A. S., Neubauer-Sugar, E. H., O’Farrell, D. A., O’Leary, M. P., et al. (2014). Use of a rectal spacer with low-dose-rate brachytherapy for treatment of prostate cancer in previously irradiated patients: initial experience and short-term results. Brachytherapy 13, 442–449. doi: 10.1016/j.brachy.2014.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Mostafaei, S., Abdollahi, H., Kazempour Dehkordi, S., Shiri, I., Razzaghdoust, A., Zoljalali Moghaddam, S. H., et al. (2020). CT imaging markers to improve radiation toxicity prediction in prostate cancer radiotherapy by stacking regression algorithm. Radiol. Med. 125, 87–97. doi: 10.1007/s11547-019-01082-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, Y. W., Eom, J., Kim, D., Ahn, S. S., Kim, E. H., Kang, S. G., et al. (2022). A fully automatic multiparametric radiomics model for differentiation of adult pilocytic astrocytomas from high-grade gliomas. Eur. Radiol. 32, 4500–4509. doi: 10.1007/s00330-022-08575-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., et al. (2011). Scikit-learn: machine learning in python. J. Machine Learn. Res. 12, 2825–2830. doi: 10.48550/arXiv.1201.0490

CrossRef Full Text | Google Scholar

Qian, X., Qiu, Y., He, Q., Lu, Y., Lin, H., Xu, F., et al. (2021). A review of methods for sleep arousal detection using Polysomnographic signals. Brain Sci. 11:1274. doi: 10.3390/brainsci11101274

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiang, M., Li, C., Sun, Y., Sun, Y., Ke, L., Xie, C., et al. (2021). A prognostic predictive system based on deep learning for Locoregionally advanced nasopharyngeal carcinoma. J. Natl. Cancer Inst. 113, 606–615. doi: 10.1093/jnci/djaa149

PubMed Abstract | CrossRef Full Text | Google Scholar

Rustagi, T., Corbett, F. S., and Mashimo, H. (2015). Treatment of chronic radiation proctopathy with radiofrequency ablation (with video). Gastrointest. Endosc. 81, 428–436. doi: 10.1016/j.gie.2014.04.038

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmidt, D. R., Bhagwat, M., Glazer, D. I., Chen, M. H., Moteabbed, M., McMahon, E., et al. (2022). MRI-based radiotherapy planning to reduce rectal dose in excess of tolerance. Prostate Cancer 2022, 7930744–7930749. doi: 10.1155/2022/7930744

PubMed Abstract | CrossRef Full Text | Google Scholar

Shadad, A. K., Sullivan, F. J., Martin, J. D., and Egan, L. J. (2013). Gastrointestinal radiation injury: symptoms, risk factors and mechanisms. World J. Gastroenterol. 19, 185–198. doi: 10.3748/wjg.v19.i2.185

PubMed Abstract | CrossRef Full Text | Google Scholar

Shaji, S., Palanisamy, R., and Swaminathan, R. (2022). Explainable optimized LightGBM based differentiation of mild cognitive impairment using MR Radiomic features. Stud. Health Technol. Inform. 295, 483–486. doi: 10.3233/SHTI220770

PubMed Abstract | CrossRef Full Text | Google Scholar

Shi, Y., Zou, Y., Liu, J., Wang, Y., Chen, Y., Sun, F., et al. (2022). Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: individual application of SHAP. Front. Oncol. 12:897596. doi: 10.3389/fonc.2022.897596

PubMed Abstract | CrossRef Full Text | Google Scholar

Snyder, K. M., Stock, R. G., Hong, S. M., Lo, Y. C., and Stone, N. N. (2001). Defining the risk of developing grade 2 proctitis following 125I prostate brachytherapy using a rectal dose-volume histogram analysis. Int. J. Radiat. Oncol. Biol. Phys. 50, 335–341. doi: 10.1016/s0360-3016(01)01442-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, F., Sun, J., and Zhao, Q. (2022). A deep learning method for predicting metabolite–disease associations via graph neural network. Brief. Bioinform. 23:bbac266. doi: 10.1093/bib/bbac266

PubMed Abstract | CrossRef Full Text | Google Scholar

Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71, 209–249. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, Q., Nie, F., Zhao, Q., and Chen, W. (2022). A merged molecular representation deep learning method for blood-brain barrier permeability prediction. Brief. Bioinform. 23:bbac357. doi: 10.1093/bib/bbac357

PubMed Abstract | CrossRef Full Text | Google Scholar

van Griethuysen, J. J. M., Fedorov, A., Parmar, C., Hosny, A., Aucoin, N., Narayan, V., et al. (2017). Computational Radiomics system to decode the radiographic phenotype. Cancer Res. 77, e104–e107. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Z., Wang, Q., Wang, X., Zhu, L., Chen, J., Zhang, B., et al. (2019). Gut microbial dysbiosis is associated with development and progression of radiation enteritis during pelvic radiotherapy. J. Cell. Mol. Med. 23, 3747–3756. doi: 10.1111/jcmm.14289

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, C. C., Han, C. D., Zhao, Q., and Chen, X. (2021). Circular RNAs and complex diseases: from experimental results to computational models. Brief. Bioinform. 22:bbab286. doi: 10.1093/bib/bbab286

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, W., Zhang, L., Sun, J., Zhao, Q., and Shuai, J. (2022). Predicting the potential human lncRNA-miRNA interactions based on graph convolution network with conditional random field. Brief. Bioinform. 23:bbac463. doi: 10.1093/bib/bbac463

PubMed Abstract | CrossRef Full Text | Google Scholar

Yeung, A. R., Pugh, S. L., Klopp, A. H., Gil, K. M., Wenzel, L., Westin, S. N., et al. (2020). Improvement in patient-reported outcomes with intensity-modulated radiotherapy (RT) compared with standard RT: a report from the NRG oncology RTOG 1203 study. J. Clin. Oncol. 38, 1685–1692. doi: 10.1200/JCO.19.02381

PubMed Abstract | CrossRef Full Text | Google Scholar

Yushkevich, P. A., Piven, J., Hazlett, H. C., Smith, R. G., Ho, S., Gee, J. C., et al. (2006). User-guided 3D active contour segmentation of anatomical structures: significantly improved efficiency and reliability. Neuroimage 31, 1116–1128. doi: 10.1016/j.neuroimage.2006.01.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Zelefsky, M. J., Levin, E. J., Hunt, M., Yamada, Y., Shippy, A. M., Jackson, A., et al. (2008). Incidence of late rectal and urinary toxicities after three-dimensional conformal radiotherapy and intensity-modulated radiotherapy for localized prostate cancer. Int. J. Radiat. Oncol. Biol. Phys. 70, 1124–1129. doi: 10.1016/j.ijrobp.2007.11.044

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, L., Yang, P., Feng, H., Zhao, Q., and Liu, H. (2021). Using network distance analysis to predict lncRNA–miRNA interactions. Interdisciplin. Sci. Comput. Life Sci. 13, 535–545. doi: 10.1007/s12539-021-00458-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhong, L., Dong, D., Fang, X., Zhang, F., Zhang, N., Zhang, L., et al. (2021). A deep learning-based radiomic nomogram for prognosis and treatment decision in advanced nasopharyngeal carcinoma: a multicentre study. EBioMedicine 70:103522. doi: 10.1016/j.ebiom.2021.103522

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: radiomics, nomogram, radiation proctitis, SHapley Additive exPlanation (SHAP), microbiota

Citation: Wei C, Xiang X, Zhou X, Ren S, Zhou Q, Dong W, Lin H, Wang S, Zhang Y, Lin H, He Q, Lu Y, Jiang X, Shuai J, Jin X and Xie C (2023) Development and validation of an interpretable radiomic nomogram for severe radiation proctitis prediction in postoperative cervical cancer patients. Front. Microbiol. 13:1090770. doi: 10.3389/fmicb.2022.1090770

Received: 06 November 2022; Accepted: 28 December 2022;
Published: 12 January 2023.

Edited by:

Liang Wang, Guangdong Provincial People's Hospital, China

Reviewed by:

Nguyen Quoc Khanh Le, Taipei Medical University, Taiwan
Yuan Zhu, China University of Geosciences Wuhan, China
Yibao Zhang, Peking University, China
Sai Ho Ling, University of Technology Sydney, Australia

Copyright © 2023 Wei, Xiang, Zhou, Ren, Zhou, Dong, Lin, Wang, Zhang, Lin, He, Lu, Jiang, Shuai, Jin and Xie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xiance Jin, ✉ amlueGMxOTc5QGhvdG1haWwuY29t; Congying Xie, ✉ d3p4aWVjb25neWluZ0AxNjMuY29t

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.