- 1Institute of Biomedical and Health Engineering, Chinese Academy of Sciences Shenzhen Institutes of Advanced Technology, Shenzhen, China
- 2Department of Clinical Oncology, University of Hong Kong-Shenzhen Hospital, Shenzhen, China
- 3Department of Clinical Oncology, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
- 4Biomedical Engineering, Shenzhen Polytechnic, Shenzhen, China
- 5University Hospitals/Cleverland Medical Center, Seidman Cancer Center and Case Western Reserve University, Cleveland, OH, United States
Radiation-induced lymphopenia is known for its survival significance in patients with breast cancer treated with radiation therapy. This study aimed to evaluate the impact of radiotherapy on lymphocytes by applying machine learning strategies. We used Extreme Gradient Boosting (XGboost) to predict the event of lymphopenia (grade≥1) and conduced an independent validation. Then, we induced feature attribution analysis (Shapley additive explanation, SHAP) in explaining the XGboost models to explore the directional contribution of each feature to lymphopenia. Finally, we implemented the proof-of-concept clinical validation. The results showed that the XGboost models had rigorous generalization performances (accuracies 0.764 and ROC-AUC 0.841, respectively) in the independent cohort. The baseline lymphocyte counts are the most protective feature (SHAP = 5.226, direction of SHAP = -0.964). Baseline platelets and monocytes also played important protective roles. The usage of taxane only chemotherapy was less risk on lymphopenia than the combination of anthracycline and taxane. By the contribution analysis of dose, we identified that firstly lymphocytes were sensitive to a radiation dose less than 4Gy; secondly the irradiation volume was more important in promoting lymphopenia than the irradiation dose; thirdly the irradiation dose promoted the event of lymphopenia when the irradiation volume was fixed. Overall, our findings paved the way to clarifying the radiation dose volume effect. To avoid radiation-induced lymphopenia, irradiation volume should be kept to a minimum during the planning process, as long as the target coverage is not compromised.
Introduction
The biological effects of radiation exposure on the immune system are double-edged. It has an immunostimulatory effect by promoting the release of tumor antigens (1), radiation-induced neoantigens (2) and chemokine that recruit effector cells into the tumor microenvironment (3). On the other side, radiation has the potential for direct cytotoxicity toward immune cells, especially lymphocytes, which are the most radiosensitive (4). Among human peripheral blood lymphocytes, T helper cells, cytotoxic T cells, and B cells display a radiosensitive phenotype (5). In the treatment of solid tumors, lymphopenia is a common side effect of radiotherapy known for decades (6).
Radiation-induced lymphopenia is associated with inferior clinical outcomes in a wide variety of solid malignancies (7, 8), and more importantly, it is associated with inferior survival outcomes (9–12). For example, total lymphocyte counts < 100 cells/mm3 were associated with poor overall survival in patients with locally advanced cervical cancer (13); total lymphocyte count < 500 cells/mm3 at 2 months was associated with short overall survival outcomes and was an independent predictor for survival in elderly patients with glioblastoma (10). Similarly, lymphopenia was an independent predictor of inferior survival in locally advanced pancreatic cancer (11). In patients with breast cancer, the five-year disease-free survival was significantly lower in patients with a ratio of lymphocyte nadir to pre-treatment lymphocyte less than 0.8 (14). Nevertheless, baseline characteristics associated with radiation-induced lymphopenia have not been thoroughly evaluated. Given its clinical implications, it is necessary to identify those baseline characteristics to predict radiation-induced lymphopenia.
Recently, some studies have investigated and modeled the significant effects of radiation dose on radiation-induced lymphopenia. In esophageal cancer patients who underwent radiotherapy, thoracic vertebral (TV) volume spared of 5-40Gy was significantly associated with higher lymphocyte nadirs (P<0.05) (15). Yovino et al. (16) demonstrated the lymphotoxic impact of conventionally fractionated brain radiotherapy for high-grade gliomas. They established that after 30 fractions of radiotherapy, 99% of the circulating lymphocyte received ≥0.5Gy. Furthermore, our group identified a model of the effective radiation dose to the circulating immune cells (EDIC) in patients with advanced esophageal squamous cell carcinoma treated with trimodality therapy (17, 18). EDIC negatively correlated with lymphocyte nadir. Overall, radiation therapy was associated with lymphopenia in patients with different solid tumors. However, the association between breast cancer and radiation-induced lymphopenia is less well-studied.
In this study, we generated Extreme Gradient Boosting (XGboost) (19) models in a machine learning framework to identify the impact of radiation dose on circulating lymphocytes in patients with breast cancer. XGboost is the specific implementation of the gradient boosting algorithm. It employs more accurate approximations to find the best models and an advanced regularization technique, enhancing model training speed and generalization and reducing model complexity (20). Next, we sought to understand the associations between lymphopenia and baseline features, including baseline blood counts, clinical and tumor characteristics, treatment regimens and especially radiation dose, in patients with breast cancer who underwent radiation therapy. Therefore, we assessed the relative importance and direction of feature contribution to lymphopenia in XGboost models via Shapley additive explanation (SHAP) approaches (21, 22). Rigorous quality validations were implemented, including cross-validation, bootstrapping and Proof-of-concept clinical validation.
Materials and Methods
Description of Cohorts
Lymphopenia (grade ≥1) was defined from the Common Terminology Criteria for Adverse Events, version 4.0 (CTCAE v4.0). Patients with breast cancer who received adjuvant radiation therapy from March 2015 to October 2019 at the University of Hong Kong-Shenzhen Hospital formed the study population (Testing cohort). The eligibility criteria were: pathologically confirmed invasive breast cancer, received adjuvant radiation therapy, aged 18-years old or above, peripheral lymphocyte counts evaluated within 7 days after the end of radiation therapy in the same hospital. Exclusion criteria included patients with non-invasive breast cancer (stage 0), stage IV or recurrent breast cancer, breast lymphoma, and underlying autoimmune diseases. Patients who underwent surgery and chemotherapy in other hospitals were eligible if they received the whole course of radiotherapy at the University of Hong Kong-Shenzhen Hospital.
Another independent prospective validation cohort included patients between November 2019 and December 2020. Patients with breast cancer who underwent radiation therapy with the same criteria as the Testing cohort were selected from a prospective observational study of the Bio-Imaging Repository Databank (BIRD) project at the University of Hong Kong-Shenzhen Hospital.
Radiation Therapy Procedures
Radiotherapy techniques included 2-field tangential opposing technique (2D-fields), tangential opposing fields with an anterior SCF field (three-dimensional conformal technique, 3D-fields) and RapidArc (Varian Medical Systems, Palo Alto, CA, USA). CT scans from the skull base to the level of the first lumbar vertebra were obtained in this study. The 2D-fields technique was normally employed on patients who needed irradiation only of the breast. 3D-fields technique was usually used on patients who needed irradiation of the breast or chest wall and supraclavicular fossa or axillary fossa. RapidArc, a volume modulated arc therapy technique, was employed on patients with invasive breast cancer with N3 or N2 diseases with centrally or medially located primary tumors.
Feature Groups
The “clinical data” feature group included age, family disease (without, with), smoking (without, with) and drinking (without, with), and menopausal (pre, peri, post). All participants are females.
The “tumor characteristics” feature group included modified N stage (0, more than 0), modified stage (I, II, III), tumor sides (right, left), tumor size, estrogen receptors (ER) (negative, positive), progesterone receptors (PR) (negative, positive), human epidermal growth factor receptor 2 (HER2) (negative, positive), immunohistochemistry (IHC) (HR+/HER2-, HR-/HER2+, HR+/HER2+, HR-/HER2-) and Ki67.
The “blood cells” feature group included the baseline counts of white blood cell (WBC), hemoglobin (Hb), platelet (PLT), neutrophil (ANC), lymphocyte (LYM) and monocyte (MON), which were measured within 7 days before radiation therapy.
The “radiotherapy” feature group included the total body dose, mean heart dose, maximum heart dose, V20 of ipsilateral lung, V5 of ipsilateral lung, mean ipsilateral lung, V20 of the bilateral lung, V5 of the bilateral lung, mean bilateral lung, radiotherapy technique (RapidArc, 2D-fields, 3D-fields), electron boost (none, 10Gy/5fx, 16Gy/8fx), RT fields (tangential breast only, breast or chest wall with regional lymphatics) and RT dose (40.5Gy/15fx, more than 50Gy/25fx). Irradiation dose and volume were extracted from a dose-volume histogram (DVH).
The “treatment” feature group included treatment regimens (breast-conserving therapy (BCT), modified radical mastectomy (MRM)), surgical treatment (Sentinel lymph node biopsy (SLNB), axillary lymph node dissection (ALND)), margin (clear, close or positive), chemotherapy strategy (none, neoadjuvant, adjuvant, both neoadjuvant and adjuvant), chemotherapy regimens (taxane, anthracycline+taxane, others), anti-HER2 therapy (without, with), endocrine therapy (without, with).
Predictive Models of the Lymphopenia Events
We established XGboost models by tenfold cross-validation (CV) framework via 100 bootstrapping iterations to predict radiation-induced lymphopenia. We used either all features or each feature group as input to build XGboost models and applied Lasso regressions for comparison. To estimate the explained prediction variances, we evaluated the predicting results using sensitivity, specificity, accuracy, f1-score, the area under the receiver operating characteristic curve (ROC-AUC) and the area under the precision-recall curve (PR-AUC). All samples were used in training and validation, considering XGboost’s abilities to handle the missing data. In contrast, those samples without missing data were used in training and validating the Lasso regressions.
In each bootstrapping iteration, we randomly selected 80% of patients and grouped them into one patient set. The patient set was separated into a training set and a testing set (ratio 7: 3) and performed the tenfold CV to train models. Next, AUC and its minimum value were used as hyperparameters for Lasso regression in the training set. The grid-search sets were used as hyperparameters for XGboost, i.e. learning_rate from 0.01 to 0.1 step 0.01, gamma from 0 to 5 step 1, max_depth from 3 to 6 step 1, scale_pos_weight from 1 to 2 step 0.2, subsample from 0.7 to 1 step 0.1, colsample_bytree from 0.7 to 1 step 0.1, min_child_weigth from 3 to 6 step 1, max_delta_step from 0 to 5 step 1, the other hyperparameters were set as defaults. Finally, we computed the model’s prediction on the remaining testing set.
Feature Attributions Analysis
To emphasize the predictive power of XGboost models, we used Shapley additive explanation (SHAP). SHAP values assign an importance value to each feature representing the effect on the model prediction. In brief, for a specific prediction, the SHAP value of a feature is defined as the change in the expected value of the model’s output when this feature is observed versus when it is missing.
We computed individual SHAP values using the module TreeExplainer (Python v.0.37.0). We summarized the mean absolute SHAP value across all instances, reflecting the mean effect of each feature on predicting the lymphopenia outcome and serving as a feature’s contribution measure. The bigger mean absolute SHAP value mentions the feature is more important in lymphopenia.
We further determined the directional mean absolute SHAP values by taking into account the mean value of Spearman correlations between individual SHAP values and corresponding outcomes via all iterations. The directional SHAP values more closing to 1 or -1 mentions that the feature promotes or protects the occurrence of lymphopenia more. The directional SHAP values has the same sign meaning as the odd ratio in the Lasso regression. Finally, we used a graphical layout (Cytoscape 3.7.2) in order to visualize the contributions of features (both mean absolute SHAP value and conditional SHAP values) in XGboost models for the event of lymphopenia.
Selection of Paired Patients
To study the contributions of the less important features that important features may overshadow, we selected the paired patients with controlled discrepancy of important features in the Testing cohort and Validation cohort separately. In this study, for every two patients a and b, the discrepancy was defined as the mean value of absolute relative differences in important features: , where i is the important feature number, n is the total important feature number, abs means absolution, f_a and f_b are the feature values in patients a and b, respectively, and f_max means the maximum of the feature. For each less important feature we need to study, the important features were defined as those with the SHAP values bigger than the less important features. Two patients with a discrepancy less than the threshold and different lymphopenia outcomes were considered one paired patient. In the selected paired patient cohort, we used paired t-test to assess the significance of the less important feature on the lymphopenia events.
Statistical Analysis
For all statistical analysis and prediction models we used R 3.6.1 (23) with the following packages: ggplot2 3.3.2, glmnet 4.0.2, caret 6.0.86, xgboost 1.2.0.1, Matrix 1.2.18; For SHAP value analysis we used Python 3.7.3 (24) with the following packages: pandas 0.24.2, numpy 1.18.5, sklearn 0.20.3, shap 0.37.0, xgboost 1.3.1. In univariate analysis, P values were corrected by Bonferroni correction. Statistical significance was defined as P values <0.05.
Results
Patients and Characteristics
The patient flowchart is shown in Figure 1A. In the Testing cohort, 589 patients with breast cancer who underwent radiation therapy were enrolled. We collected data about clinical characteristics, baseline and post-treatment blood test results, tumor characteristics, radiation dose and other therapy regimes (summarized in Table 1). All patients were females with a median age of 45 (1st to 3rd Qu: 39-51). The median lymphocyte count in patients before radiation therapy was 1530/µL (1st to 3rd Qu: 1200 to 1860), while it decreased to 950/µL (1st to 3rd Qu: 750 to 1170) after radiation therapy. A total of 340 (57.7%) patients had lymphopenia (grade≥1) after radiation therapy.
Table 1 The characteristics of breast cancer patients in Testing cohort (total 589 patients), continues features are shown as median (1st to 3rd quantile) and classified features are shown as numbers (percentage).
Figure 1 The study flowchart. (A) The patient flowchart; (B) the machine learning flowchart; (C) the statistical verification flowchart.
To validate the accuracy and robustness of our results, we adopted an independent prospective cohort (Validation cohort) enrolling 203 patients with breast cancer. The patients in the Validation cohort were also all females with a median age of 45 (1st to 3rd Qu: 39-51). The median lymphocyte count in patients before radiation therapy was 1500/µL (1st to 3rd Qu: 1230 to 1935), while it decreased to 1000/µL (1st to 3rd Qu: 700 to 1255) after radiation therapy. A total of 104 (51.2%) patients had lymphopenia (grade≥1) after radiation therapy. The characteristics of the Validation cohort were all summarized in Table S1.
Prediction of the Event of Lymphopenia
The flowchart for establishing the machine learning models is shown in Figure 1B. We trained XGboost and lasso regression in tenfold CV via 100 bootstrapping iterations in the Testing cohort to predict the binary classified lymphopenia. The models were validated in the Validation cohort across all iterations. The final XGboost models were development with the hyperparameters as follows: learning_rate=0.04, gamma=5, max_depth=3, scale_pos_weight=1.8, subsample=0.8, colsample_bytree=0.7, min_child_weigth =5, max_delta_step = 1.
In the full XGboost models, the main metrics to evaluate the classifying abilities are both accuracy (median: 0.781, 1st to 3rd Qu: 0.762 to 0.817) and ROC-AUC (median: 0.841, 1st to 3rd Qu: 0.822 to 0.856); the models were validated in Validation cohort for accuracy (median: 0.764, 1st to 3rd Qu: 0.753 to 0.774) and ROC-AUC (median: 0.841, 1st to 3rd Qu: 0.817 to 0.868), as shown in Figure 2. Other evaluation metrics in the Testing and Validation cohorts are shown in Figures S1–S4 and compared, sensitivity, specificity, F1-score, and PR-AUC. The XGboost models’ evaluation metrics and prediction abilities in the Validation cohort keep up with the Lasso regression’s metrics.
Figure 2 The XGboost and Lasso regression models for predicting the radiation-induced lymphopenia were trained in Testing cohort and validated in Validation cohort. (A) The top two trees in one example of XGboost model; (B) coefficients and P-values in one example of Lasso regression; (C, D) the classifying performances (accuracy and ROC-AUC, respectively) of XGboost models across all iterations in Testing cohort and in Validation cohort, compared with the Lasso regressions. In subplots (C, D), numerical labels are median values. The color represents the feature’s group, including: the full model (Orange), dosimetrics (blue), blood cell baselines (maroon), tumor features (green), Treatment regimens (Khaki), clinical features (yellow).
We also investigated the abilities of each feature group to predict lymphopenia. Radiation dose and baseline blood cells were the two important feature groups, as shown in Figure 2. In contrast, the other feature groups, including tumor characteristics, treatment regimens and clinical features, have fewer contributions to lymphopenia. These are similar to the results of lasso regressions, which are also shown in Figure 2 and Figures S1–S4.
Feature Attributions
The feature parameters, including both gain and frequency indexes, in the full XGboost models constructed via all iterations, are summarized in Table S2. In each XGboost model, the gain index represents the fractional contribution of each feature to the model based on the total gain of this feature’s splits; and the frequency index represents the relative number of times a feature has been used in sub-trees in one XGboost model. The higher gain and frequency indexes mean the more important predictive feature for predicting outcome. We also checked the features’ coefficients and P-values in Lasso regressions and their occurrence frequencies (Table S3). The top 10 important features for lymphopenia either in XGboost models or Lasso regressions are compared in Figure S5.
However, both Gain and frequency indexes in XGboost cannot show the directional contributions of each feature on the occurrence of lymphopenia, which is not similar to the meaning of coefficient in Lasso regressions. We analyzed the directional SHAP values of each feature in the XGboost models, illustrated in the method section. The directional SHAP values of features in XGboost models are visualized in Figure 3. The comparison of feature contributions to the occurrence of lymphopenia between the directional SHAP values in XGboost models and the coefficients in Lasso regressions is listed in Table 2.
Figure 3 The graphic layout of directional SHAP values of each feature classified in five feature groups: (A) baseline blood cells group (maroon), (B) radiation dose group (blue), (C) treatment regimens group (Khaki), (D) tumor characteristics group (green) and (E) clinical characteristics group (yellow). The event of lymphopenia is a gray square. Each feature node is colored by group and sized by the SHAP values; each edge is colored from red (positive) to blue (negative) by the direction value of SHAP and its thickness is sized by the absolute direction values of SHAP.
Table 2 The dummy features’ contributions to lymphopenia in Testing cohort (total 589 patients), including the mean SHAP value and direction of SHAP in XGboost models and the mean coefficients in Lasso regression across all iterations.
Baseline Blood Cells’ Attributions
As shown in Figure 3A, we considered the baseline blood cell counts’ contributions to the occurrence of lymphopenia in the XGboost and SHAP analysis. The baseline lymphocyte counts are negatively associated with lymphopenia events (SHAP = 5.226, direction of SHAP = -0.964). The higher the baseline lymphocyte level, the fewer lymphopenia events after treatment. Interestingly, baseline hemoglobin level promotes lymphopenia events (SHAP = 0.737, direction of SHAP = 0.378), while white blood cells, platelets and monocytes protected the patients from lymphopenia. We also found their directional contributions to the lymphopenia events were consistent with coefficients in Lasso regressions (Table 2)
Dose Attributions
We considered the contribution of the radiation dose to the occurrence of lymphopenia in XGboost models and SHAP analysis, as shown in Figure 3B. The total body dose was an important radiation dose leading to the lymphopenia event (SHAP = 4.021, direction of SHAP = 0.858). Followed by V5 of bilateral lungs (SHAP = 1.451, direction of SHAP = 0.886), V5 of the ipsilateral lung (SHAP = 1.151, direction of SHAP = 0.871), mean dose of bilateral lungs (SHAP = 0.905, direction of SHAP = 0.788), maximum heart dose (SHAP = 0.783, direction of SHAP = 0.187), mean heart dose (SHAP = 0.774, direction of SHAP = -0.377), mean dose of the ipsilateral lung (SHAP = 0.469, direction of SHAP = 0.654), V20 of the ipsilateral lung (SHAP = 0.383, direction of SHAP = 0.308), and V20 of bilateral lungs (SHAP = 0.15, direction of SHAP = 0.126), The order of SHAP value almost agrees with the feature attribution results (Figure S5 and Table S2).
Next, we extracted both irradiation doses and corresponding volumes of each radiation dose from the patient’s DVH curves. As shown in Figures 4A–D, we sorted both the irradiation volume and dose by their contributions to lymphopenia (SHAP values). The irradiation volumes are ordered ascendingly by SHAP values (log10(volume)~1.325*SHAP value, P value<1e-8). In contrast, the corresponding irradiation dose is in the descending order with SHAP values (log2(dose)~0.603*SHAP value, P value<1e-8). The regression information is shown in Figure S6. It can be illustrated that (point 1) lymphocytes are sensitive to an irradiation dose lower than 4Gy, because the integral dose of body (median: 4Gy, 1st-3rd Qu: 3.3-5Gy) is found to be the most important dose in lymphopenia; (point 2) the irradiation volume plays a significantly greater role than the irradiation dose, in promoting lymphopenia.
Figure 4 The relationships between SHAP value and irradiation volume in the Testing cohort (A) and in the Validation cohort (B); the relationships between SHAP value and irradiation dose in the Testing cohort (C) and in the Validation cohort (D). The regression relationships between irradiation volume and irradiation dose in the Testing cohort (E) and in the Validation cohort (F). Each radiation dose is differently colored and sorted by SHAP value.
We studied the relationship between irradiation dose and irradiation volume in both the Test and Validation cohort (Figures 4E, F). Irradiation volume is commonly positively correlated with irradiation dose. Notably, the irradiation volume is low when the mean heart dose is higher both in the Testing cohort and the Validation cohort that means the higher mean heart dose is related to the smaller irradiation range. According to the points 1 and 2 we summarized above, the protective role of mean heart dose against a lymphopenia event (SHAP = 0.774, direction of SHAP = -0.377) can be explained. That is, not the higher mean heart dose, but the smaller corresponding irradiation volume protects against a lymphopenia event.
In addition, the irradiation volume of the maximum heart dose is almost close to zero, and the positive SHAP value of the maximum heart dose (SHAP = 0.783, direction of SHAP = 0.187) mentions that the irradiation dose promotes the event of lymphopenia when the irradiation volume is fixed. Following the points above, it can be further illustrated that: (point 3) the irradiation dose promotes the occurrence of lymphopenia when the irradiation volume is controlled. Because of the negative correlations between irradiation volume and irradiation doe of heart, the associations between the mean/maximum heart dose and SHAP value do not follow the regression relationships, it can be compared between Figures S6, S7.
Other Features’ Attributions
Finally, we considered the contributions of other features to the occurrence of lymphopenia (Figures 3C–E), especially those with relatively high SHAP value and direction of SHAP. Notably, the chemotherapy regimens (anthracycline/taxane) were found to promote the occurrence of lymphopenia (SHAP = 0.368, direction of SHAP = 0.714). Conversely, the Taxane monotherapy chemotherapy regimens were less of risk factor (SHAP = 0.420, direction of SHAP = -0.672). These findings were consistent with the results in Lasso regressions, as shown in Table 2.
Proof-of-Concept Clinical Validation
We found that maximum heart dose and V20 of ipsilateral lung promoted the lymphopenia events (SHAP = 0.783, direction of SHAP = 0.187; SHAP = 0.383, direction of SHAP = 0.308, respectively). The mean heart dose protected the patients from lymphopenia (SHAP = 0.774, direction of SHAP = -0.377) (Table 2). However, these three radiation doses were never included in Lasso regressions. The mean heart dose promotes lymphopenia in the univariate logistic analysis in both the Testing cohort (Table 1) and the Validation cohort (Table S1).
The contradiction between mean heart dose in SHAP analysis and univariate logistic analysis might be caused by the multi-collinearity among radiation doses (Pearson’s correlations > 0.181, as shown in Figure S8). Moreover, the contradiction between these three radiation doses in SHAP analysis and the lasso regressions may be because of the shrinkage function in lasso regression, which means the highly correlated but less important features may not be selected in the regression model. Therefore, we did a proof-of-concept analysis in paired patients for these three radiation doses, as illustrated in the method section and shown in Figure 1C.
As shown in Figure 5, after controlling for the main features, the mean heart dose is higher in the patients without lymphopenia events (both P values < 0.05). In comparison, the maximum heart dose and V20 of the ipsilateral lung are significantly higher in the patients with lymphopenia events (all P values < 0.05). These results are all consistent with the SHAP value analysis, indicating that our XGboost models and directional mean absolute SHAP values rigorously revealed the features’ importance and their directional contribution to lymphopenia, especially those features were contradictory in univariate analysis or those features were never included in lasso regressions because of being masked by the highly correlated major features.
Figure 5 Paired t-test analysis in matched patients who with controlled discrepancy of important features. Subplots (A–C) are analyzed in the matched patients in Testing cohort, and subplots (D–F) are analyzed in the matched patients in Validation cohort. In the paired patients of each cohort, three boxplots (from left to right) were the comparisons of feature levels between with (blue box) and without (red box) lymphopenia in (A or D) maxim heart dose, (B or E) mean heart dose and (C or F) V20 of ipsilateral lung, respectively.
Discussion
To best of our knowledge, our study is the first to describe the diverse potential determinants, especially including the complex radiation dose, of radiation-induced lymphopenia in the patients with breast cancer by the machine learning algorithm (XGboost). Furthermore, XGboost and SHAP interpretation approach were combined to determine the predictive performances and the feature contributions in radiation-induced lymphopenia. We found that baseline lymphocyte counts protect against while the baseline hemoglobin level impact the event of radiation-induced lymphopenia; more importantly, we summarize some regularities between radiation dose and occurrence of lymphopenia, i.e. (1) lymphocytes are sensitive to an irradiation dose lower than 4Gy; (2) the irradiation volume plays a significantly greater role than the irradiation dose, in promoting lymphopenia; (3) the irradiation dose promotes the occurrence of lymphopenia when the irradiation volume is controlled.
The protective role of baseline lymphocyte counts in radiation-induce lymphopenia is consistent to the common acknowledgement. However, there were little previous works have correlated the hemoglobin and lymphopenia in radiation therapy. It is known for decades that cancer-related hypoxia and anemia are associated with decrease in radiosensitivity of tumor cells (25). In another words, it can be hypothesized that hypoxia and anemia might decrease the radiosensitivity of lymphocytes which deserves more studies in the future. For the protective effects of platelets and monocytes, frequent platelet donation is usually associated with T-cell lymphopenia in platelet donation studies (26, 27), which mentioned that possible casual correlations between platelet reduction and lymphopenia but the mechanisms were still unknown. The correlations between monocytes and lymphopenia were seldom studied, but macrophage, which differentiated from monocyte, leads to either radiosensitization or radioresistance depending on different tumor types or different radiation regimen studied, and various molecular players as NF-kB, MAPKs, p53, reactive oxygen species, and inflammasomes that have been involved in these processes (28), which might mention the monocyte reduction correlated to lymphopenia at some extend.
Considering the impact of irradiation dose on lymphopenia, the contribution of integral dose of the total body also has been presented in our previous work of EDIC formula containing integral dose of the total body resulted lymphopenia in 488 esophageal cancer patients (29); V5 of ipsilateral lung/bilateral lungs have been previously reported to impact lymphopenia in non-small cell lung cancer patients (30). Similar findings have been reported in patients with early-stage lung cancer, V10 to V50 were significantly negatively correlated with the decrease of absolute lymphocyte counts following radiation (31). In patients with pancreatic cancer, V10, V15, and V20 were significantly higher in patients with severe lymphopenia (32), and the same findings in patients with esophageal squamous cell carcinoma (33). For the contribution of irradiation volume at heart, our findings are consistent with our previous study in lung cancer patients, i.e. higher heart V5 (the heart volume in 5Gy) was significant with decrease in post-radiation lymphocyte counts (31). The results of these studies were all consistent with our findings.
We illustrated that the irradiation volume had a more significant lymphotoxic impact than the irradiation dose. The lymphocytes behave as mobile organs and circulate through the blood at about 1 cycle/min (34). Therefore, this may explain our novel findings that the higher the volume of radiotherapy dose delivered may result in a higher administration of radiotherapy to the circulating lymphocytes, thereby increasing the risk of radiation-induced lymphopenia.
The impact of chemotherapy on lymphopenia is known. Our findings is consistent with findings from study of Tolaney SM et al. who demonstrated that lymphopenia (grade 3 or grade 4) was associated with the combined use of adjuvant anthracycline and taxane regimens in three breast cancer patient cohorts (35). This study revealed that the taxane only chemotherapy seemed to have less risk on lymphopenia (OR 1.2, 95%CI 0.67-2.14, P-value>0.5, as shown in Table 1) in univariate analysis which was consistent with our previous study (36), similarly in multivariable analysis. Less lymphopenia effect from taxane monotherapy may be simply explained by the less damage from the combination of anthracycline and taxane. Of note, we previously noted that single-agent taxane treatment increased serum IL-2 levels in patients with advanced breast cancer (37). IL-2 serves as a cell-cycle progression signal for T lymphocytes, stimulating their proliferation and differentiation. The underpinning biology driving chemotherapy-induced lymphopenia is not fully understood, and this study further highlights the unmet need for future studies.
There are some limitations in this study. First, we used an independent prospective cohort with different population and study protocols but the same staffs who assembled these two cohorts. Our findings are needed to be verified with external cohorts in the future. Secondly, it is well known that inflammation indicators are also important in immune responses including lymphopenia, while the Testing cohort in this study were established without inflammation indictors. This limitation could be addressed in subsequent studies by measuring inflammation indicator levels in serum.
In conclusion, in patients with breast cancer who underwent radiotherapy, we found that the baseline lymphocyte, platelet and monocyte play protective roles in lymphopenia; the usage of taxane results in less impact on lymphopenia than the combination of an anthracycline with a taxane; all radiation doses promote the occurrence of lymphopenia except the mean heart dose. Especially for the contributions of complicated radiation dose on lymphopenia, we draw three conclusions: 1) lymphocytes are sensitive to an irradiation dose lower than 4Gy; 2) the irradiation volume plays a more important role in promoting the occurrence of lymphopenia than the irradiation dose; 3) the irradiation dose promotes the lymphopenia occurrence when the irradiation volume is controlled. Higher than the dose’s priority, irradiation volume should be kept as small as possible during the planning process to avoid radiation-induced lymphopenia as long as the target coverage is not compromised.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Ethics Statement
The studies involving human participants were reviewed and approved by the ethics committee of the University of Hong Kong-Shenzhen Hospital. The patients/participants provided their written informed consent to participate in this study.
Author Contributions
Conceptualization: HY, FC and F-MK; Data selection and clinical trials: FC and LY; Methodology: HY and YW; Funding acquisition: HY and F-MK; Project administration: K-OL and F-MK; Supervision: J-YJ; Writing – original draft: HY and FC; Writing – review and editing: K-OL, AH, and F-MK. All authors contributed to the article and approved the submitted version.
Funding
This project was supported in parts by Shenzhen Fundamental Research Program (No. JCYJ2020109150427184), Shenzhen Science and Technology Program (No. KQTD20180411185028798).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2022.768811/full#supplementary-material
References
1. Lin W, Xu Y, Chen X, Liu J, Weng Y, Zhuang Q, et al. Radiation-Induced Small Extracellular Vesicles as "Carriages" Promote Tumor Antigen Release and Trigger Antitumor Immunity. Theranostics (2020) 10(11):4871–84. doi: 10.7150/thno.43539
2. Martinez-Zubiaurre I, Chalmers AJ, Hellevik T. Radiation-Induced Transformation of Immunoregulatory Networks in the Tumor Stroma. Front Immunol (2018) 9:1679. doi: 10.3389/fimmu.2018.01679
3. Weichselbaum RR, Liang H, Deng L, Fu YX. Radiotherapy and Immunotherapy: A Beneficial Liaison? Nat Rev Clin Oncol (2017) 14(6):365–79. doi: 10.1038/nrclinonc.2016.211
4. Nakamura N, Kusunoki Y, Akiyama M. Radiosensitivity of CD4 or CD8 Positive Human T-Lymphocytes by an In Vitro Colony Formation Assay. Radiat Res (1990) 123(2):224–7. doi: 10.2307/3577549
5. Heylmann D, Rödel F, Kindler T, Kaina B. Radiation Sensitivity of Human and Murine Peripheral Blood Lymphocytes, Stem and Progenitor Cells. Biochim Biophys Acta (2014) 1846(1):121–9. doi: 10.1016/j.bbcan.2014.04.009
6. Newman GH, Rees GJ, Jones RS, Grove EA, Preece AW. Changes in Helper and Suppressor T Lymphocytes Following Radiotherapy for Breast Cancer. Clin Radiol (1987) 38(2):191–3. doi: 10.1016/S0009-9260(87)80032-6
7. Cho O, Chun M, Chang SJ, Oh YT, Noh OK. Prognostic Value of Severe Lymphopenia During Pelvic Concurrent Chemoradiotherapy in Cervical Cancer. Anticancer Res (2016) 36:3541–7.
8. Kitayama J, Yasuda K, Kawai K, Sunami E, Nagawa H. Circulating Lymphocyte is an Important Determinant of the Effectiveness of Preoperative Radiotherapy in Advanced Rectal Cancer. BMC Canc (2011) 11:64. doi: 10.1186/1471-2407-11-64
9. Davuluri R, Jiang W, Fang P, Xu C, Komaki R, Gomez DR, et al. Lymphocyte Nadir and Esophageal Cancer Survival Outcomes After Chemoradiation Therapy. Int J Radiat Oncol Biol Phys (2017) 99:128–35. doi: 10.1016/j.ijrobp.2017.05.037
10. Mendez JS, Govindan A, Leong J, Gao F, Huang J, Campian JL. Association Between Treatment-Related Lymphopenia and Overall Survival in Elderly Patients With Newly Diagnosed Glioblastoma. J Neurooncol (2016) 127(2):329–35. doi: 10.1007/s11060-015-2037-1
11. Wild AT, Ye X, Ellsworth SG, Smith JA, Narang AK, Garg T, et al. The Association Between Chemoradiation-Related Lymphopenia and Clinical Outcomes in Patients With Locally Advanced Pancreatic Adenocarcinoma. Am J Clin Oncol (2015) 38:259–65. doi: 10.1097/COC.0b013e3182940ff9
12. Venkatesulu BP, Mallick S, Lin SH, Krishnan S. A Systematic Review of the Influence of Radiation-Induced Lymphopenia on Survival Outcomes in Solid Tumors. Crit Rev Oncol Hematol (2018) 123:42–51. doi: 10.1016/j.critrevonc.2018.01.003
13. Wu ES, Oduyebo T, Cobb LP, Cholakian D, Kong X, Fader AN, et al. Lymphopenia and its Association With Survival in Patients With Locally Advanced Cervical Cancer. Gynecol Oncol (2016) 140(1):76–82. doi: 10.1016/j.ygyno.2015.11.013
14. Sun GY, Wang SL, Song YW, Jin J, Wang WH, Liu YP, et al. Radiation-Induced Lymphopenia Predicts Poorer Prognosis in Patients With Breast Cancer: A Post Hoc Analysis of a Randomized Controlled Trial of Postmastectomy Hypofractionated Radiation Therapy. Int J Radiat Oncol Biol Phys (2020) 108(1):277–85. doi: 10.1016/j.ijrobp.2020.02.633
15. Anderson JL, Newman NB, Anderson C, Sherry AD, Yock AD, Osmundson EC. Mean Cardiopulmonary Dose and Vertebral Marrow Dose Differentially Predict Lineage-Specific Leukopenia Kinetics During Radiotherapy for Esophageal Cancer. Radiother Oncol (2020) 152:169–76. doi: 10.1016/j.radonc.2019.12.008
16. Yovino S, Kleinberg L, Grossman SA, Narayanan M, Ford E. The Etiology of Treatment-Related Lymphopenia in Patients With Malignant Gliomas: Modeling Radiation Dose to Circulating Lymphocytes Explains Clinical Observations and Suggests Methods of Modifying the Impact of Radiation on Immune Cells. Cancer Invest (2013) 31(2):140–4. doi: 10.3109/07357907.2012.762780
17. Jin JY, Mereniuk T, Yalamanchali A, Wang W, Machtay M, Spring Kong FM, et al. A Framework for Modeling Radiation Induced Lymphopenia in Radiotherapy. Radiother Oncol (2020) 144:105–13. doi: 10.1016/j.radonc.2019.11.014
18. So TH, Chan SK, Chan WL, Choi H, Chiang CL, Lee V, et al. Lymphopenia and Radiation Dose to Circulating Lymphocytes With Neoadjuvant Chemoradiation in Esophageal Squamous Cell Carcinoma. Adv Radiat Oncol (2020) 5(5):880–8. doi: 10.1016/j.adro.2020.03.021
19. Borstelmann SM. Machine Learning Principles for Radiology Investigators. Acad Radiol (2020) 27(1):13–25. doi: 10.1016/j.acra.2019.07.030
20. Chen T, Guestrin C. Xgboost: A Scalable Tree Boosting System. In: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining. San Francisco. (2016). KDD’16 p. 785–94. Available at: https://arxiv.org/abs/1603.02754.
21. Bi Y, Xiang D, Ge Z, Jia C, Song J. An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGboost and SHAP. Mol Ther (2020) 22(4):362–72. doi: 10.1016/j.omtn.2020.08.022
22. Lee S-I, Lundberg S. A Unified Approach to Interpreting Model Predictions. In: NIPS. California (2017). NlPS. Available at: https://arxiv.org/abs/1705.07874.
23. R Core Team. R: A Language and Environment for Statistical Computing. In: R Foundation for Statistical Computing. Austria: Vienna (2018). Available at: https://www.R-project.org/.
24. van Rossum G. Python Tutorial, Technical Report CS-R9526, Centrum Voor Wiskunde En Informatica (CWI). Amsterdam: Centrum voor Wiskunde en Informatica (1995).
25. Harrison L, Blackwell K. Hypoxia and Anemia: Factors in Decreased Sensitivity to Radiation Therapy and Chemotherapy? Oncologist (2004) 9(Suppl 5):31–40. doi: 10.1634/theoncologist.9-90005-31
26. Gansner JM, Rahmani M, Jonsson AH, Fortin BM, Brimah I, Ellis M, et al. Plateletpheresis-Associated Lymphopenia in Frequent Platelet Donors. Blood (2019) 133(6):605–14. doi: 10.1182/blood-2018-09-873125
27. Zhao J, Gabriel E, Norda R, Höglund P, Baden L, Diedrich BA, et al. Frequent Platelet Donation is Associated With Lymphopenia and Risk of Infections: A Nationwide Cohort Study. Transfusion (2021) 61(2):464–73. doi: 10.1111/trf.16175
28. Wu Q, Allouch A, Martins I, Modjtahedi N, Deutsch E, Perfettini JL. Macrophage Biology Plays a Central Role During Ionizing Radiation-Elicited Tumor Response. BioMed J (2017) 40(4):200–11. doi: 10.1016/j.bj.2017.06.003
29. Xu C, Jin J-Y, Zhang M, Liu A, Wang J, Mohan R, et al. The Impact of the Effective Dose to Immune Cells on Lymphopenia and Survival of Esophageal Cancer After Chemoradiotherapy. Radiother Oncol (2020) 146:180–6. doi: 10.1016/j.radonc.2020.02.015
30. Tang C, Liao Z, Gomez D, Levy L, Zhuang Y, Gebremichael RA, et al. Lymphopenia Association With Gross Tumor Volume and Lung V5 and its Effects on non-Small Cell Lung Cancer Patient Outcomes. Int J Radiat Oncol Biol Phys (2014) 89(5):1084–91. doi: 10.1016/j.ijrobp.2014.04.025
31. Zhao Q, Li T, Chen G, Zeng Z, He J. Prognosis and Risk Factors of Radiation-Induced Lymphopenia in Early-Stage Lung Cancer Treated With Stereotactic Body Radiation Therapy. Front Oncol (2020) 9:1488. doi: 10.3389/fonc.2019.01488
32. Chadha AS, Liu G, Chen HC, Das P, Minsky BD, Mahmood U, et al. Does Unintentional Splenic Radiation Predict Outcomes After Pancreatic Cancer Radiation Therapy? Int J Radiat Oncol Biol Phys (2017) 97(2):323–32. doi: 10.1016/j.ijrobp.2016.10.046
33. Xu H, Lin M, Hu Y, Zhang L, Li Q, Zhu J, et al. Lymphopenia During Definitive Chemoradiotherapy in Esophageal Squamous Cell Carcinoma: Association With Dosimetric Parameters and Patient Outcomes. Oncologist (2021) 26(3):e425–34. doi: 10.1002/onco.13533
34. Abravan A, Faivre-Finn C, Kennedy J, McWilliam A, van Herk M. Radiotherapy-Related Lymphopenia Affects Overall Survival in Patients With Lung Cancer. J Thorac Oncol (2020) 15(10):1624–35. doi: 10.1016/j.jtho.2020.06.008
35. Tolaney SM, Najita J, Winer EP, Burstein HJ. Lymphopenia Associated With Adjuvant Anthracycline/ Taxane Regimens. Clin Breast Canc (2008) 8(4):352–6. doi: 10.3816/CBC.2008.n.041
36. Fang C, Hao Y, Hong Z, Yaqing N, Qian W, Haiman J, et al. Risk Factors for Radiation Induced Lymphopenia in Patients With Breast Cancer Receiving Adjuvant Radiotherapy. Ann Trans Med (2021) 9(16):1288–302. doi: 10.21037/atm-21-2150
Keywords: radiation-induced lymphopenia, breast cancer, radiation dose, machine learning, SHapley Additive exPlanation
Citation: Yu H, Chen F, Lam K-O, Yang L, Wang Y, Jin J-Y, EI Helali A and Kong F-M(S) (2022) Potential Determinants for Radiation-Induced Lymphopenia in Patients With Breast Cancer Using Interpretable Machine Learning Approach. Front. Immunol. 13:768811. doi: 10.3389/fimmu.2022.768811
Received: 01 September 2021; Accepted: 23 May 2022;
Published: 21 June 2022.
Edited by:
Mohammad Hojjat-Farsangi, Karolinska Institutet (KI), SwedenReviewed by:
Matsumoto Kotaro, Kurume University, JapanLetizia Deantonio, Oncology Institute of Southern Switzerland (IOSI), Switzerland
Aysan Namdar, Tabriz University of Medical Sciences, Iran
Copyright © 2022 Yu, Chen, Lam, Yang, Wang, Jin, EI Helali and Kong. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Feng-Ming (Spring) Kong, a29uZzAwMDFAaGt1Lmhr
†These authors have contributed equally to this work and share first authorship