- 1Clinical Oncology Center, The University of Hong Kong - Shenzhen Hospital, Shenzhen, China
- 2Shenzhen Key Laboratory of Translational Research on Recurrent/Metastatic Cancer, The University of Hong Kong - Shenzhen Hospital, Shenzhen, China
- 3Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- 4Department of Pathology, Zhujiang Hospital, Southern Medical University, Guangzhou, China
Background/purpose: Severe lymphopenia during pelvic radiotherapy (RT) predicts poor survival in patients with cervical cancer. However, the risk of severe lymphopenia has not been well predicted. We developed a machine learning model using clinical and dosimetric information to predict grade 4 (G4) lymphopenia during pelvic RT in patients with cervical cancer.
Methods: This retrospective study included cervical cancer patients treated with definitive pelvic RT ± induction/concurrent chemotherapy. Clinical information and a set of dosimetric parameters of external beam radiotherapy plan were collected. G4 lymphopenia during RT, which was also referred to as G4 absolute lymphocyte count (ALC) nadir, was defined as ALC nadir <0.2 × 109 cells/L during RT according to Common Terminology Criteria for Adverse Events (CTCAE) v4.03. Elastic-net logistic regression models were constructed for the prediction of G4 lymphopenia during pelvic RT using a repeated cross-validation methodology.
Results: A total of 130 patients were eligible, and 43 (33.1%) patients had G4 lymphopenia during RT. On multivariable analysis, G4 ALC nadir was associated with poor overall survival (OS) [hazard ratio (HR), 3.91; 95% confidence interval (CI), 1.34–11.38, p = 0.01]. Seven significant factors [Eastern Cooperative Oncology Group (ECOG) performance score, pre-RT hemoglobin, pre-RT lymphocytes, concurrent chemotherapy, gross tumor volume of regional lymphadenopathy (GTV_N volume), body volume, and maximum dose of planning target volume receiving at least 55 Gy (PTV_5500 Dmax)] were obtained by elastic-net logistic regression models and were included in the final prediction model for G4 ALC nadir. The model’s predicting ability in test set was area under the curve (AUC) = 0.77 and accuracy = 0.76. A nomogram of the final predicting model was constructed.
Conclusions: This study developed and validated a comprehensive model integrating clinical and dosimetric parameters by machine learning method, which performed well in predicting G4 lymphopenia during pelvic RT for cervical cancer and will facilitate physicians to identify patients at high risk of G4 lymphopenia who might benefit from modified treatment approaches.
1. Introduction
Cervical cancer is the fourth most frequently diagnosed cancer and the fourth leading cause of cancer death in women (1). Pelvic radiotherapy (RT) plays an integral part in the treatment of locally advanced cervical cancer (2), but it can also result in toxicities, including effects on host immunity. A higher radiation dose to immune cells was reported to be associated with poor treatment outcomes in patients with non-small cell lung cancer (NSCLC) (3). Lymphocytes, one of the most important components of the immune system, are especially critical in mediating cellular immunity against malignant tumor cells. In cervical cancer patients treated with concurrent chemoradiotherapy (CCRT), the incidence of grade 3 (G3) and grade 4 (G4) lymphopenia during CCRT, graded by Common Terminology Criteria for Adverse Events (CTCAE), reached as high as 73% and 16%, respectively, and G4 lymphopenia was associated with poor survival (4). Although it has clinical significance, the risk of G4 lymphopenia has not been well predicted in cervical cancer patients.
Many factors were reported to be associated with lymphopenia during RT. Radiation per se is among the most important risk factors for lymphopenia because lymphocytes continuously traverse the irradiated field and are extremely sensitive to radiation (5). The modeled RT dose to peripheral lymphocytes were associated with lymphopenia in patients treated with RT (6). Radiation field size, dose per fraction, and fraction number are all correlated with risk of lymphopenia (7). Dose–volume parameter (volume receiving at least 40 Gy) of the pelvic bone marrow was associated with a higher risk of acute G3 [odds ratio (OR)=1.018] or late grade 2 (G2) lymphopenia (OR=1.005) in prostate cancer patients treated with RT (8). The RT dose to the large blood vessels, bone, and whole body were also correlated with lymphopenia (6). Besides these dose–volume parameters, our previous study demonstrated that the International Federation of Gynecology and Obstetrics (FIGO) stage, pre-treatment lymphocyte, and pre-treatment hemoglobin were significantly associated with lymphopenia during CCRT in cervical cancer patients. Other studies showed that baseline lymphocyte had an important role in predicting lymphopenia during RT (8, 9). Integrating both dosimetric and clinical information might improve the prediction performance for lymphopenia.
Machine learning (ML), one of the most relevant subsets of artificial intelligence (AI) in medicine, mainly focuses on making as accurate predictions as possible. Compared with traditional statistical methods, ML could be more suited in highly innovative fields with a huge bulk of data (10). By using deep ML method to integrate dosimetric and clinical information, Cong Zhu et al. (9) developed a model to predict G4 RT-induced lymphopenia in patients with esophageal carcinoma with area under the curve (AUC) at 0.831, accuracy at 0.769, and precision at 0.670.
At present, ML method has not been widely applied in the prediction of G4 lymphopenia during pelvic RT for cervical cancer. By integrating both clinical factors and a set of dosimetric parameters, this study aimed to build an ML model to predict G4 lymphopenia during pelvic RT in patients with cervical cancer, with the hope to aid the physician’s decision-making process in clinical practice.
2. Materials and methods
2.1 Patients
This study was approved by the institutional ethics committee of the University of Hong Kong-Shenzhen Hospital [No (2022).020], and informed consent form from each patient for this study was waived. A cohort of patients diagnosed with cervical carcinoma from January 2015 to February 2021 in the University of Hong Kong-Shenzhen Hospital was selected for this study. Patients were included if they met the following criteria: 1) ≥18 years old; 2) newly diagnosed, pathology-confirmed cervical carcinoma; 3) FIGO stage (2018) IB-IVB (only stage IVB with oligo-metastases scheduled for radical pelvic RT were included); 4) major treatment was external beam radiotherapy (EBRT) followed by brachytherapy (BT) or stereotactic body radiotherapy (SBRT, if BT was contraindicated or declined) with or without induction or concurrent chemotherapy; and (5) complete blood counts (CBCs) were tested before and weekly during RT. Patients were excluded if they had the following: 1) cervical small cell carcinoma; 2) concomitant secondary primary malignant tumor; 3) acquired immune deficiency syndrome (AIDS); 4) pelvic RT in recurrent or adjuvant settings; and 5) did not complete planned EBRT.
2.2 Radiation therapy
All patients received pelvic EBRT followed by BT or SBRT. The simulation computed tomography (CT) scans for EBRT were taken with 3-mm slices from the interspace between thoracic vertebra 9 and 10 to the upper one-third of the femur. EBRT techniques were RapidArc or three-dimensional conformal radiotherapy (3D-CRT). For RapidArc, gross tumor volume (GTV) of the primary tumor (P) and regional pathological lymph nodes (N) detected by physical examination, simulation CT, pelvis magnetic resonance imaging (MRI), or positron emission tomography (PET)/CT were denoted as GTV_P and GTV_N, respectively. CTV_4500 [clinical target volume (CTV) receiving prescribed dose of ≥45Gy], including cervix, bilateral parametrium, uterus, part of vagina, and pelvic lymphatics; CTV_5500 (CTV receiving prescribed dose of ≥55 Gy), pelvic GTV_N + 3 mm margin; CTV_5750 (CTV receiving prescribed dose of ≥57.5 Gy), retroperitoneal GTV_N + 3 mm margin; PTV_4500, PTV_5500, and PTV_5750 [planning target volume (PTV) receiving prescribed doses of ≥45, ≥55, and ≥57.5 Gy], CTV_4500, CTV_5500, and CTV_5750 + 5 mm margin, respectively. Prescription dose was delivered as 45 Gy in 25 fractions (45 Gy/25 Fr) to PTV_4500 with a simultaneous integrated boost (SIB) of 55 Gy to PTV_5500 or 57.5 Gy to PTV_5750. For 3D-CRT, two sequential phases were adopted: 45 Gy/25 Fr to whole pelvis as phase I; boosting to pelvic wall with 16 Gy/8 Fr for FIGO IIIB or 10 Gy/5 Fr for other stages as phase II. All EBRT was delivered daily, five fractions per week. CT- or MRI-guided BT was started 3–4 weeks after the initiation of EBRT with 192Ir (iridium) high-dose rate, once a week for a total of 4 weeks. Cumulative equivalent doses in 2 Gy/Fr (EQD2) of > 84 Gy for stage IB–IIIA and >90 Gy for ≥ stage IIIB were set to cervical primary tumor.
2.3 Chemotherapy
Concurrent cisplatin (40 mg/m2) was given weekly during EBRT for up to 5–6 weeks. If creatinine clearance ≤50 ml/min, carboplatin at the dose of AUC = 2 mg/ml/min was given weekly as an alternative. Induction chemotherapy (IC) with paclitaxel and carboplatin was given if anticipated RT waiting time exceeded about 3 weeks. For patients aged over 70 or with FIGO stage IB1, no chemotherapy was recommended.
2.4 End points and dose–volume histogram metrics
Absolute lymphocyte count (ALC) was measured as ×109 cells/L and graded by CTCAE v4.03. G4 lymphopenia during RT, which was also referred to as G4 ALC nadir, was defined as ALC nadir < 0.2×109 cells/L during RT. Progression-free survival (PFS) was the time between the initiation of RT and the date of disease progression or death from any cause. Overall survival (OS) was the time between the initiation of RT and the date of death from any cause. Dose–volume histogram (DVH) metrics of both tumor targets and organs at risk (OARs) during EBRT were extracted directly from the Varian eclipse treatment planning system (version15.0, External Beam Planning, Varian) with anisotropic analytical algorithm. The tumor targets of interest included GTV_P, GTV_N, CTV_4500, PTV_4500, and PTV_5500. The OARs of interest included body (defined as the part of body within the range of simulation CT scan for EBRT) and bones (defined as bones within 2 cm beyond PTV). For each structure, the whole volume [in cubic centimeter (cc)], maximum dose (Dmax, in Gy), mean dose (Dmean, in Gy), and the percentage of the whole volume receiving ≥5, ≥10, ≥20, ≥30, ≥40, and ≥45 Gy (denoted as V5, V10, V20, V30, V40, and V45, respectively) were extracted.
2.5 Univariate and multivariable analysis
For the outcomes of OS and PFS, G4 ALC nadir and all clinical characteristics were analyzed by the univariate and multivariable Cox proportional hazards regression (cox-PH) models. Kaplan–Meier product-limit estimates with time-to-event curves were generated. To classify factors associated with G4 ALC nadir, all clinical characteristics and DVH metrics from tumor targets and OARs were analyzed between patient groups with or without G4 ALC nadir by the univariate logistic regression method.
2.6 Elastic-net logistic regression modeling
Elastic-net logistic regression is a type of penalized logistic regression (11, 12). Elastic-net uses both L1 and L2 norm penalty on the regression covariates and uses a mixing parameter that defines the proportion (alpha parameter) of penalty applied to the covariates between both L1 and L2 norms. Taken together, the elastic-net regression method allows retention of correlated covariates and also regularizes model predictors in a manner that allows for improved prediction performance. The risk factors selected from clinical characteristics and DVH metrics by elastic-net logistic regression models were applied to construct the multivariable logistic regression model.
Elastic-net logistic regression models were constructed for G4 ALC nadir prediction using a repeated cross-validation (CV) methodology to approximate the models’ generalization abilities when lacking an external validation dataset (13, 14). To determine the important features for G4 ALC nadir by elastic-net logistic regression models, we selected the best alpha parameter in one randomly separated train set as the first step; then, one elastic-net model was established in the train set and validated in the rest of the test set in 10-fold CV; finally, the 10-fold CV process was repeated 100 times in different held-out sets to estimate model mean efficacy [95% confidence interval (CI)], which is called repeated CV, considering to reduce overfitting in the small sample size. The statistically significant features were selected as the important features for G4 ALC nadir.
2.7 Statistical considerations
The Wilcoxon paired rank test was applied to compare the performances between two models. A p-value <0.05 was considered statistically significant in all statistical analysis. The Bonferroni correction was applied in multiple statistical testing. R software (version 4.0.2, R Development Core Team, Vienna, Austria) was used to conduct all statistical analyses. Elastic-net logistic regression modeling was implemented by the R package glmnet.
3. Results
3.1 Patient characteristics and clinical outcomes
Both pre-RT characteristics and clinical outcomes of the patients are listed in Table 1. A total of 130 patients formed the study cohort. The median age at diagnosis was 53 [interquartile range (IQR), 46–63] years. RapidArc was used in 79.2% of the patients. Twenty percent of patients had IC, and 83.8% received concurrent chemotherapy. The median (IQR) follow-up was 26.4 (14.2–41.6) months. The incidence of death, disease progression, local failure, regional lymph node metastasis, and distant metastasis during follow-up was 19.2%, 24.6%,11.5%, 3.8%, and 15.4%, respectively.
ALC of all patients declined during RT and generally recovered to some extent at the completion of RT, as shown in Figure 1A. The median pre-RT ALC was 1.74×109 cells/L. The counts declined during RT to the median ALC nadir as 0.24×109 cells/L, and the median onset time of ALC nadir was 33 days from the initiation of RT. Finally, ALC partially recovered to the median counts of 0.57×109 cells/L at the end of RT. The incidence of pre-, during-, and post-RT G4 lymphopenia were 0%, 33.1%, and 4.6%, respectively (Figure 1B).
Figure 1 The change of absolute lymphocyte count (ALC) from baseline to post-RT. (A) ALC declined during RT and generally recovered to some extent at the completion of RT. (B) The median ALC and incidence of G4 lymphopenia at different time points (pre-, nadir during-, and post-RT).
3.2 G4 ALC nadir during RT was associated with poor clinical outcomes
During follow-up, there were a total of 25 deaths. G4 ALC nadir was seen in 33.1% of patients. Patients with G4 ALC nadir had worse OS (p = 0.023) and PFS (p = 0.054) than those without G4 ALC nadir as shown in Figures 2A, B. On univariate analysis, OS was significantly worse in patients with G4 ALC nadir than those without G4 ALC nadir [hazard ratio (HR), 2.44; 95% CI, 1.1–5.39; p = 0.03], and PFS was also worse in patients with G4 ALC nadir (HR, 1.96; 95% CI, 0.98–3.94; p = 0.05), as shown in Table 2. On multivariable analysis, patients with G4 ALC nadir still had significantly poor OS and a trend of poor PFS than those without G4 ALC nadir (HR, 3.91; 95% CI, 1.34–11.38; p = 0.01, and HR, 1.82; 95% CI, 0.75–4.42; p = 0.19, respectively). Although without statistical significance (p > 0.05), G4 ALC nadir showed a trend of promotion effects in the occurrence of local failure, regional lymph node metastasis, and distant metastasis after RT (OR, 1.41, 1.37, and 1.83, respectively) (Figure 2C).
Figure 2 G4 ALC nadir and different treatment outcomes. (A) Patients with G4 ALC nadir had worse overall survival (OS) (p = 0.023). (B) Patients with G4 ALC nadir had worse progression-free survival (PFS) (p = 0.054). (C) The incidence and risk of local failure, regional lymph node metastasis, and distant metastasis after RT in patients with G4 ALC nadir.
3.3 Clinical and DVH characteristics and their correlations with G4 ALC nadir
The clinical characteristics were compared between the patient groups with or without G4 ALC nadir, and the univariate analysis results (ORs and p-values) are listed in Supplementary Table S1. Age, Eastern Cooperative Oncology Group (ECOG) performance status score, pre-RT hemoglobin, and pre-RT lymphocytes had protective effects from the occurrence of G4 ALC nadir (OR, 0.97, p = 0.03; OR, 0.11, p =0.04; OR, 0.97, p = 5.6e−3; OR, 0.22, p = 2.0e−4, respectively), while the usage of concurrent chemotherapy promoted the occurrence of G4 ALC nadir (OR, 5.73; p = 0.02). Body mass index (BMI) had protective (OR, 0.9; p = 0.06) and pre-RT regional lymph node metastasis had promotive (OR, 2.22; p = 0.06) effects from the occurrence of G4 ALC nadir with borderline significance.
All DVH metrics of interest were summarized in the format of median (IQR) in Supplementary Table S2. The radiation dosimetrics of different structures had high correlations (Pearson’s correlations > 0.5). There were little correlations among clinical characteristics and DVH dosimetrics, also among radiation dosimetrics and volumes, as shown in Supplementary Figure S1.
The DVH dosimetrics of each structure were compared between the patient groups with or without G4 ALC nadir as shown in Figure 3, and univariate logistic regression analysis results of each DVH dosimetrics in each structure for G4 ALC nadir are listed in Supplementary Table S3. The volume of GTV_N was correlated with the occurrence of G4 ALC nadir (OR, 1.07; p = 0.01). The volume of GTV_P, all dosimetrics of PTV_5500, and the volume, V5, and V10 of the body showed a tendency to correlate with the occurrence of G4 ALC nadir (all p-values <0.1).
Figure 3 Summary of dose–volume histogram (DVH) metrics of both tumor targets and organs at risk (OARs) in patients with or without G4 ALC nadir (red is the summary of DVH metrics in patients without G4 ALC nadir; green is the summary of DVH metrics in patients with G4 ALC nadir). Panels (A, B) show the median (95%CI) of Dmax and Dmean of both tumor targets and OARs in patients with or without G4 ALC nadir. Panel (C) shows the relative volume (in percentage) covered by different dose levels of both tumor targets and OARs in patients with or without G4 ALC nadir.
3.4 Elastic-net regression modeling for selecting risk factors affecting G4 ALC nadir
In searching grids from 0 to 1 step 0.05, the best alpha in the elastic-net logistic regression model with best performances was selected as 0.6 in one randomly separated train set. Then, elastic-net models were established in differently separated train sets and validated in the rest of the test sets in 10-fold CV for 100 iterations to summarize the prediction performances. As summarized in all models, mean AUC value in train sets was 0.84 (IQR, 0.82–0.86) and that in test sets was 0.76 (IQR, 0.69–0.83). The selected frequencies of all the 14 clinical characteristics and 63 DVH parameters in elastic-net regression models in 100 iterations bootstrapping are shown in Supplementary Figure S2.
The most moderate elastic-net model was selected as the final model. In building the model, the correlations between regression coefficients and lambda are shown in Figures 4A, B. The final model is shown in Figure 4C. Considering both the significance and selected frequency in bootstrapping, seven important risk factors were included in the final model (Figure 4C), including four clinical characteristics (ECOG, pre-RT hemoglobin, pre-RT lymphocytes, and concurrent chemotherapy) and three DVH parameters (GTV_N volume, PTV_5500 Dmax, and body volume).
Figure 4 Elastic-net regression modeling for selecting important features for G4 ALC nadir. (A) Elastic-net coefficient profiles of all factors. (B) The selected optimal parameter (lambda) in the most moderate elastic-net model. (C) Forest plot of the seven selected important features for G4 ALC nadir.
3.5 Development and validation of the prediction model for G4 ALC nadir
The final multivariable logistic regression model was further constructed with these seven factors selected by elastic-net model in one seldomly separated train set, as shown in Figure 4C. Among clinical characteristics, ECOG (OR, 0.16; 95% CI, 0.02–1.15; p = 0.07), pre-RT lymphocytes (OR, 3.01e−04;95% CI,0–0.13; p < 0.01), and pre-RT hemoglobin (OR, 6.16e−05; 95% CI, 0–0.08; p < 0.01) were protective factors for occurrence of G4 ALC nadir, while concurrent chemotherapy was a promoting factor for G4 ALC nadir (OR, 10.12; 95% CI, 1.76–58.18; p < 0.01). Among DVH parameters, body volume played as a protective factor (OR, 0.56; 95% CI, 0.35–0.89, p = 0.01), while the GTV_N volume and PTV_5500 Dmax promoted the incidence of G4 ALC nadir (OR, 1.16; 95% CI, 0.7–1.93; p = 0.56; OR, 1.58; 95% CI, 1–2.51; p = 0.05, respectively). It was consistent with the coefficient in the final LASSO-net regression model.
The final multivariable logistic regression model with seven important factors was compared with the multivariable logistic regression model with four clinical characteristics selected by elastic-net model. Their prediction abilities were testified in both train and test sets, as summarized in Figure 5B, and one example of AUC in the train and test sets was shown in Figure 5A. Four evaluation criteria, including sensitivity, specificity, accuracy, and AUC were all summarized and compared in Figure 5B. The final model with seven important factors had significantly higher AUC (mean, 0.84; 95% CI, 0.83–0.84) than the model with four clinical features (AUC mean, 0.8; 95% CI, 0.8–0.81) in train sets (a Wilcoxon paired rank test, p < 0.01), and the final model also had significantly higher AUC (mean, 0.77; 95% CI, 0.76–0.79) than the model with only four clinical features (mean, 0.76; 95% CI, 0.75–0.78) in test sets (a Wilcoxon paired rank test, p < 0.01). These results indicated that the DVH parameters improved the prediction performance for G4 ALC nadir. Finally, for the purpose of clinical usage in the future, the corresponding nomogram of the final multivariable logistic regression model with seven important factors for predicting G4 ALC nadir was plotted, as shown in Figure 5C.
Figure 5 Evaluations of the models and nomogram for G4 ALC nadir prediction. (A) One example of receiver operating characteristic (ROC) curve of the prediction models in train and test sets. (B) Comparison of four evaluation criteria of different prediction models in train and test sets. (C) Nomogram of the final prediction model with seven parameters.
4. Discussion
Lymphocytes are the most radiosensitive cells among the erythroid, myeloid, and lymphoid lineage with LD50 (lethal dose required to reduce the surviving fraction of lymphocytes by 50%) of only 2 Gy (5). RT-induced lymphopenia was common and correlated with poor survival in patients with different types of solid tumors, such as thoracic malignancies, brain tumors, head and neck cancers, and cervical cancer (15). In cervical cancer, the reported incidence of G4 lymphopenia during CCRT was 16%, and G4 lymphopenia could predict poor survival (4). The current study confirmed the results of previous studies. In our study, the incidence of G4 ALC nadir during pelvic (chemo)RT was as high as 33.1%, and G4 ALC nadir was associated with poor survival outcomes. Therefore, studies that focused on the prediction model for G4 lymphopenia are justified.
In this study, using the important risk factors selected from elastic-net models in machine learning framework, we developed and validated a multivariable logistic regression model for predicting G4 lymphopenia during pelvic (chemo)RT in cervical cancer patients with mean AUC = 0.84 (95% CI, 0.83–0.84), mean accuracy = 0.78 (95% CI, 0.77–0.79) in train sets and mean AUC = 0.77 (95% CI, 0.76–0.79), mean accuracy = 0.76 (95% CI, 0.75–0.78) in test sets. The final multivariable logistic regression model included four clinical characteristics (ECOG, pre-RT hemoglobin, pre-RT lymphocytes, and concurrent chemotherapy) and three DVH parameters (GTV_N volume, PTV_5500 Dmax, and body volume). Until now, we are not aware of any similar prediction models in cervical cancer. There are some other studies using machine learning algorithms to predict RT-induced lymphopenia in esophageal cancer (9, 16). Zhu and colleagues (9) constructed a novel deep learning model using dosimetric and clinical information to predict G4 lymphopenia during CCRT for esophageal cancer. Compared with their novel hybrid deep learning model in the train set, the AUC (0.84 vs. 0.831) and accuracy (0.78 vs. 0.769) of our final model were numerically similar (9). The performance of the final proposed model with both clinical and dosimetric factors was evaluated based on four prediction metrics and compared to the multivariable model with only clinical factors selected by elastic-net model. As expected, the model including both clinical and dosimetric factors outperformed the model including only clinical factors [mean AUC, 0.84 (95% CI, 0.83–0.84) vs. 0.8 (95% CI, 0.8–0.81), p < 0.01 in train sets, and mean AUC of 0.77 (95% CI, 0.76–0.79) vs. 0.76 (95% CI, 0.75–0.78), p < 0.01 in test sets]. With regard to potential clinical applications, we speculated that our model might play a role in the following clinical scenarios. First, as our model was totally based on pre-RT clinical and dosimetric parameters, it will enable physicians to assess EBRT plans for G4 lymphopenia risk and to identify patients at high risk who might benefit from modified treatment approaches and to guide modification of treatment approaches. Second, with the success of immunotherapy in solid tumors, the immunomodulatory effects of RT in conjunction with immune checkpoint blockade are currently under active investigation in cervical cancer (17). Lymphocytes are key effectors of immunotherapy, and lymphopenia was predictive for compromised efficacy of immunotherapy (18). It was reported that treatment-related severe lymphopenia was correlated with disease progression in NSCLC patients receiving consolidative immunotherapy after definitively chemoradiation (19). Applying lymphocyte-sparing RT has been recommended when combining RT with immunotherapy (20). RT-induced lymphopenia, which can be predicted using our model, should be one of the issues to take into consideration in designing clinical trials of RT combined with immunotherapy in cervical cancer (21).
In our proposed model, concurrent chemotherapy was a significant promote clinical factor for G4 ALC nadir (OR, 10.12; 95% CI, 1.76–58.18, p < 0.01), which was consistent with some previous studies (22, 23). In a study with large cohort of patients (N = 3,920) with different cancer types, the use of concurrent chemotherapy, particularly platinum compounds versus none, was associated with a lower ALC at end of RT (612 vs. 937 cells/ml, p < 0.001) (22). Another study with 711 patients who received definitive RT for NSCLC revealed that receipt of concurrent chemotherapy was associated with lower lymphocyte nadirs in multivariable analysis (p < 0.0001) (23). However, the contribution of concurrent chemotherapy to lymphocyte depletion is difficult to conclusively establish in studies demonstrating decreased ALC after CCRT. There were studies that showed that lymphopenia during CCRT was not significantly different among patients receiving different chemotherapy regimens, suggesting that no chemotherapy regimen per se was more likely to be cytotoxic to lymphocyte (24). In a study comparing effects of concurrent cisplatin administration during RT to RT alone on the immune function of patients with cervical cancer, administration of concurrent cisplatin might synergistically increase cytotoxic effects of radiation on tumor cells but did not alter the magnitude and the characteristics of radiation-induced immunosuppression (25). When patients received induction chemotherapy followed by consolidation chemoradiation, the drop in ALC occurred after consolidation therapy but not induction therapy, suggesting that induction chemotherapy did not play a major immediate role in causing lymphopenia (24, 26). Our study also showed that induction chemotherapy did not correlate with G4 ALC nadir during RT (OR, 1.09; p = 0.85). In esophageal cancer, neither induction chemotherapy nor the type of concurrent chemotherapy [e.g., taxane and 5-fluorouracil (5-FU) versus platinum and 5-FU or taxane and platinum or other] was associated with G4 lymphopenia (27). All these results suggest that the effect of concurrent chemotherapy on lymphopenia during RT is complex. More studies on the synergistic mechanism of chemotherapy and RT on the immune system are needed. Our study showed that ECOG was a protective factor from developing G4 ALC nadir. Patients with ECOG of 2 were less likely to develop G4 ALC nadir than those with ECOG of 0–1 (OR, 0.16; 95% CI, 0.02–1.15; p = 0.07). The possible reason for this result might be that patients with ECOG of 2 received less concurrent chemotherapy (p = 0.002).
Other two factors that promoted the incidence of G4 ALC nadir in our final model were GTV_N volume and PTV_5500 Dmax. In clinical practice, the dose level of 55 Gy was prescribed for metastatic locoregional lymph nodes. Both the two parameters indicated that the irradiation dose of the lymphatic system was positively correlated with G4 ALC nadir. Studies about the role of nodal irradiation for solid tumors on the reduction in circulating lymphocytes are scarce. Haas et al. studied the immunological effects of nodal irradiation for Hodgkin’s disease and showed that lymphoid irradiation (LI) was cytotoxic to peripheral blood T cells (28). They also postulated that the bone marrow outside the irradiation fields was a major source of T cells repopulating the peripheral blood after LI (28).
Among the clinical factors, pre-RT hemoglobin and lymphocytes were protective from developing G4 ALC nadir, which were consistent with results from previous studies (8, 9, 27, 29). Zhu et al. reported that patients with G4 lymphopenia during CCRT for esophageal cancer had lower level of baseline hemoglobin (12.94 vs. 13.28 g/dl, p = 0.008) and baseline ALC (1.42 vs. 1.78 x 109/L, p <0.001) (9). Sini et al. also suggested that baseline ALC played an extremely important role in the development of lymphopenia in patients treated with pelvis RT for prostate cancer (8). Baseline ALCs below 1.83 x 109/L were predictive of an enhanced probability of acute G3 lymphopenia (8). Recent studies found that erythroid cells can regulate immune responses (30). CD71+erythroid cells (CECs), which are immature red blood cells, including erythroblasts and reticulocytes, exert immunosuppressive functions by producing reactive oxygen species to decrease T-cell proliferation or secreting cytokines, including transforming growth factor β (TGF-β), which promotes T-cell differentiation into regulatory T cells (30, 31). In patients with cancer, anemia leads to increased frequency of CECs in the peripheral blood contributing to diminished immunity (30), and late-stage tumors can induce anemia and immunosuppressive extramedullary erythroid progenitor cells (32). In addition to affecting erythroid cells, as a systemic disease, cancer induces many functional and compositional changes to the immune system as a whole (33). Reduced abundance and decreased function of T cells in the blood was observed in cancer (33). All these studies can partly explain our results that pre-RT hemoglobin and lymphocytes were protective factors for G4 lymphopenia occurrence.
We observed that body volume was included in our final model as a protective factor for the occurrence of lymphopenia (OR, 0.56; 95% CI, 0.35–0.89, p = 0.01). In our study, body volume was defined as the part of body within the range of simulation CT scan for EBRT. On the condition of same irradiation dose, patients with larger body volume would receive lower dose per unit of body volume. We further did linear regression analysis and demonstrated that body volume was moderately correlated with BMI (R2 = 0.602, p <0.01). Univariate analysis of this study also showed that BMI had a propensity to protect patients from developing G4 ALC nadir (OR = 0.9, p = 0.06), which was consistent with previous studies (9, 27). Due to the significant correlation, BMI was not included in the final prediction model by the machine learning framework, while it might improve the model’s generalizability in external dataset.
Bone marrow displays structural and functional features resembling a secondary lymphoid organ and contains follicle-like structures similar to lymph nodes or spleen. Approximately 8%–20% of bone marrow mononuclear cells are lymphocytes (34). The correlation of bone marrow irradiation with lymphopenia is controversial. Some studies showed that the doses to the pelvic bone marrow was correlated with RT-induced lymphopenia (8, 35). However, a study on dosimetric predictors of lymphopenia induced by palliative RT showed that bone marrow dose–volume parameters did not predict lymphopenia. Our study did not find any relationship between DVH parameters of bones and occurrence of G4 ALC nadir. We postulate two possible reasons for the negative results of this study. First, lymphocytes are extremely radiosensitive to radiation (5), the doses to the pelvic bone exceeded the lethal dose of the lymphocyte (mean dose of bone was as high as 29.2 Gy). Second, the whole irradiated bones were treated as a whole organ during the process of our analysis. However, radiation to different parts of the pelvic bones may contribute differently to hematological toxicities. A study in patients treated with whole-pelvis RT for prostate cancer showed that the model for acute G3 lymphopenia included V40 of the whole pelvis, and the 1-year G2 lymphopenia model included V40 of the ilium (8).
It is also meaningful to explore the relationship between tumor biology and the risk of developing G4 ALC nadir. Squamous cell carcinoma accounted for approximately 80% of all cervical cancers, and adenocarcinoma accounted for approximately 20% (36). In our study, the majority of patients (95.4%) had squamous cell carcinoma, and histology types were not associated with the incidence of G4 ALC nadir (squamous vs. non-squamous: 33.1% vs. 33.3%, p > 0.999). Cho et al. also reported that histology types did not correlate with chemoradiation-induced lymphopenia in cervical cancer (p = 0.713) (4). Most cervical cancers are positive for human papillomavirus (HPV) (37). In our study, it was not possible to analyze the relationship between HPV and occurrence of G4 ALC nadir because there were a lot of missing data on HPV and we are not aware of any such analysis by others as well. With regard to FIGO stage, it had no significant impact on the occurrence of G4 ALC nadir in univariate analysis in our study, which was consistent with the results from Cho et al. (4). In FIGO (2018), locoregional lymph nodes (pelvic and paraaortic) are indicators for staging, and patients with locoregional lymph nodes metastases only are staged as IIIC. In clinical practice, radiation dose boosts mainly to locoregional metastatic lymph nodes, which was included in the final model.
With regard to ways of reducing the risk of lymphopenia or restoration of the number of lymphocytes in the peripheral blood, several measures can be attempted. First, according to our proposed model, if patients are at high risk of G4 lymphopenia, measures can be taken before treatment, such as correcting anemia, modifying EBRT plan to reduce PTV_5500 Dmax, or restoring a physiological number of lymphocytes in the peripheral blood by use of cytokines IL-2, IL-7, and IL-15, which play a role in the development, proliferation, and survival of T cells (18).
This study had some limitations. First, there were some discordances on the time points of blood tests due to the retrospective nature of the study. Second, lymphocyte subtypes changed differently after pelvic RT (38, 39) and had different impacts on treatment outcomes (40, 41). However, lymphocyte subtypes were unavailable for the patients included in the study, as lymphocyte subtypes were not routinely tested in our clinical practice, and no blood was collected for further tests. Third, body volume in the final model is determined by the extent of the simulation CT scans, which might be hard to synchronize across different centers. However, body volume was selected as a protective factor for the occurrence of G4 ALC nadir through elastic-net regression modeling; we think it is still meaningful to keep it in the final model to remind readers of its potential role in the occurrence of G4 ALC nadir during CCRT in cervical cancer, and we also recommend external validation of the role of body volume. Fourth, although the data were split into a training and a testing set, it would be better to use external data from different institutions to validate our results.
In conclusion, the present study developed and validated a comprehensive model integrating clinical and dosimetric parameters by machine learning method, which performed well in predicting G4 lymphopenia during pelvic RT for cervical cancer and may facilitate physicians to identify patients at high risk of G4 lymphopenia who might benefit from modified treatment approaches.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
This study was reviewed and approved by Ethics committee of the University of Hong Kong-Shenzhen Hospital. The ethics committee waived the requirement of written informed consent for participation.
Author contributions
LY: Primary data collection, data analysis, manuscript writing, and manuscript approval. ZX: Primary data collection, data analysis, manuscript editing, and manuscript approval. HY: Corresponding author, data double check, data analysis, result interpretation, study design, and manuscript approval. LG: Corresponding author, study design oversight, data quality control, results check and manuscript approval. All authors contributed to the article and approved the submitted version.
Funding
This project is supported in part by Health Commission of Guangdong Province, China (NO. B2020100); Shenzhen Science and Technology Program (NO. KQTD20180411185028798 and JCYJ20210324114600002); High Level-Hospital Program, Health Commission of Guangdong Province, China (NO. HKUSZH201902031 and HKUSZH201901038); Shenzhen Fundamental Research Program (NO. JCYJ2020109150427184) and Shenzhen Key Medical Discipline Construction Fund (NO. SZXK014).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.905222/full#supplementary-material
Supplementary Figure 1 | The Pearson’s correlations among clinical characteristics and dose-volume histogram (DVH) metrics.
Supplementary Figure 2 | The selected frequencies of 14 clinical characteristics and 63 dose-volume histogram (DVH) parameters in Elastic-net regression models in 100 iterations bootstrapping.
References
1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660
2. Koh WJ, Abu-Rustum NR, Bean S, Bradley K, Campos SM, Cho KR, et al. Cervical cancer, version 3.2019, NCCN clinical practice guidelines in oncology. J Natl Compr Cancer Network JNCCN (2019) 17(1):64–84. doi: 10.6004/jnccn.2019.0001
3. Jin JY, Hu C, Xiao Y, Zhang H, Paulus R, Ellsworth SG, et al. Higher radiation dose to the immune cells correlates with worse tumor control and overall survival in patients with stage III NSCLC: A secondary analysis of RTOG0617. Cancers (2021) 13(24):6193. doi: 10.3390/cancers13246193
4. Cho O, Chun M, Chang SJ, Oh YT, Noh OK. Prognostic value of severe lymphopenia during pelvic concurrent chemoradiotherapy in cervical cancer. Anticancer Res (2016) 36(7):3541–7.
5. Nakamura N, Kusunoki Y, Akiyama M. Radiosensitivity of CD4 or CD8 positive human T-lymphocytes by an in vitro colony formation assay. Radiat Res (1990) 123(2):224–7. doi: 10.2307/3577549
6. Qian JM, Akama-Garren E, Shin J, Gunasti L, Bang A, Pike LRG, et al. Dosimetric modeling of lymphopenia in patients with metastatic cancer receiving palliative radiation and PD-1 immune checkpoint inhibitors. Adv Radiat Oncol (2022) 7(2):100880. doi: 10.1016/j.adro.2021.100880
7. Ellsworth SG. Field size effects on the risk and severity of treatment-induced lymphopenia in patients undergoing radiation therapy for solid tumors. Adv Radiat Oncol (2018) 3(4):512–9. doi: 10.1016/j.adro.2018.08.014
8. Sini C, Fiorino C, Perna L, Noris Chiorda B, Deantoni CL, Bianchi M, et al. Dose-volume effects for pelvic bone marrow in predicting hematological toxicity in prostate cancer radiotherapy with pelvic node irradiation. Radiother Oncol J Eur Soc Ther Radiol Oncol (2016) 118(1):79–84. doi: 10.1016/j.radonc.2015.11.020
9. Zhu C, Lin SH, Jiang X, Xiang Y, Belal Z, Jun G, et al. A novel deep learning model using dosimetric and clinical information for grade 4 radiotherapy-induced lymphopenia prediction. Phys Med Biol (2020) 65(3):035014. doi: 10.1088/1361-6560/ab63b6
10. Rajula HSR, Verlato G, Manchia M, Antonucci N, Fanos V. Comparison of conventional statistical methods with machine learning in medicine: Diagnosis, drug development, and treatment. Medicina (Kaunas Lithuania) (2020) 56(9):455. doi: 10.3390/medicina56090455
11. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Society: Ser B (Statistical Methodol (2005) 67(2):301–20. doi: 10.1111/j.1467-9868.2005.00503.x
12. Hastie T TR, Friedman J. The elements of statistical learning: data mining, inference, and prediction. 2nd ed. New York, NY: Springer (2009).
13. Deist TM, Dankers F, Valdes G, Wijsman R, Hsu IC, Oberije C, et al. Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers. Med Phys (2018) 45(7):3449–59. doi: 10.1002/mp.12967
14. Xu CJ, van der Schaaf A, Schilstra C, Langendijk JA, van't Veld AA. Impact of statistical learning methods on the predictive power of multivariate normal tissue complication probability models. Int J Radiat oncol biology Phys (2012) 82(4):e677–84. doi: 10.1016/j.ijrobp.2011.09.036
15. Venkatesulu BP, Mallick S, Lin SH, Krishnan S. A systematic review of the influence of radiation-induced lymphopenia on survival outcomes in solid tumors. Crit Rev oncol/hematol (2018) 123:42–51. doi: 10.1016/j.critrevonc.2018.01.003
16. Ebrahimi S, Lim G, Hobbs BP, Lin SH, Mohan R, Cao W. A hybrid deep learning model for forecasting lymphocyte depletion during radiation therapy. Med Phys (2022) 49(5):3507–22. doi: 10.1002/mp.15584
17. Dyer BA, Feng CH, Eskander R, Sharabi AB, Mell LK, McHale M, et al. Current status of clinical trials for cervical and uterine cancer using immunotherapy combined with radiation. Int J Radiat oncol biology Phys (2021) 109(2):396–412. doi: 10.1016/j.ijrobp.2020.09.016
18. Ménétrier-Caux C, Ray-Coquard I, Blay JY, Caux C. Lymphopenia in cancer patients and its effects on response to immunotherapy: an opportunity for combination with cytokines? J immunother Cancer (2019) 7(1):85. doi: 10.1186/s40425-019-0549-5
19. Friedes C, Chakrabarti T, Olson S, Prichett L, Brahmer JR, Forde PM, et al. Association of severe lymphopenia and disease progression in unresectable locally advanced non-small cell lung cancer treated with definitive chemoradiation and immunotherapy. Lung Cancer (Amsterdam Netherlands) (2021) 154:36–43. doi: 10.1016/j.lungcan.2021.01.022
20. Lambin P, Lieverse RIY, Eckert F, Marcus D, Oberije C, van der Wiel AMA, et al. Lymphocyte-sparing radiotherapy: The rationale for protecting lymphocyte-rich organs when combining radiotherapy with immunotherapy. Semin Radiat Oncol (2020) 30(2):187–93. doi: 10.1016/j.semradonc.2019.12.003
21. Holub K, Vargas A, Biete A. Radiation-induced lymphopenia: the main aspects to consider in immunotherapy trials for endometrial and cervical cancer patients. Clin Trans Oncol Off Publ Fed Spanish Oncol Societies Natl Cancer Institute Mexico (2020) 22(11):2040–8. doi: 10.1007/s12094-020-02345-3
22. Terrones-Campos C, Ledergerber B, Vogelius IR, Specht L, Helleberg M, Lundgren J. Lymphocyte count kinetics, factors associated with the end-of-Radiation-Therapy lymphocyte count, and risk of infection in patients with solid malignant tumors treated with curative-intent radiation therapy. Int J Radiat oncol biology Phys (2019) 105(4):812–23. doi: 10.1016/j.ijrobp.2019.07.013
23. Tang C, Liao Z, Gomez D, Levy L, Zhuang Y, Gebremichael RA, et al. Lymphopenia association with gross tumor volume and lung V5 and its effects on non-small cell lung cancer patient outcomes. Int J Radiat oncol biology Phys (2014) 89(5):1084–91. doi: 10.1016/j.ijrobp.2014.04.025
24. Campian JL, Ye X, Brock M, Grossman SA. Treatment-related lymphopenia in patients with stage III non-small-cell lung cancer. Cancer Invest (2013) 31(3):183–8. doi: 10.3109/07357907.2013.767342
25. Santin AD, Hermonat PL, Ravaggi A, Bellone S, Roman J, Pecorelli S, et al. Effects of concurrent cisplatinum administration during radiotherapy vs. radiotherapy alone on the immune function of patients with cancer of the uterine cervix. Int J Radiat oncol biology Phys (2000) 48(4):997–1006. doi: 10.1016/s0360-3016(00)00769-0
26. Chadha AS, Liu G, Chen HC, Das P, Minsky BD, Mahmood U, et al. Does unintentional splenic radiation predict outcomes after pancreatic cancer radiation therapy? Int J Radiat oncol biology Phys (2017) 97(2):323–32. doi: 10.1016/j.ijrobp.2016.10.046
27. van Rossum PSN, Deng W, Routman DM, Liu AY, Xu C, Shiraishi Y, et al. Prediction of severe lymphopenia during chemoradiation therapy for esophageal cancer: Development and validation of a pretreatment nomogram. Pract Radiat Oncol (2020) 10(1):e16–26. doi: 10.1016/j.prro.2019.07.010
28. Haas GS, Halperin E, Doseretz D, Linggood R, Russell PS, Colvin R, et al. Differential recovery of circulating T cell subsets after nodal irradiation for hodgkin's disease. J Immunol (Baltimore Md 1950) (1984) 132(2):1026–30.
29. Yang L, Xu Z, Ma L, Liu Q, Chang ATY, Wang Q, et al. Early onset of severe lymphopenia during definitive radiotherapy correlates with mean body dose and predicts poor survival in cervical cancer. Cancer Biomarkers section A Dis Markers (2022) 34(1):149–59. doi: 10.3233/CBM-210292
30. Grzywa TM, Nowis D, Golab J. The role of CD71(+) erythroid cells in the regulation of the immune response. Pharmacol Ther (2021) 228:107927. doi: 10.1016/j.pharmthera.2021.107927
31. Elahi S, Ertelt JM, Kinder JM, Jiang TT, Zhang X, Xin L, et al. Immunosuppressive CD71+ erythroid cells compromise neonatal host defence against infection. Nature (2013) 504(7478):158–62. doi: 10.1038/nature12675
32. Zhao L, He R, Long H, Guo B, Jia Q, Qin D, et al. Late-stage tumors induce anemia and immunosuppressive extramedullary erythroid progenitor cells. Nat Med (2018) 24(10):1536–44. doi: 10.1038/s41591-018-0205-5
33. Hiam-Galvez KJ, Allen BM, Spitzer MH. Systemic immunity in cancer. Nat Rev Cancer (2021) 21(6):345–59. doi: 10.1038/s41568-021-00347-z
34. Zhao E, Xu H, Wang L, Kryczek I, Wu K, Hu Y, et al. Bone marrow and the control of immunity. Cell Mol Immunol (2012) 9(1):11–9. doi: 10.1038/cmi.2011.47
35. Albuquerque K, Giangreco D, Morrison C, Siddiqui M, Sinacore J, Potkul R, et al. Radiation-related predictors of hematologic toxicity after concurrent chemoradiation for cervical cancer and implications for bone marrow-sparing pelvic IMRT. Int J Radiat oncol biology Phys (2011) 79(4):1043–7. doi: 10.1016/j.ijrobp.2009.12.025
36. National Comprehensive Cancer Network. Cervical cancer (version 1.2021) . Available at: http://www.nccn.org/professionals/physician_gls/pdf/cervical.pdf (Accessed October 2, 2020).
37. Walboomers JM, Jacobs MV, Manos MM, Bosch FX, Kummer JA, Shah KV, et al. Human papillomavirus is a necessary cause of invasive cervical cancer worldwide. J Pathol (1999) 189(1):12–9. doi: 10.1002/(SICI)1096-9896(199909)189:1<12::AID-PATH431>3.0.CO;2-F
38. Eric A, Juranic Z, Tisma N, Plesinac V, Borojevic N, Jovanovic D, et al. Radiotherapy-induced changes of peripheral blood lymphocyte subpopulations in cervical cancer patients: relationship to clinical response. J BUON Off J Balkan Union Oncol (2009) 14(1):79–83.
39. Swanson GP, Jhavar SG, Hammonds K. The effect of pelvic radiation alone on lymphocyte subgroups. Clin Transl Radiat Oncol (2020) 23:100–2. doi: 10.1016/j.ctro.2020.05.010
40. Jordanova ES, Gorter A, Ayachi O, Prins F, Durrant LG, Kenter GG, et al. Human leukocyte antigen class I, MHC class I chain-related molecule a, and CD8+/regulatory T-cell ratio: which variable determines survival of cervical cancer patients? Clin Cancer Res (2008) 14(7):2028–35. doi: 10.1158/1078-0432.CCR-07-4554
Keywords: cervical cancer, machine learning model, lymphopenia, prediction, pelvic radiotherapy
Citation: Xu Z, Yang L, Yu H and Guo L (2022) A machine learning model for grade 4 lymphopenia prediction during pelvic radiotherapy in patients with cervical cancer. Front. Oncol. 12:905222. doi: 10.3389/fonc.2022.905222
Received: 26 March 2022; Accepted: 22 August 2022;
Published: 15 September 2022.
Edited by:
Kathy Han, University Health Network, CanadaReviewed by:
Hong Qi Tan, National Cancer Centre Singapore, SingaporeAnthony Magliocco, Protean BioDiagnostics Inc., United States
Copyright © 2022 Xu, Yang, Yu and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Linlang Guo, bGlubGFuZ2dAeWFob28uY29t; Hao Yu, eXVoYW9femp1QDE2My5jb20=
†These authors have contributed equally to this work and share first authorship