Identification of high-risk factors associated with mortality at 1-, 3-, and 5-year intervals in gastric cancer patients undergoing radical surgery and immunotherapy: an 8-year multicenter retrospective analysis

Liu, Yuan; Wang, Lanyu; Du, Wenyi; Huang, Yukang; Guo, Yi; Song, Chen; Tian, Zhiqiang; Niu, Sen; Xie, Jiaheng; Liu, Jinhui; Cheng, Chao; Shen, Wei

doi:10.3389/fcimb.2023.1207235

ORIGINAL RESEARCH article

Front. Cell. Infect. Microbiol., 31 May 2023

Sec. Extra-intestinal Microbiome

Volume 13 - 2023 | https://doi.org/10.3389/fcimb.2023.1207235

This article is part of the Research TopicNew Insights into the Role of Tumor Microbial Microenvironment in Tumor ImmunotherapyView all 7 articles

Identification of high-risk factors associated with mortality at 1-, 3-, and 5-year intervals in gastric cancer patients undergoing radical surgery and immunotherapy: an 8-year multicenter retrospective analysis

Yuan Liu^1†

Lanyu Wang^2†

Wenyi Du^1†

Yukang Huang^1†

Yi Guo^3†

Chen Song¹

Zhiqiang Tian¹

Sen Niu¹

Jiaheng Xie⁴

Jinhui Liu^4*

Chao Cheng^5*

Wei Shen^1*

¹Department of General Surgery, Wuxi People’s Hospital Affiliated to Nanjing Medical University, Wuxi, China
²Department of Urology, Wuxi People’s Hospital Affiliated to Nanjing Medical University, Wuxi, China
³Department of General Practice, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, Shandong, China
⁴The First Affiliated Hospital of Nanjing Medical University, Nanjing, China
⁵Department of Neurosurgery, Wuxi People’s Hospital Affiliated to Nanjing Medical University, Wuxi, China

Background: Combining immunotherapy with surgical intervention is a prevailing and radical therapeutic strategy for individuals afflicted with gastric carcinoma; nonetheless, certain patients exhibit unfavorable prognoses even subsequent to this treatment regimen. This research endeavors to devise a machine learning algorithm to recognize risk factors with a high probability of inducing mortality among patients diagnosed with gastric cancer, both prior to and during their course of treatment.

Methods: Within the purview of this investigation, a cohort of 1015 individuals with gastric cancer were incorporated, and 39 variables encompassing diverse features were recorded. To construct the models, we employed three distinct machine learning algorithms, specifically extreme gradient boosting (XGBoost), random forest (RF), and k-nearest neighbor algorithm (KNN). The models were subjected to internal validation through employment of the k-fold cross-validation technique, and subsequently, an external dataset was utilized to externally validate the models.

Results: In comparison to other machine learning algorithms employed, the XGBoost algorithm demonstrated superior predictive capacity regarding the risk factors that affect mortality after combination therapy in gastric cancer patients for a duration of one year, three years, and five years posttreatment. The common risk factors that significantly impacted patient survival during the aforementioned time intervals were identified as advanced age, tumor invasion, tumor lymph node metastasis, tumor peripheral nerve invasion (PNI), multiple tumors, tumor size, carcinoembryonic antigen (CEA) level, carbohydrate antigen 125 (CA125) level, carbohydrate antigen 72-4 (CA72-4) level, and H. pylori infection.

Conclusion: The XGBoost algorithm can assist clinicians in identifying pivotal prognostic factors that are of clinical significance and can contribute toward individualized patient monitoring and management.

Introduction

Gastric cancer is among the most prevalent malignancies and serves as the primary cause of cancer-related deaths, with a particularly high incidence observed in developing countries (Maomao et al., 2022; Sekiguchi et al., 2022). Global epidemiological surveys on tumors have indicated that the incidence of gastric cancer is on the rise, concurrent with a shift in dietary habits (Morgan et al., 2022). Early diagnosis and prompt treatment hold critical importance in tumor management. Over the years, the advent of comprehensive therapeutic modalities such as immunotherapy and molecular targeted drugs has significantly improved the survival rates of patients with advanced gastric cancer (Li et al., 2021). However, a considerable fraction of tumor cells lack the requisite molecules that can interact with the immune system or exhibit immune evasion mechanisms, consequently resulting in immunotherapy failure. Furthermore, immunotherapy necessitates specific biomarkers or gene expression in patients, thus limiting its applicability to certain patient populations. The presence of side effects such as immune suppression or immune hyperactivation post immunotherapy further impedes its development. In contemporary clinical practice, clinicians frequently resort to combining advanced surgical techniques with immunotherapy to combat gastric cancer; notwithstanding, treatment failure may still occur, ultimately leading to patient mortality. This can be attributed to diverse factors, such as tumor type, stage, location and size, patient age, physical condition, and immune system status. Tumor cells may undergo mutation and evolve, resulting in increased tumor resistance to treatment (Kakinuma et al., 2021; Xiang et al., 2021; Shibata et al., 2022).

Clinicians commonly use their clinical experience and factors such as the patient’s medical history and presentation to assess the risk of death after combination therapy for gastric cancer. However, this method has limitations in terms of accuracy and subjectivity. Imaging tests such as CT and MRI are also used in diagnosis, but they increase the workload of medical staff and are financially burdensome for patients’ families. Additionally, some examination protocols are invasive and radioactive, which can cause harm to patients. Traditional regression models have been used, but they have poor discrimination and calibration ability (Niu et al., 2020). Artificial intelligence, particularly machine learning algorithms, can analyze and learn from large amounts of data to discover complex relationships and patterns between variables, enabling prediction of future disease occurrence (Wang et al., 2015). Compared to traditional prediction methods based on statistical methods and empirical rules, machine learning algorithms have stronger adaptive and generalization capabilities and can adapt to a wider and more complex data situation while avoiding errors introduced by researchers’ subjective factors and limitations of research methods. Liu et al. employed sophisticated data mining techniques to enhance the identification of prognostic risk factors in individuals diagnosed with early-stage gastric cancer, focusing specifically on non-invasive variables (Liu et al., 2018; Afrash et al., 2023). In this study, we analyzed clinical data from patients with gastric cancer and utilized machine learning algorithms to develop a prediction model for patient death after radical gastric cancer surgery combined with immunotherapy to improve the quality of postoperative survival.

Materials and methods

Study subjects

In this study, we used data from the clinical databases of the Affiliated Wuxi People’s Hospital of Nanjing Medical University, Wuxi Second People’s Hospital, and Shandong Provincial Hospital affiliated with Shandong First Medical University. The criteria for patient inclusion in this study were as follows: (1) adult patients aged 18 years and above but below 80 years of age; (2) patients who underwent a combination of radical gastric cancer surgery and immunotherapy; (3) the surgical team involved senior surgeons with the expertise to independently perform radical gastric cancer surgery; and (4) patients were diagnosed with gastric adenocarcinoma through postoperative pathology. Exclusion criteria for the case included the following: (1) patients presenting with coexisting malignancies; (2) patients diagnosed with gastric cancer with distant metastasis based on pathological examination or imaging studies; (3) patients diagnosed with severe cardiovascular or respiratory diseases; (4) patients with significant liver or kidney pathology; and (5) patients with incomplete case data, missing clinical information, or absent visits. All patients in the study were followed up for at least 5 years after surgery. This study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of the Affiliated Wuxi People’s Hospital of Nanjing Medical University, Wuxi Second People’s Hospital, and Shandong Provincial Hospital affiliated with Shandong First Medical University, with approval number KY22085.

Diagnosis of Helicobacter pylori infection and determination of associated factors

The diagnosis of H. pylori infection was established using three criteria: first, through postoperative bacterial culture of gastric mucosa, duodenal mucosa, gastric fluid, and expiratory samples to confirm the presence of positive H. pylori; second, through postoperative HE staining of gastric mucosal tissue sections to determine the presence of positive H. pylori; and third, through postoperative confirmation of H. pylori infection by means of urea breath test (UBT), fecal antigen test, and endoscopy active infection. The patient fulfilled all three criteria and was ultimately diagnosed with H. pylori infection (Hou et al., 2020). In this study, clinicians employed PD-1/PD-L1 checkpoint inhibitors for the immunotherapeutic treatment of patients.

Study design and data collection

Clinical information of the patients was evaluated, including demographic characteristics, basic clinical features, basic medical history, laboratory test indices before and during combination therapy, tumor characteristics, and intraoperative information of the patients. All laboratory tests conducted prior to the combination therapy were collected within 24 hours of the day, which included the measurement of albumin (ALB) levels. All laboratory tests conducted after the combination therapy were collected within 48 hours and included the patient’s H. pylori infection status, as well as the levels of carcinoembryonic antigen (CEA), carbohydrate antigen 19-9 (CA19-9), carbohydrate antigen 72-4 (CA72-4), carbohydrate antigen 125 (CA125), neutrophil-to-lymphocyte ratio (NLR), procalcitonin (PCT), C-reactive protein (CRP), and serum amyloid A (SAA). Demographic information included sex, age, body mass index (BMI), and history of smoking and alcohol abuse. Basic clinical features comprised the American Society of Anesthesiologists physical status classification (ASA score), Nutrition Risk Screening 2002 (NRS2002) score, history of surgery, family history, history of adjuvant chemotherapy, and history of adjuvant radiotherapy. Medical history included anemia, diabetes mellitus, hypertension, hyperlipidemia, and coronary heart disease (CHD). The study included tumor characteristics such as tumor T-stage, N-stage, peripheral nerve invasion (PNI), tumor size, and tumor number, as well as intraoperative variables such as the surgical approach, type of surgery, number of intraoperative lymph node dissections, anastomosis, type of anastomosis, and whether the surgery was performed as an emergency. The outcome indicators for this study were patient mortality rates at one, three, and five years.

Statistical analysis

Continuous variables were presented using medians and interquartile ranges (IQRs), whereas categorical variables were presented using frequencies and percentages. The chi-square test was utilized to compare differences between the two groups for categorical variables, while the t test was employed for continuous variables that followed a normal distribution. For continuous variables that did not follow a normal distribution, the rank sum test was applied. A two-tailed P value of less than 0.05 was considered statistically significant. All statistical analyses were conducted using SPSS, R, and Python software.

Development and evaluation of predictive models for machine learning algorithms

(1) Data preprocessing. Patients with gastric cancer who received treatment at Wuxi People’s Hospital and Wuxi Second People’s Hospital between January 2010 and January 2018 were selected as the internal validation group, while patients with gastric cancer who received treatment at the Provincial Hospital affiliated with the First Medical University of Shandong Province during the same period were selected as the external validation group. The internal validation group was divided randomly into a training set (70%) and a testing set (30%). (2) The internal validation set data underwent univariate analysis, and only the variables that demonstrated significant associations were selected for the subsequent stages of the prediction model construction. (3) Build and evaluate prediction models. The selected feature variables were integrated into the prediction models of three machine learning algorithms, namely, extreme gradient boosting (XGBoost), random forest (RF), and k-nearest neighbor algorithm (KNN). Utilizing the algorithm’s underlying principle, we employ an iterative methodology to dynamically modify the model’s parameters and observe its outcomes, aiming to ascertain the model parameters that yield optimal results. To compare and select different model algorithms, k-fold cross-validation was used since it is easy to implement and has a lower bias evaluation capability compared to other methods. Hyperparameters were adjusted by grid search, and k-fold cross-validation was performed on the internal validation set using a resampling method with k=5. The dataset was divided into five groups, with one group used as a test dataset and the rest used as a training dataset. This process was repeated until each group had been used as a test dataset (Zhao et al., 2023). Model evaluation metrics such as the area under the curve (AUC), accuracy, sensitivity, and specificity were calculated and averaged over the k-round fitness to derive the most accurate estimate of the model prediction performance. The models were evaluated for discrimination, calibration, and clinical utility, and the best model was selected for prediction analysis. Receiver operating characteristic (ROC) curves were used to determine the predictive efficacy of the model, calibration curves were used to assess agreement between the predicted and actual outcomes, and decision curve analysis (DCA) was used to evaluate the clinical utility of the model. The DCA curve starts at the intersection of the red curve with the All curve and ends at the intersection of the red curve with None, within which the corresponding patient can benefit. (4) External validation of the best model will be conducted using an external test set. ROC curves will be plotted to evaluate the predictive efficiency and generalizability of the model. (5) Model interpretation. The Shapley value, obtained through Shapley additive explanation (SHAP) analysis, allows us to determine the contribution of each feature in the sample to the prediction. Based on the Shapley values, two types of plots are constructed: the SHAP summary plot, which ranks the importance of risk factors, and the single-sample SHAP force plot, which analyzes and explains the prediction results of a single sample (Chi et al., 2023).

Results

Basic clinical information of the patient

A total of 1015 patients were included in the study, of whom 92 (9.06%) died within one year, 299 (29.46%) died within three years, and 404 (39.8%) died within five years (Figures 1A, B). The internal validation set consisted of 709 patients with gastric cancer, of whom 66 (9.31%) died within one year, 206 (29.06%) died within three years, and 281 (39.63%) died within five years. The external validation set included 306 patients with gastric cancer, of whom 26 (8.5%) died within one year, 93 (30.39%) died within three years, and 123 (40.2%) died within five years. The original data presented in the study are included in Table S1.

FIGURE 1

Figure 1 Model-making process and flowchart of the study. (A) Study design flow chart. (B) Flow diagram of patients included in the study.

Screening for risk factors for death at one, three, and five years after combination therapy in patients with gastric cancer

The univariate analysis results indicated that several factors significantly influenced the one-year death rate among gastric cancer patients, including age, emergency surgery, tumor T-stage, lymph node metastasis, peripheral nerve metastasis, tumor number, size, CEA level after combined treatment, CA125 level, CA72-4 level, and H. pylori infection (p<0.05). Similarly, the death rate at three years was influenced by several factors, such as age, surgical method, operative time, intraoperative bleeding, operation mode, tumor T-stage, lymph node metastasis, peripheral nerve metastasis, tumor number, size, CEA level, CA125 level, CA72-4 level after combined treatment, intraoperative blood transfusion, and H. pylori infection. Moreover, gender, age, surgical method, time of surgery, intraoperative bleeding, tumor T-stage, lymph node metastasis, peripheral nerve metastasis, tumor number, size, CEA level after combined treatment, CA125 level, CA72-4 level, NRS2002 score, intraoperative blood transfusion, and H. pylori infection were found to be significant influencing factors for five-year mortality in gastric cancer patients (Table 1).

TABLE 1

Table 1 Univariate analysis of the prognosis of combined treatment.

Model building and evaluation

Regarding the prediction analysis of one-year, three-year, and five-year death of patients with gastric cancer, the results of ROC curve analysis revealed that XGBoost had the highest performance compared to the other two algorithms. Specifically, the AUC value of XGBoost was 0.993 in the training set and 0.808 in the validation set for one-year death prediction; 0.994 in the training set and 0.758 in the validation set for three-year death prediction; and 0.995 in the training set and 0.829 in the validation set for five-year death prediction (Table 2). Additionally, the calibration curves of the three models were similar to the ideal curves, indicating a high level of agreement between the predicted and actual outcomes. The DCA curves also showed that all three models achieved a net clinical benefit relative to the full treatment or no treatment plan (Figures 2A–L). Finally, the k-fold cross-validation method was used to compare the generalization ability of the three models.

TABLE 2

Table 2 Evaluation of the performance of the three models.

FIGURE 2

Figure 2 Evaluation of the three models for predicting prognosis. (A) ROC curves for the training set of three models predicting patient death at one year. (B) ROC curves for the validation set of three models predicting patient death at one year. (C) Calibration plots of the three models predicting patient death at one year. (D) DCA curves of the three models predicting patient death at one year. (E) ROC curves for the training set of three models predicting patient death at three years. (F) ROC curves for the validation set of three models predicting patient death at three years. (G) Calibration plots of the three models predicting patient death at three years. (H) DCA curves of the three models predicting patient death at three years. (I) ROC curves for the training set of three models predicting patient death at five years. (J) ROC curves for the validation set of three models predicting patient death at five years. (K) Calibration plots of the three models predicting patient death at five years. (L) DCA curves of the three models predicting patient death at five years.

In this process, a test set comprising 213 cases (30.04%) was taken, while the remaining samples were used for training the models through 5-fold cross-validation. In the prediction of risk factors for patient mortality within one year, the XGBoost algorithm achieved an AUC of 0.8373 ± 0.0457 in the validation set and an AUC of 0.7938 in the test set, with an accuracy of 0.8873 (Figures 3A–C). In comparison, the RF algorithm achieved an AUC of 0.7556 ± 0.0636 in the validation set and an AUC of 0.6627 in the test set, with an accuracy of 0.6056. The KNN algorithm achieved an AUC of 0.6555 ± 0.0648 in the validation set and an AUC of 0.5787 in the test set, with an accuracy of 0.8873.

FIGURE 3

Figure 3 Internal validation of the XGBoost model. (A) ROC curves for the training set of the XGBoost model predicting patient death at one year. (B) ROC curves for the validation set of the XGBoost model predicting patient death at one year. (C) ROC curves for the test set of the XGBoost model predicting patient death at one year. (D) External validation of the XGBoost model predicting patient death at one year. (E) ROC curves for the training set of the XGBoost model predicting patient death at three years. (F) ROC curves for the validation set of the XGBoost model predicting patient death at three years. (G) ROC curves for the test set of the XGBoost model predicting patient death at three years. (H) External validation of the XGBoost model predicting patient death at three years. (I) ROC curves for the training set of the XGBoost model predicting patient death at five years. (J) ROC curves for the validation set of the XGBoost model predicting patient death at five years. (K) ROC curves for the test set of the XGBoost model predicting patient death at five years. (L) External validation of the XGBoost model predicting patient death at five years.

In the prediction of risk factors for patient mortality within three years, the XGBoost algorithm showed an AUC of 0.7403 ± 0.0174 in the validation set and an AUC of 0.7654 in the test set, with an accuracy of 0.7606 (Figures 3E–G). The RF algorithm showed an AUC of 0.6214 ± 0.0654 in the validation set, an AUC of 0.5733 in the test set, and an accuracy of 0.6808. The KNN algorithm showed an AUC of 0.7130 ± 0.0239 in the validation set and an AUC of 0.7141 in the test set, with an accuracy of 0.7183.

The results of the prediction analysis of patients’ five-year mortality showed that XGBoost had an AUC value of 0.8076 ± 0.0317 in the validation set and an AUC value of 0.8516 in the test set, with an accuracy of 0.7653 (Figures 3I–K). RF had an AUC value of 0.8045 ± 0.0466 in the validation set and an AUC value of 0.8089 in the test set, with an accuracy of 0.7371. KNN had an AUC value of 0.7800 ± 0.0311 in the validation set, an AUC value of 0.8297 in the test set, and an accuracy of 0.7277.

The XGBoost algorithm was selected to develop the model in this research after conducting a thorough comparison.

Model external validation

The AUC value in the external validation set for the prediction analysis of one-year patient mortality was 0.73, for the prediction analysis of three-year patient mortality was 0.77, and for the prediction analysis of five-year patient mortality was 0.79. These values indicate that the prediction model has high accuracy in diagnosing the disease (Figures 3D, H, L).

Model explanation

The SHAP summary plot results revealed that certain risk factors contribute to one-year, three-year, and five-year mortality in patients with gastric cancer. For one-year mortality, the highest-ranking risk factors were the CEA level after combined treatment, advanced age, CA72-4 level, multiple tumors, CA125 level, tumor lymph node metastasis, H. pylori infection, tumor size, tumor peripheral nerve metastasis, emergency surgery, and T3 and T4 tumors. The top-ranking risk factors for three-year mortality were tumor size, advanced age, CA72-4 level, intraoperative blood transfusion, tumor lymph node metastasis, intraoperative bleeding, surgical approach, tumors of T3 and T4, multiple tumors, CEA level, CA125 level, time of surgery, H. pylori infection, and tumor peripheral nerve invasion. For five-year mortality, the top-ranking risk factors were advanced age, intraoperative blood transfusion, sex, CA72-4 level, surgical approach, tumor lymph node metastasis, multiple tumors, CA125 level, tumor size, tumors in T3 and T4, H. pylori infection, intraoperative bleeding volume, CEA level, time to surgery, NRS2002 score, and peritumor nerve invasion (Figures 4A–C). The shared risk factors that were found to influence patient mortality at one, three, and five years after radical gastric cancer surgery included advanced age, tumors classified as T3 and T4, tumor lymph node metastasis, tumor peripheral nerve invasion, presence of multiple tumors, tumor size, elevated CEA levels, CA125 levels, CA72-4 levels, and H. pylori infection.

FIGURE 4

Figure 4 SHAP summary plot. Risk factors are arranged along the y-axis based on their importance, which is given by the mean of their absolute Shapley values. The higher the risk factor is positioned in the plot, the more important it is for the model. (A) SHAP summary plot of models predicting patient death at one year. (B) SHAP summary plot of models predicting patient death at three years. (C) SHAP summary plot of models predicting patient death at five years.

Discussion

This study aimed to assess risk prediction models constructed using three machine learning algorithms. Of the three algorithms, XGBoost exhibited the highest accuracy (Tseng et al., 2020; Liu et al., 2023). In comparison to the RF algorithm, XGBoost employs an adaptive gradient boosting algorithm that can automatically select the optimal splitting point and tree depth, thus improving prediction performance. Furthermore, XGBoost takes into account regularization and effectively avoids model overfitting (Zhou et al., 2022). Although the KNN algorithm has higher accuracy and can avoid overfitting problems, it has high computational complexity when searching for K nearest neighbors in the training set for each test sample and calculating their distances for classification or regression prediction. Additionally, the algorithm is less stable and slower when solving problems with multiple features and large samples (DeGregory et al., 2018; Zhao et al., 2022). The XGBoost algorithm is more suitable for multidimensional studies and reduces computational effort and training time. Importantly, XGBoost provides a feature importance assessment function that can help users better understand the contribution of features in the dataset to the prediction results, improving the algorithm’s interpretability. Consequently, after a comprehensive comparison of the three machine learning algorithms, this study selected the XGBoost algorithm to construct a model to predict the long-term postoperative prognosis of gastric cancer patients.

In the realm of clinical studies, multiple risk factors may exhibit a nonlinear relationship with poor patient prognosis, particularly in the context of cancer research. This may lead to conventional models displaying suboptimal goodness of fit or limited predictive accuracy. In contrast, machine learning is capable of training algorithms to identify and discern intricate patterns, accommodating more sophisticated nonlinear relationships. As such, it may offer superiority over traditional models in medical research. Jacek et al (Baj et al., 2020). confirmed the effectiveness of machine learning algorithms for clinical diagnosis and prognosis, and this technique in artificial intelligence may also enable accurate prediction of adverse outcomes in disease progression. Notably, machine learning algorithms assumed a crucial role in developing the predictive model utilized in this study. The present model facilitates the identification of high-risk patients with precision by clinical decision-makers, enabling timely intervention to improve patient prognosis. Furthermore, it has potential utility for medical institutions to allocate resources judiciously, monitor the vital signs of high-risk patients, and enhance survival rates among gastric cancer patients.

In this study, SHAP analysis was utilized to rank the risk factors that affect the long-term prognosis of patients with gastric cancer who received combined treatment. It was discovered that H. pylori infection was a crucial factor among all high-risk factors. It is believed that H. pylori infection hinders the effectiveness of immunotherapy and promotes the growth and survival of cancer cells by altering the normal environment between the tumor and the host (Zuo et al., 2022). The primary mechanisms of action include the following: first, the infection diminishes the number of beneficial bacteria, such as lactobacilli and bifidobacteria, which has an effect on the inflammatory environment, thereby promoting the development of gastric cancer (Sipos et al., 2006). Second, H. pylori suppresses the activity of T cells and natural killer cells, encourages the recruitment of immunosuppressive cells, and affects the immune response in the stomach, consequently impeding the immune surveillance and clearance function of the body (Figueiredo et al., 2014). Furthermore, H. pylori infection is also known to stimulate the production of proinflammatory cytokines such as IL-1β, TNF-α, and IL-6, thus creating a microenvironment that promotes cancer cell growth and survival (Li et al., 2017). At the genetic level, H. pylori infection triggers the production of reactive oxygen species (ROS) and reactive nitrogen species (RNS), thereby increasing the risk of cancer development. Li further established a strong link between H. pylori and the activation of oncogenes such as c-Met and β-catenin, as well as the inactivation of tumor suppressor genes such as p53 and E-cadherin (Siregar et al., 2017), which corroborates the findings of the current study. Additionally, due to the intricate anatomy of the stomach, surgeons may find it challenging to accurately assess the extent of tumor invasion during surgery and perform intraoperative rapid pathological examination to ensure negative margins. This may result in the retention of some blood vessels of the tumor after surgery, which can become the seeds of gastric cancer recurrence, especially in patients with H. pylori infection. This is because H. pylori can accelerate the expression of vascular endothelial growth factor (VEGF), which in turn promotes the formation of new blood vessels in the tumor microenvironment, thereby promoting tumor proliferation and migration (Bagheri et al., 2018). H. pylori infection can also upregulate the expression of metalloproteinase-9 (MMP-9), an enzyme that degrades the extracellular matrix, which increases the risk of poor prognosis in tumor patients (Deng et al., 2022).

Similar to previous research, this study also observed that deeper tumor infiltration is correlated with an increased risk of lymphatic and peripheral nerve metastasis and a higher likelihood of postoperative mortality (Xiang et al., 2021). Highly malignant and biologically active tumor cells detach from the primary site by degrading the extracellular matrix and basement membrane via protein hydrolases. These detached tumor cells invade the surrounding normal tissues and enter the nearby lymph nodes. The large perigastric omentum contains numerous blood vessels, and after gastric cancer invades the surrounding lymph nodes, vascular invasion can occur, leading to the flow of tumor cells back to the liver through the portal vein system, resulting in postoperative recurrence or metastasis (Oster et al., 2022). Furthermore, these tumors can metastasize to retroperitoneal organs via lymph nodes, and clinical manifestations are often obscure, with imaging examinations being difficult to diagnose. David’s study (Shibata et al., 2002) also demonstrated a strong correlation between lymph node metastasis and poor outcomes in patients with tumors, while Radespiel (Radespiel-Tröger et al., 2004; Turgeon et al., 2021) discovered that the higher the number of lymph node metastases, the greater the chance of tumor recurrence and the higher the postoperative mortality rate. This emphasizes the importance of thoroughly removing relevant lymph nodes during radical surgery for gastric cancer while avoiding compression of the tumor to prevent dissemination into the abdominal cavity.

Furthermore, the size of tumors has been shown to have a significant impact on patient prognosis. We hypothesize that larger tumors have a higher proliferation rate and generate more tumor vessels. Tomisaki conducted a study on 175 patients with gastrointestinal tumors and found a strong correlation between metastatic recurrence and microvessel density (MVD). The higher the MVD, the greater the likelihood of tumor cells entering the circulatory system (Tomisaki et al., 1996). Similarly, Park reported that larger tumors have a higher risk of shedding tumor cells into the abdominal and pelvic cavities and vascular tissues, thus increasing the potential risk of tumor recurrence after surgery (Park et al., 1999). Multiple gastric cancers also pose a challenge for treatment and are associated with a higher risk of tumor recurrence. Surgical removal of the primary tumor may reduce the concentration of tumor growth inhibitory factors and accelerate residual tumor recurrence. Li et al. investigated this hypothesis using two groups of mouse models, with the experimental group undergoing conventional tumor resection and the control group undergoing sham surgery. The results showed significant differences in tumor growth and recurrence between the experimental and control groups (Li et al., 2001).

The findings of the current investigation suggest that patients who display elevated levels of CEA after undergoing radical gastrectomy for gastric cancer, when followed up with immunotherapy, are at an increased risk of mortality. Tsuyoshi previously identified CEA as an acidic glycoprotein expressed by normal human mucosal cells that lacks specificity in detecting gastrointestinal tumors (Konishi et al., 2018). However, with the advancement of medical diagnostic techniques in recent years, clinicians have recognized the clinical significance of CEA. Polat conducted a prospective study to explore the association between serum levels of tumor markers and clinical variables in patients with gastrointestinal tumors. In a subsequent investigation, Tsuyoshi et al. demonstrated that most patients’ serum CEA levels returned to normal three months after combination therapy, while another group of patients with persistent CEA elevation after treatment had a rapid recurrence of tumors compared to their counterparts with normal posttreatment CEA levels. The results of the present study indicate that an increase in CEA levels after gastric cancer surgery could be indicative of tumor recurrence (Attallah et al., 2018; Konishi et al., 2018). Recently, some clinicians have employed a combination of CEA, CA19-9, cytokeratin-1 (CK-1), CA72-4 and mucin-1 (MUC-1) to predict unfavorable outcomes in gastrointestinal tumors, which has improved the sensitivity and specificity of tumor surveillance while also evaluating tumor stage and metastasis (Pua et al., 2020).

In recent years, medical practitioners have endeavored to utilize certain tests to prognosticate the outcomes of immunotherapy in conjunction with surgical interventions. However, it has been observed that such approaches exhibit a higher rate of misdiagnosis and fail to significantly influence patient prognoses. Consequently, we have opted to employ more precise machine learning algorithms for the purpose of identifying high-risk factors and enhancing patient prognoses. The present study provides a comprehensive evaluation of the model in terms of discrimination, calibration, and clinical utility, yet certain limitations exist. While the study accounted for multiple aspects of risk factors, imaging-related aspects were not considered. Furthermore, while the machine learning algorithms were more accurate, their models were more intricate and less transparent. The entire computational and decision-making process of the model is opaque, lacking the intuitive and clear features of the logistic regression model. Conversely, this retrospective study suffers from selection bias, distribution bias, and retrospective bias. Thus, further international, multicenter, large-scale studies are necessary to validate the reliability of our findings.

Conclusion

This study presented the development of a prediction model utilizing the XGBoost machine learning algorithm to assess the risk of mortality in tumor patients who underwent radical gastric cancer surgery along with immunotherapy. The model demonstrated promising accuracy and clinical value, enabling surgeons to diagnose patients promptly. The model identified that a negative outcome in gastric cancer patients correlated with various factors, including older age, tumor invasion, tumor lymph node metastasis, peripheral nerve invasion, presence of multiple tumors, larger tumor size, increased levels of CEA, CA125, and CA72-4, and H. pylori infection.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of the Affiliated Wuxi People’s Hospital of Nanjing Medical University, Wuxi Second People’s Hospital, and Shandong Provincial Hospital affiliated with Shandong First Medical University. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

YL and WS conceived the study. YL, WD, YH and LW drafted the manuscript. YG, CS, SN, JX and ZT analyzed and visualized the data. CC, JL and WS helped with the final revision of this manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Top Talent Support Program for young and middle-aged people of Wuxi Health Committee (Grant No. HB2020007).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcimb.2023.1207235/full#supplementary-material

Supplementary Table 1 | Raw data.

References

Afrash, M. R., Mirbagheri, E., Mashoufi, M., Kazemi-Arpanahi, H. (2023). Optimizing prognostic factors of five-year survival in gastric cancer patients using feature selection techniques with machine learning algorithms: a comparative study. BMC Med. Inform. Decis. Mak. 23 (1), 54. doi: 10.1186/s12911-023-02154-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Attallah, A. M., El-Far, M., Ibrahim, A. R., El-Desouky, M. A., Omran, M. M., Elbendary, M. S., et al. (2018). Clinical value of a diagnostic score for colon cancer based on serum cea, Ca19-9, cytokeratin-1 and mucin-1. Br. J. BioMed. Sci. 75 (3), 122–127. doi: 10.1080/09674845.2018.1456309

PubMed Abstract | CrossRef Full Text | Google Scholar

Bagheri, N., Sadeghiani, M., Rahimian, G., Mahsa, M., Shafigh, M., Rafieian-Kopaei, M., et al. (2018). Correlation between expression of mmp-9 and mmp-3 in helicobacter pylori infected patients with different gastroduodenal diseases. Arab J. Gastroenterol. 19 (4), 148–154. doi: 10.1016/j.ajg.2018.11.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Baj, J., Korona-Głowniak, I., Forma, A., Maani, A., Sitarz, E., Rahnama-Hezavah, M., et al. (2020). Mechanisms of the epithelial-mesenchymal transition and tumor microenvironment in helicobacter pylori-induced gastric cancer. Cells 9 (4). doi: 10.3390/cells9041055

PubMed Abstract | CrossRef Full Text | Google Scholar

Chi, H., Zhao, S., Yang, J., Gao, X., Peng, G., Zhang, J., et al. (2023). T-Cell exhaustion signatures characterize the immune landscape and predict hcc prognosis Via integrating single-cell rna-seq and bulk rna-sequencing. Front. Immunol. 14. doi: 10.3389/fimmu.2023.1137025

CrossRef Full Text | Google Scholar

DeGregory, K. W., Kuiper, P., DeSilvio, T., Pleuss, J. D., Miller, R., Roginski, J. W., et al. (2018). A review of machine learning in obesity. Obes. Rev. 19 (5), 668–685. doi: 10.1111/obr.12667

PubMed Abstract | CrossRef Full Text | Google Scholar

Deng, R., Zheng, H., Cai, H., Li, M., Shi, Y., Ding, S. (2022). Effects of helicobacter pylori on tumor microenvironment and immunotherapy responses. Front. Immunol. 13. doi: 10.3389/fimmu.2022.923477

CrossRef Full Text | Google Scholar

Figueiredo, C. A., Marques, C. R., Costa Rdos, S., da Silva, H. B., Alcantara-Neves, N. M. (2014). Cytokines, cytokine gene polymorphisms and helicobacter pylori infection: friend or foe? World J. Gastroenterol. 20 (18), 5235–5243. doi: 10.3748/wjg.v20.i18.5235

PubMed Abstract | CrossRef Full Text | Google Scholar

Hou, N., Li, M., He, L., Xie, B., Wang, L., Zhang, R., et al. (2020). Predicting 30-days mortality for mimic-iii patients with sepsis-3: a machine learning approach using xgboost. J. Transl. Med. 18 (1), 462. doi: 10.1186/s12967-020-02620-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Kakinuma, D., Arai, H., Yasuda, T., Kanazawa, Y., Matsuno, K., Sakurazawa, N., et al. (2021). Treatment of gastric cancer in Japan. J. Nippon Med. Sch 88 (3), 156–162. doi: 10.1272/jnms.JNMS.2021_88-315

PubMed Abstract | CrossRef Full Text | Google Scholar

Konishi, T., Shimada, Y., Hsu, M., Tufts, L., Jimenez-Rodriguez, R., Cercek, A., et al. (2018). Association of preoperative and postoperative serum carcinoembryonic antigen and colon cancer outcome. JAMA Oncol. 4 (3), 309–315. doi: 10.1001/jamaoncol.2017.4420

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, T. S., Kaneda, Y., Ueda, K., Hamano, K., Zempo, N., Esato, K. (2001). The influence of tumour resection on angiostatin levels and tumour growth–an experimental study in tumour-bearing mice. Eur. J. Cancer 37 (17), 2283–2288. doi: 10.1016/s0959-8049(01)00281-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, N., Tang, B., Jia, Y. P., Zhu, P., Zhuang, Y., Fang, Y., et al. (2017). Helicobacter pylori caga protein negatively regulates autophagy and promotes inflammatory response Via c-Met-Pi3k/Akt-Mtor signaling pathway. Front. Cell Infect. Microbiol. 7. doi: 10.3389/fcimb.2017.00417

CrossRef Full Text | Google Scholar

Li, K., Zhang, A., Li, X., Zhang, H., Zhao, L. (2021). Advances in clinical immunotherapy for gastric cancer. Biochim. Biophys. Acta Rev. Cancer 1876 (2), 188615. doi: 10.1016/j.bbcan.2021.188615

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, M. M., Wen, L., Liu, Y. J., Cai, Q., Li, L. T., Cai, Y. M. (2018). Application of data mining methods to improve screening for the risk of early gastric cancer. BMC Med. Inform. Decis. Mak. 18 (Suppl 5), 121. doi: 10.1186/s12911-018-0689-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, Y., Zhao, S., Du, W., Tian, Z., Chi, H., Chao, C., et al. (2023). Applying interpretable machine learning algorithms to predict risk factors for permanent stoma in patients after tme. Front. Surg. 10. doi: 10.3389/fsurg.2023.1125875

CrossRef Full Text | Google Scholar

Maomao, C., He, L., Dianqin, S., Siyi, H., Xinxin, Y., Fan, Y., et al. (2022). Current cancer burden in China: epidemiology, etiology, and prevention. Cancer Biol. Med. 19 (8), 1121–1138. doi: 10.20892/j.issn.2095-3941.2022.0231

PubMed Abstract | CrossRef Full Text | Google Scholar

Morgan, E., Arnold, M., Camargo, M. C., Gini, A., Kunzmann, A. T., Matsuda, T., et al. (2022). The current and future incidence and mortality of gastric cancer in 185 countries, 2020-40: a population-based modelling study. EClinicalMedicine 47, 101404. doi: 10.1016/j.eclinm.2022.101404

PubMed Abstract | CrossRef Full Text | Google Scholar

Niu, P. H., Zhao, L. L., Wu, H. L., Zhao, D. B., Chen, Y. T. (2020). Artificial intelligence in gastric cancer: application and future perspectives. World J. Gastroenterol. 26 (36), 5408–5419. doi: 10.3748/wjg.v26.i36.5408

PubMed Abstract | CrossRef Full Text | Google Scholar

Oster, P., Vaillant, L., Riva, E., McMillan, B., Begka, C., Truntzer, C., et al. (2022). Helicobacter pylori infection has a detrimental impact on the efficacy of cancer immunotherapies. Gut 71 (3), 457–466. doi: 10.1136/gutjnl-2020-323392

PubMed Abstract | CrossRef Full Text | Google Scholar

Park, Y. J., Park, K. J., Park, J. G., Lee, K. U., Choe, K. J., Kim, J. P. (1999). Prognostic factors in 2230 Korean colorectal cancer patients: analysis of consecutively operated cases. World J. Surg. 23 (7), 721–726. doi: 10.1007/pl00012376

PubMed Abstract | CrossRef Full Text | Google Scholar

Pua, Y. H., Kang, H., Thumboo, J., Clark, R. A., Chew, E. S., Poon, C. L., et al. (2020). Machine learning methods are comparable to logistic regression techniques in predicting severe walking limitation following total knee arthroplasty. Knee Surg. Sports Traumatol. Arthrosc. 28 (10), 3207–3216. doi: 10.1007/s00167-019-05822-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Radespiel-Tröger, M., Hohenberger, W., Reingruber, B. (2004). Improved prediction of recurrence after curative resection of colon carcinoma using tree-based risk stratification. Cancer 100 (5), 958–967. doi: 10.1002/cncr.20065

PubMed Abstract | CrossRef Full Text | Google Scholar

Sekiguchi, M., Oda, I., Matsuda, T., Saito, Y. (2022). Epidemiological trends and future perspectives of gastric cancer in Eastern Asia. Digestion 103 (1), 22–28. doi: 10.1159/000518483

PubMed Abstract | CrossRef Full Text | Google Scholar

Shibata, C., Nakano, T., Yasumoto, A., Mitamura, A., Sawada, K., Ogawa, H., et al. (2022). Comparison of cea and Ca19-9 as a predictive factor for recurrence after curative gastrectomy in gastric cancer. BMC Surg. 22 (1), 213. doi: 10.1186/s12893-022-01667-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Shibata, D., Paty, P. B., Guillem, J. G., Wong, W. D., Cohen, A. M. (2002). Surgical management of isolated retroperitoneal recurrences of colorectal carcinoma. Dis. Colon Rectum 45 (6), 795–801. doi: 10.1007/s10350-004-6300-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Sipos, G., Altdorfer, K., Pongor, E., Chen, L. P., Fehér, E. (2006). Neuroimmune link in the mucosa of chronic gastritis with helicobacter pylori infection. Dig Dis. Sci. 51 (10), 1810–1817. doi: 10.1007/s10620-006-9085-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Siregar, G., Sari, D., Sungkar, T. (2017). Serum vegf levels in helicobacter pylori infection and correlation with helicobacter pylori caga and vaca genes. Open Access Maced J. Med. Sci. 5 (2), 137–141. doi: 10.3889/oamjms.2017.031

PubMed Abstract | CrossRef Full Text | Google Scholar

Tomisaki, S., Ohno, S., Ichiyoshi, Y., Kuwano, H., Maehara, Y., Sugimachi, K. (1996). Microvessel quantification and its possible relation with liver metastasis in colorectal cancer. Cancer 77 (8 Suppl), 1722–1728. doi: 10.1002/(sici)1097-0142(19960415)77:8<1722::Aid-cncr46>3.0.Co;2-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Tseng, P. Y., Chen, Y. T., Wang, C. H., Chiu, K. M., Peng, Y. S., Hsu, S. P., et al. (2020). Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit. Care 24 (1), 478. doi: 10.1186/s13054-020-03179-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Turgeon, M. K., Gamboa, A. C., Keilson, J. M., Maniko, J., Maguire, L., Hrebinko, K., et al. (2021). Radiological assessment of persistent retroperitoneal and lateral pelvic lymph nodes after neoadjuvant therapy for rectal cancer: an analysis of the united states rectal cancer consortium. J. Surg. Oncol. 124 (5), 818–828. doi: 10.1002/jso.26600

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y. K., Kuo, F. C., Liu, C. J., Wu, M. C., Shih, H. Y., Wang, S. S., et al. (2015). Diagnosis of helicobacter pylori infection: current options and developments. World J. Gastroenterol. 21 (40), 11221–11235. doi: 10.3748/wjg.v21.i40.11221

PubMed Abstract | CrossRef Full Text | Google Scholar

Xiang, L., Jin, S., Zheng, P., Maswikiti, E. P., Yu, Y., Gao, L., et al. (2021). Risk assessment and preventive treatment for peritoneal recurrence following radical resection for gastric cancer. Front. Oncol. 11. doi: 10.3389/fonc.2021.778152

CrossRef Full Text | Google Scholar

Zhao, S., Chi, H., Yang, Q., Chen, S., Wu, C., Lai, G., et al. (2023). Identification and validation of neurotrophic factor-related gene signatures in glioblastoma and parkinson's disease. Front. Immunol. 14. doi: 10.3389/fimmu.2023.1090040

CrossRef Full Text | Google Scholar

Zhao, S., Zhang, L., Ji, W., Shi, Y., Lai, G., Chi, H., et al. (2022). Machine learning-based characterization of cuprotosis-related biomarkers and immune infiltration in parkinson's disease. Front. Genet. 13. doi: 10.3389/fgene.2022.1010361

CrossRef Full Text | Google Scholar

Zhou, X., Wang, H., Xu, C., Peng, L., Xu, F., Lian, L., et al. (2022). Application of knn and svm to predict the prognosis of advanced schistosomiasis. Parasitol. Res. 121 (8), 2457–2460. doi: 10.1007/s00436-022-07583-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Zuo, F., Somiah, T., Gebremariam, H. G., Jonsson, A. B. (2022). Lactobacilli downregulate transcription factors in helicobacter pylori that affect motility, acid tolerance and antimicrobial peptide survival. Int. J. Mol. Sci. 23 (24). doi: 10.3390/ijms232415451

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: gastric tumor, gastrectomy, immunotherapy, Helicobacter pylori, risk factor, machine learning

Citation: Liu Y, Wang L, Du W, Huang Y, Guo Y, Song C, Tian Z, Niu S, Xie J, Liu J, Cheng C and Shen W (2023) Identification of high-risk factors associated with mortality at 1-, 3-, and 5-year intervals in gastric cancer patients undergoing radical surgery and immunotherapy: an 8-year multicenter retrospective analysis. Front. Cell. Infect. Microbiol. 13:1207235. doi: 10.3389/fcimb.2023.1207235

Received: 17 April 2023; Accepted: 10 May 2023;
Published: 31 May 2023.

Edited by:

Jianguo Tang, Fudan University, China

Reviewed by:

Zhirui Zeng, Guizhou Medical University, China
Sun Hongfa, The Affiliated Hospital of Qingdao University, China
Ashutosh Kumar, Washington University in St. Louis, United States

Copyright © 2023 Liu, Wang, Du, Huang, Guo, Song, Tian, Niu, Xie, Liu, Cheng and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Wei Shen, c2hlbndlaWlqc0BvdXRsb29rLmNvbQ==; Chao Cheng, TXJfY2hlbmdjaGFvQDEyNi5jb20=; Jinhui Liu, amluaHVpbGl1QG5qbXUuZWR1LmNu

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Identification of high-risk factors associated with mortality at 1-, 3-, and 5-year intervals in gastric cancer patients undergoing radical surgery and immunotherapy: an 8-year multicenter retrospective analysis

Introduction

Materials and methods

Study subjects

Diagnosis of Helicobacter pylori infection and determination of associated factors

Study design and data collection

Statistical analysis

Development and evaluation of predictive models for machine learning algorithms

Results

Basic clinical information of the patient

Screening for risk factors for death at one, three, and five years after combination therapy in patients with gastric cancer

Model building and evaluation

Model external validation

Model explanation

Discussion

Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good