The development and validation of automated machine learning models for predicting lymph node metastasis in Siewert type II T1 adenocarcinoma of the esophagogastric junction

Lu, Chenghao; Liu, Lu; Yin, Minyue; Lin, Jiaxi; Zhu, Shiqi; Gao, Jingwen; Qu, Shuting; Xu, Guoting; Liu, Lihe; Zhu, Jinzhou; Xu, Chunfang

doi:10.3389/fmed.2024.1266278

ORIGINAL RESEARCH article

Front. Med. , 03 April 2024

Sec. Gastroenterology

Volume 11 - 2024 | https://doi.org/10.3389/fmed.2024.1266278

This article is part of the Research Topic Modelling Esophageal Adenocarcinoma View all 8 articles

The development and validation of automated machine learning models for predicting lymph node metastasis in Siewert type II T1 adenocarcinoma of the esophagogastric junction

Chenghao Lu^1,2

Lu Liu^1,2

Minyue Yin^1,2,3

Jiaxi Lin^1,2

Shiqi Zhu^1,2

Jingwen Gao^1,2

Shuting Qu^1,2

Guoting Xu^1,2

Lihe Liu^1,2

Jinzhou Zhu^1,2^*

Chunfang Xu^1,2,4^*

¹Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
²Suzhou Clinical Center of Digestive Diseases, Suzhou, Jiangsu, China
³Department of Gastroenterology, Beijing Friendship Hospital, Capital Medical University, National Clinical Research Center for Digestive Disease, Beijing Digestive Disease Center, State Key Laboratory of Digestive Health, Beijing, China
⁴The Forth Affiliated Hospital of Soochow University, Suzhou, China

Background: Lymph node metastasis (LNM) is considered an essential prognosis factor for adenocarcinoma of the esophagogastric junction (AEG), which also affects the treatment strategies of AEG. We aimed to evaluate automated machine learning (AutoML) algorithms for predicting LNM in Siewert type II T1 AEG.

Methods: A total of 878 patients with Siewert type II T1 AEG were selected from the Surveillance, Epidemiology, and End Results (SEER) database to develop the LNM predictive models. The patients from two hospitals in Suzhou were collected as the test set. We applied five machine learning algorithms to develop the LNM prediction models. The performance of predictive models was assessed using various metrics including accuracy, sensitivity, specificity, the area under the curve (AUC), and receiver operating characteristic (ROC) curve.

Results: Patients with LNM exhibited a higher proportion of male individuals, a poor degree of differentiation, and submucosal infiltration, with statistical differences. The deep learning (DL) model demonstrated relatively good accuracy (0.713) and sensitivity (0.868) among the five models. Moreover, the DL model achieved the highest AUC (0.781) and sensitivity (1.000) in the test set.

Conclusion: The DL model showed good predictive performance among five AutoML models, indicating the advantage of AutoML in modeling LNM prediction in patients with Siewert type II T1 AEG.

1 Introduction

The global incidence of adenocarcinoma of the esophagogastric junction (AEG) has been rapidly increasing (1–7). The incidence of AEG increased by 2.5 times between 1973 and 1992, according to the statistics from the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) program (4). A study in Japan showed that the proportion of AEG in patients with gastric adenocarcinoma increased from 2.3% (1962–1965) to 10.0% (2001–2005) (2). Similarly, an increasing trend of AEG was observed from 1988 to 2012 in a Chinese hospital (8).

AEG is commonly considered a separate digestive tract tumor (9–12). The Siewert classification categorizes AEG into three types based on the location of the tumor epicenter relative to the gastroesophageal junction (GEJ) (13–15). In Siewert type I AEG, the epicenter of the tumor is located 1 to 5 cm above the GEJ. For type II, the epicenter of the tumor is located 1 cm above to 2 cm below the GEJ. For type III, the epicenter of the tumor is located 2 to 5 cm below the GEJ. Among the three subtypes, Siewert type II is generally considered the true cardia carcinoma (13).

Due to its particular anatomical location, the treatment of Siewert type II AEG has been historically complicated. For locally advanced tumors, radical surgical resection is still the primary treatment for AEG (5, 11). However, with gastrointestinal endoscopy screening, patients with digestive tract cancer are diagnosed at an early stage, making it possible to treat early AEG without lymphatic and organ metastasis by endoscopy. The endoscopic resection of superficial AEG, such as endoscopic mucosal resection (EMR) and endoscopic submucosal dissection (ESD), is considered safe and effective (16–21). Endoscopic resection techniques are increasingly being employed for early AEG, leading to a reduction in the morbidity and mortality associated with gastrectomy or esophagectomy and an improvement in the quality of life (16).

Previous studies have shown lymph node metastasis (LNM) as an independent prognostic factor for AEG (9, 22, 23). In addition, some studies have constructed the prediction models for LNM of AEG using the traditional logistic regression method (24–27). However, machine learning-based models are increasingly used in the diagnosis, prediction, and prognosis evaluation of gastrointestinal diseases, such as inflammatory bowel disease and gastrointestinal tumors (28–31). In this study, we aimed to establish predictive models for LNM in Siewert type II T1 AEG using automated machine learning (AutoML) methods to help clinicians assess the availability of endoscopic treatment and individualize a suitable treatment for patients.

2 Materials and methods

2.1 Data source

Relevant data from the SEER database were retrieved in our study. The SEER database of the National Cancer Institute, an authoritative source of information on cancer incidence and survival, contains data on various tumor sites and from sources throughout the United States.¹ Currently, the SEER program collects and releases cancer data from 17 population-based registries, covering approximately one-third of the U.S. population, which can be used to conduct population-based case–control studies that clarify the etiology of cancers, especially some uncommon ones (32, 33). By using SEER ∗ Stat 8.4.0.1 software, we obtained demographic information and cancer incidence data collected from the SEER 17 Registries, November 2021 Sub (2000–2019 varying). To identify Siewert type II AEG, we used two parameters in the SEER database. Cancers simultaneously satisfying two conditions [“TNM 7/CS v0204 + Schema” encoded 28 (Esophagus GE Junction) and “Primary Site-Labeled” encoded 160 (Cardia, NOS)] were extracted and classified as Siewert type II AEG (4, 34).

In addition, the patients with Siewert type II T1 AEG diagnosed in the First Affiliated Hospital of Soochow University and the Second Affiliated Hospital of Soochow University from April 2003 to October 2022 were retrospectively selected as the research subjects.

2.2 Including criteria

The criteria for patient inclusion were as follows: (1) patients with available TNM stage information; (2) patients aged 18 years or above at diagnosis (in consideration of the tiny proportion of patients under 18 years); (3) patients pathologically diagnosed as T1M0 Siewert Type II AEG; (4) patients with the first or only primary malignancy; and (5) patients with available information on differentiation, extension, and size.

2.3 The development of models

2.3.1 Variable selection and data pre-processing

Patient demographics (age, sex, race, year of diagnosis, and marital status) and tumor characteristics (tumor grade, tumor size, T stage, N stage, M stage, number of lymph nodes examined, and number of positive lymph nodes) were collected from the SEER database and the hospitals in Suzhou. We finally obtained eight variables (age, race, sex, marital status, differentiation, extension, size, and LNM) for our analysis.

Missing values for variables can complicate data analysis and introduce potential bias into the final results. In this study, classification variables that could not be evaluated after missing value analysis were removed. Missing interpolation was performed on continuous variables, such as tumor size, using the multivariate imputation by chained equations package (MICE version 3.15.0) in R, version 4.2.2 (Institute for Statistics and Mathematics, Vienna, Austria; http://www.r-project.org), based on the random forest method (35). The SEER data were randomly divided into the training (n = 629) and validation (n = 238) sets in a ratio of 7:3. A total of 141 samples from Suzhou were used as the test set.

The distribution of LNM outcomes was imbalanced in the training and validation sets, leading to particular bias in modeling and evaluation of model performance. To address this issue, we used the SMOTE function in the DMwR package (version 0.4.1) in R to balance the training set and the validation set by applying undersampling and oversampling techniques. Finally, we got the balanced training set (n = 949) and validation set (n = 380).

2.3.2 Modeling methods and evaluation

In this study, five algorithms—generalized linear model (GLM), gradient boosting machine (GBM), deep learning (DL), distributed random forest (DRF), and stacked ensemble (SE)—were provided by H2O² to construct prediction models using the training set. Using the h2o package (version 3.38.0.4) in R, we set the response column and the predictor columns for the training, validation, and test sets, respectively. The H2O AutoML performs a hyperparameter search using a random grid search method over the five algorithms to deliver the best model automatically. Five predictive models were finally developed for this approach. We used the validation and test sets to score and rank models.

The models’ accuracy, misclassification, specificity, sensitivity, and precision (also named positive predictive value) were obtained by plotting the confusion matrix. To select the best model, the difference between the predicted and actual results was analyzed. The predictive ability of the models was evaluated using the receiver operating characteristics (ROC) curve and the area under the curve (AUC). The procedure of patient selection and modeling is shown in Figure 1.

Figure 1

Figure 1. Flowchart of patient selection and modeling procedure.

We selected a model with the best performance using the above indicators and further evaluated it with the calibration curve. The Brier score, a statistical metric to measure the accuracy of probabilistic forecasts, was used to assess the calibration degree of the models. The score ranges from 0 to 1; a model with perfect skill has a score of 0, and the poorest model has a score of 1 (36). The unreliability index and the p-value of the calibration curve were also used to evaluate the reliability of the model.

Finally, the results of the model are presented visually for better understanding. A variable importance plot was constructed to show the importance of different variables. A Local Interpretable Model-Agnostic Explanations (LIME) Feature Importance Visualization plot was constructed using the lime package (version 0.5.3) in R to show the contributions of variables of samples to the outcome.

2.4 Statistical analysis

The statistical analysis and the modeling process were performed using R software. The package, tableone (version 0.13.2), in R was used in data analysis. We compared the baseline information and characteristics between different groups, including demographic and clinicopathological data. The normality of the quantitative data was evaluated using the Kolmogorov–Smirnov test. When the quantitative variables were normally distributed, they were represented by the mean and standard deviation (SD). However, they were represented by the median and interquartile range (IQR) when they were not. Student’s t-test was used for intergroup comparison of normally distributed quantitative variables, and the Mann–Whitney U-test was employed to compare non-normally distributed quantitative variables. The classification data were expressed by frequency and percentage, and the chi-squared (χ2) test was used for intergroup comparison.

3 Results

3.1 Characteristics

A total of 867 patients from the SEER dataset and 141 patients from Suzhou were screened for our study. After balancing the SEER dataset, 1,329 samples were collected in the SEER dataset. The patients’ demographics and clinicopathological baseline information in the SEER dataset after balancing and the test set are summarized, respectively, in Table 1. The baseline characteristic information based on LNM in the SEER dataset after balancing and the test set is summarized in Tables 2, 3, respectively. In the SEER dataset, the median sizes of the LNM and non-LNM groups were 25.00 and 15.00 mm, respectively, with statistical significance (p < 0.001). Patients with LNM had a higher proportion of poor degree of differentiation and submucosal infiltration, with statistical differences. In the test set, the average ages of the LNM and non-LNM groups were 65.00 and 67.15 years, respectively, with no statistical significance (p = 0.344). Patients with LNM had a higher proportion of poor degree of differentiation, with a p-value of 0.051. Patients in the two groups showed statistical differences in the depth of tumor invasion.

Table 1

Table 1. Baseline characteristics of patients from the SEER dataset after balancing and the test set.

Table 2

Table 2. Clinicopathological characteristics of patients from the SEER dataset after balancing.

Table 3

Table 3. Clinicopathological characteristics of patients from the test set.

The clinicopathological features of the training set and the validation set obtained from the SEER dataset are presented in Table 4. In the balanced training set and validation set, the rate of positive events, just the LNM rate in the study, was 49.7% and 50%, respectively. There were no significant differences in gender, degree of differentiation, depth of tumor invasion, or tumor size between the two groups.

Table 4

Table 4. Clinicopathological characteristics of patients from the training set and the validation set after balancing.

3.2 Performance of the models

To calculate the accuracy, sensitivity, specificity, and other indicators of the models in the validation set and the test set, the confusion matrices of the models are shown in Figures 2, 3, respectively. These indicators and the AUC of the five different models in the validation and test sets are shown in Table 5. The DL model has good sensitivity (0.868) and accuracy (0.713) in the validation set, which means that the model can accurately identify patients with positive lymph node metastases. The DL model exhibited a sensitivity of 100% in the test set, indicating that the model was able to screen out node-positive patients well and reduce missed diagnoses. The GBM model achieved an accuracy of 0.763, a sensitivity of 0.821, and a specificity of 0.705 in the validation set. The sensitivity (0.700) of the model in the test is lower than that of the DL model. The confusion matrix revealed that the model failed to correctly predict three patients with positive lymph nodes in the test set. Although the GLM model exhibited the highest sensitivity in the validation set (0.916) and the test set (1.000), its specificity was lower than other models. This aspect suggests that the GLM model was less capable of predicting negative LNM.

Figure 2

Figure 2. Confusion matrices of five models in the validation set. In this figure, 1 of target represents lymph node metastasis in the population, while 1 of prediction represents the positive prediction of lymph node metastasis by the model. (A) Confusion matrix of the DL model in the validation set. (B) Confusion matrix of the GBM model in the validation set. (C) Confusion matrix of the SE model in the validation set. (D) Confusion matrix of the DRF model in the validation set. (E) Confusion matrix of the GLM model in the validation set.

Figure 3

Figure 3. Confusion matrices of five models in the test set. In this figure, 1 of target represents lymph node metastasis in the population, while 1 of prediction represents the positive prediction of lymph node metastasis by the model. (A) Confusion matrix of the DL model in the test set. (B) Confusion matrix of the GBM model in the test set. (C) Confusion matrix of the SE model in the test set. (D) Confusion matrix of the DRF model in the test set. (E) Confusion matrix of the GLM model in the test set.

Table 5

Table 5. Performance of models in the dataset.

The ROC curve and AUC can evaluate the predictive ability of the models. Figures 4A,B show the ROC curves of the five models in the validation and test sets, respectively. The DL model achieved a good AUC (0.769) in the validation set, and it exhibited the highest AUC in the test set compared to other models. The Matthews correlation coefficient (MCC) score is a commonly used metric for evaluating binary classification models. The DL model achieved a MCC score of 0.448. The MCC scores of the five predictive models on the test set are not high, which may be related to the small size of the test set.

Figure 4

Figure 4. ROC of predictive models in the sets. (A) ROC of predictive models in the validation set. (B) ROC of predictive models in the test set.

Considering that the predictive model is a preoperative screening model, sensitivity should have a high weight on the selection of models. Hence, we believe that the DL model is the best model for predicting LNM in patients with AEG, with high sensitivity and reasonable specificity. The DL model consists of an input layer, two hidden layers, and an output layer. Dropout is applied in both hidden layers at a rate of 30%, which helps to prevent overfitting by randomly dropping out a percentage of units during modeling. The regularization terms are set to 0 for all layers. Rectifier (ReLU) activation functions are used in the hidden layers, while Softmax activation is used in the output layer for classification.

3.3 The performance of the deep learning model

3.3.1 Calibration curve in the datasets

The calibration curves are shown in Figures 5A,B, which is another way to evaluate the model. The calibration curve of the DL model in the validation set shows a high degree of fit. The Brier scores of the DL model in the validation and test sets were 0.213 and 0.228, respectively, indicating that the prediction results of the model were in good agreement with the actual outcome. The unreliability index of the model in the validation set was 0.070, which suggests that the DL model is reliable for predicting the LNM in T1 Siewert type II AEG patients.

Figure 5

Figure 5. Calibration curves of the DL model in the sets. (A) The calibration curve of the DL model in the validation set. (B) The calibration curve of the DL model in the test set.

3.3.2 Model visualization

The variable importance in the DL model is shown in Figure 6. According to this figure, tumor size is the most important predictor of LNM in T1 Siewert type II patients. Furthermore, we randomly selected four cases to plot the LIME feature importance visualization, as shown in Figure 7. Take the first case as an example, the tumor is moderately differentiated, and the white male married patient supports the lymph node without metastasis. In the third case, male married patients with moderately differentiated tumors contradict the result of LNM. However, other parameters, such as tumor infiltration into the submucosa and patients from Asian or Pacific regions, support LNM. Under the comprehensive prediction, the probability of LNM in this patient was 61%. That is, the prediction result is the same as the actual outcome.

Figure 6

Figure 6. Variable importance in the DL model. In this figure, “Race.AI” means patients’ race is American Indian; “Race.W” means patients’ race is white; “Race.B” means patients’ race is Black; “Differentiation.1” means well-differentiated tumors; “Differentiation.3” means poorly differentiated tumors.

Figure 7

Figure 7. LIME feature importance visualization. In this figure, “Race = W” means patients’ race is white; “Race = API” means patients’ race is Asian or Pacific Islander; “Differentiation = 2” means tumor is moderately differentiated.

4 Discussion

In the study, we found that differentiation, the depth of invasion, the size of the AEG, and gender were related to LNM. Five predictive models were developed using AutoML. Among these models, the DL model is the most suitable for predicting and screening LNM in early AEG, with the highest sensitivity and AUC in the test set.

With the incidence rates rising, a series of problems in the treatment and prognosis of AEG have been gradually becoming global concerns. With the application of endoscopic screening technology, patients with digestive tract cancer (including AEG) are diagnosed at an early stage, making endoscopic treatment of superficial AEG possible. Because of the inherent differences in the anatomy of AEG, there are certain technical difficulties in treating AEG with ESD (37). Chen et al. (38) found that the procedure speed of ESD for early AEG is slower than that for early gastric carcinoma, possibly due to AEG extending beyond the cardia, including the angle of His. However, endoscopic treatment (including ESD) remains an effective alternative to surgery for the treatment of early AEG based on comparable long-term outcomes (18, 20). With the advancements in endoscopic treatment, early AEG can be effectively resected by EMR/ESD with fewer complications, better preservation of gastric function, a shorter duration of hospital stay, and a lower cost compared with traditional gastrectomy or esophagectomy (16, 17). Endoscopic treatment is gaining acceptance because it is more tolerable, especially in elderly patients. Chen et al. (20) revealed that endoscopic treatment may be considered in patients aged 65 years or those with submucosal (T1b stage) cancer of the AEG.

It has been widely accepted that LNM is an important prognostic factor for patients with Siewert type II AEG. In the study of Wang et al. (22), a prognostic model for the outcome of patients with AEG based on a traditional algorithm was established. The positive lymph nodes and the ratio of metastatic lymph nodes were identified as two of the prognostic factors according to the univariate analysis. Naoki et al. (23) found that LNM was the only independent prognostic factor for AEG in their study. To achieve better endoscopic treatment effects, early AEG should meet certain standards (including no lymph node and distant organ metastasis) (12, 16, 18). In some studies, the location of the LNM has been found to have a significant impact on the surgical method and the scope of lymph node dissection (39–41).

Preoperative diagnosis of LNM mainly relies on computed tomography (CT), endoscopic ultrasound, and magnetic resonance imaging (MRI), which are primarily based on the size of the lymph nodes. The preoperative prediction of LNM using the CT criteria has high specificity (23). However, the diagnostic accuracy of LNM prediction using these methods is not particularly high, as the evaluation of the lymph node size is greatly affected by other factors and thus heavily relies on the physician’s evaluation (26). In addition, detecting LNM in a narrow space (such as the diaphragm, aorta, and pericardium) by contrast-enhanced CT before surgery is more complex than in lymph nodes around the stomach or colon (23). Moreover, not all patients have access to contrast-enhanced CT for diagnosing clinical LNM.

Several studies have constructed LNM predictive models of AEG until now. However, most studies only used traditional logistic regression analysis for risk factors and did not perform independent external validation. Chen et al. (25) used the logistic regression method to predict the LNM risk in early AEG patients, and the AUC of the prediction model is 0.742. Feng et al. (26) provided a detailed explanation of the correlation between tumor size and LNM in AEG and used logistic regression to plot a nomogram, which can predict the LNM risk. Zheng et al. (24) used small samples to explore the risk factors for LNM in AEG while showing the specific groups of lymph nodes. All of these studies are consistent with our findings, but the predictive performance of their models is weaker than that of ours, indicating that machine learning has good advantages in the establishment of LNM predictive models.

In the present study, a SEER-based case–control analysis has been conducted. We found that most AEG patients (approximately 71.2%) with LNM had a submucosal invasion. Approximately 53.9% of patients without LNM had submucosal infiltration. Given the key role of LNM in the selection of endoscopic or surgical resection, we built predictive models of LNM using AutoML methods on data from the SEER database and validated the models with independent data. Among the five models, the DL model is highly sensitive to predict LNM in early AEG patients. A consistent performance of our new DL model across the datasets with different baseline characteristics provides evidence of its robustness and generalizability.

In our study, we found that the degree of tumor differentiation, the depth of tumor invasion, and gender were related to LNM. Lower degrees of differentiation have higher incidences of LNM. Higher incidences of LNM are observed in less differentiated tumors due to higher heterogeneity and more aggressive biological characteristics compared to other histological types. The risk of LNM is higher for AEG that invades the submucosa. The reason behind this observation may be due to the presence of substantial lymphatic capillaries in the submucosa and the large gap between adjacent endothelial cells. If the tumor infiltrates the submucosa or deeper, cancer cells could invade the lymphatic capillaries, resulting in LNM (24). In terms of gender, the exact reason remains unknown. However, several studies have shown more prolonged survival in female individuals than in male counterparts with esophageal cancer (42–45), which is attributed to both sex itself (sex hormones and reproductive factors) and other extrinsic risk factors (43).

The predictive model we have established can help clinicians predict the LNM risk of early AEG while combining imaging findings, thus helping us make better clinical decisions and personalized treatment plans for early AEG patients. Of note, the prediction model was developed using postoperative pathological data, which can also be obtained from endoscopically resected pathological specimens. Hence, we investigated LNM predictive models for T1 AEG. However, certain limitations still exist in our study. First, the inherent limitations of retrospective and non-randomized studies may lead to unavoidable bias. Second, the prediction model was based on postoperative pathological data, and therefore, further studies combined with preoperative data are needed to validate our model. Third, the patient data from AEG were collected from two hospitals in Suzhou to validate our prediction models. Due to the low rate of LNM in the population, the test set is highly imbalanced, with positive cases representing less than 10% of cases, which makes it less credible to validate the models’ predictive performance in the test set. That is why the precision of the model in the test set is not ideal, which means data from different hospitals in different regions need to be further collected to expand the sample size. Lymphovascular invasion has been repeatedly demonstrated as the most crucial risk factor for LNM (46). The esophageal invasion length is thought to be associated with mediastinal LNM (39, 47, 48). However, due to the limited data available in the SEER database, certain tumor characteristics (such as lymphovascular invasion, esophageal invasion length, and the groups of LNM), blood index, and imaging data were missing. Therefore, we cannot further improve the performance of the LNM predictive model in a multimodal way.

5 Conclusion

In summary, in this multicenter-based case–control study, we report that the degree of tumor differentiation, tumor size, gender, and depth of tumor invasion are correlated with the LNM of Siewert type II T1 AEG. Using AutoML algorithms, we built five models to predict LNM in the early AEG. The DL model is the best model for predicting LNM in patients with AEG, with high sensitivity and reasonable specificity. This model should be further applied in clinical practice, and the predictive performance of this model should be prospectively explored in further clinical follow-up.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the First Affiliated Hospital of Soochow University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

CL: Formal analysis, Investigation, Methodology, Resources, Software, Validation, Writing – original draft. LuL: Validation, Writing – review & editing. MY: Validation, Visualization, Writing – review & editing. JL: Data curation, Methodology, Writing – review & editing. SZ: Methodology, Software, Writing – review & editing. JG: Data curation, Methodology, Writing – review & editing. SQ: Data curation, Validation, Writing – review & editing. GX: Validation, Writing – review & editing. LiL: Data curation, Validation, Writing – review & editing. JZ: Conceptualization, Funding acquisition, Supervision, Writing – review & editing. CX: Conceptualization, Funding acquisition, Project administration, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by the National Natural Science Foundation of China (82000540), the Suzhou Clinical Center of Digestive Diseases (Szlcyxzx202101), the Science and Technology Plan of Suzhou City (SKY2021038), and the Youth Program of Suzhou Health Committee (KJXW2019001).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2024.1266278/full#supplementary-material

Footnotes

1. ^ https://seer.cancer.gov/

2. ^https://h2o.ai/

References

1. Hasegawa, S, and Yoshikawa, T. Adenocarcinoma of the esophagogastric junction: incidence, characteristics, and treatment strategies. Gastric Cancer. (2010) 13:63–73. doi: 10.1007/s10120-010-0555-2

PubMed Abstract | Crossref Full Text | Google Scholar

2. Kusano, C, Gotoda, T, Khor, CJ, Katai, H, Kato, H, Taniguchi, H, et al. Changing trends in the proportion of adenocarcinoma of the esophagogastric junction in a large tertiary referral center in Japan. J Gastroenterol Hepatol. (2008) 23:1662–5. doi: 10.1111/j.1440-1746.2008.05572.x

PubMed Abstract | Crossref Full Text | Google Scholar

3. Hatta, W, Tong, D, Lee, YY, Ichihara, S, Uedo, N, and Gotoda, T. Different time trend and management of esophagogastric junction adenocarcinoma in three Asian countries. Dig Endosc. (2017) 29:18–25. doi: 10.1111/den.12808

PubMed Abstract | Crossref Full Text | Google Scholar

4. Buas, MF, and Vaughan, TL. Epidemiology and risk factors for gastroesophageal junction tumors: understanding the rising incidence of this disease. Semin Radiat Oncol. (2013) 23:3–9. doi: 10.1016/j.semradonc.2012.09.008

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zheng, Y-H, and Zhao, E-H. Recent advances in multidisciplinary therapy for adenocarcinoma of the esophagus and esophagogastric junction. World J Gastroenterol. (2022) 28:4299–309. doi: 10.3748/wjg.v28.i31.4299

PubMed Abstract | Crossref Full Text | Google Scholar

6. Blot, WJ. Rising incidence of adenocarcinoma of the esophagus and nGastric cardia. JAMA. (1991) 265:1287. doi: 10.1001/jama.1991.03460100089030

Crossref Full Text | Google Scholar

7. Imamura, Y, Watanabe, M, Oki, E, Morita, M, and Baba, H. Esophagogastric junction adenocarcinoma shares characteristics with gastric adenocarcinoma: literature review and retrospective multicenter cohort study. Ann Gastroenterol Surg. (2021) 5:46–59. doi: 10.1002/ags3.12406

PubMed Abstract | Crossref Full Text | Google Scholar

8. Liu, K, Yang, K, Zhang, W, Chen, X, Chen, X, Zhang, B, et al. Changes of Esophagogastric junctional adenocarcinoma and gastroesophageal reflux disease among surgical patients during 1988-2012: a single-institution, High-volume Experience in China. Ann Surg. (2016) 263:88–95. doi: 10.1097/SLA.0000000000001148

PubMed Abstract | Crossref Full Text | Google Scholar

9. Nakamura, T, Ide, H, Eguchi, R, Ota, M, Shimizu, S, and Isono, K. Adenocarcinoma of the esophagogastric junction: a summary of responses to a questionnaire on adenocarcinoma of the esophagus and the esophagogastric junction in Japan. Dis Esophagus. (2002) 15:219–25. doi: 10.1046/j.1442-2050.2002.00262.x

PubMed Abstract | Crossref Full Text | Google Scholar

10. Takeuchi, H, and Kitagawa, Y. Adenocarcinoma of the esophagogastric junction: territory of the esophagus or stomach, or an independent region? Ann Surg Oncol. (2013) 20:705–6. doi: 10.1245/s10434-012-2781-9

Crossref Full Text | Google Scholar

11. Cao, F, Hu, C, Xu, Z-Y, Zhang, Y-Q, Huang, L, Chen, J-H, et al. Current treatments and outlook in adenocarcinoma of the esophagogastric junction: a narrative review. Ann Transl Med. (2022) 10:377. doi: 10.21037/atm-22-1064

PubMed Abstract | Crossref Full Text | Google Scholar

12. Ye, H, Chen, P, Wang, Y-F, and Cai, X-J. Endoscopic versus surgical therapy for early Esophagogastric junction adenocarcinoma based on lymph node metastasis risk: a population-based analysis. Front Oncol. (2021) 11:716470. doi: 10.3389/fonc.2021.716470

Crossref Full Text | Google Scholar

13. von Rahden, BHA, Feith, M, and Stein, HJ. Carcinoma of the cardia: classification as esophageal or gastric cancer? Int J Color Dis. (2005) 20:89–93. doi: 10.1007/s00384-004-0646-9

Crossref Full Text | Google Scholar

14. Siewert, JR, and Stein, HJ. Classification of adenocarcinoma of the oesophagogastric junction. Br J Surg. (2003) 85:1457–9. doi: 10.1046/j.1365-2168.1998.00940.x

Crossref Full Text | Google Scholar

15. Rüdiger Siewert, J, Feith, M, Werner, M, and Stein, HJ. Adenocarcinoma of the esophagogastric junction: results of surgical therapy based on anatomical/topographic classification in 1,002 consecutive patients. Ann Surg. (2000) 232:353–61. doi: 10.1097/00000658-200009000-00007

PubMed Abstract | Crossref Full Text | Google Scholar

16. GuoHui, M, MingHua, Z, ZhenYu, C, JianHai, L, ChunXi, W, and ZeLong, Y. Comparable long-term outcomes after endoscopic therapy and gastrectomy of early adenocarcinoma of esophagogastric junction: a population-based study. Surg Endosc. (2022) 36:7521–8. doi: 10.1007/s00464-022-09187-w

Crossref Full Text | Google Scholar

17. Gong, EJ, Kim, DH, Ahn, JY, Jung, KW, Lee, JH, Choi, KD, et al. Comparison of long-term outcomes of endoscopic submucosal dissection and surgery for esophagogastric junction adenocarcinoma. Gastric Cancer. (2017) 20:84–91. doi: 10.1007/s10120-016-0679-0

PubMed Abstract | Crossref Full Text | Google Scholar

18. Abe, S, Ishihara, R, Takahashi, H, Ono, H, Fujisaki, J, Matsui, A, et al. Long-term outcomes of endoscopic resection and metachronous cancer after endoscopic resection for adenocarcinoma of the esophagogastric junction in Japan. Gastrointest Endosc. (2019) 89:1120–8. doi: 10.1016/j.gie.2018.12.010

PubMed Abstract | Crossref Full Text | Google Scholar

19. Liu, S, Chai, N, Lu, Z, Li, H, Xiong, Y, Zhai, Y, et al. Long-term outcomes of superficial neoplasia at the esophagogastric junction treated via endoscopic submucosal dissection and endoscopic submucosal tunnel dissection: a cohort study of a single center from China. Surg Endosc. (2020) 34:216–25. doi: 10.1007/s00464-019-06753-7

PubMed Abstract | Crossref Full Text | Google Scholar

20. Chen, H, Yu, X, Yang, R, Li, S, Zhang, G, Si, X, et al. The long-term outcomes of surgery versus endoscopic treatment in patients with Siewert type II T1M0N0 adenocarcinoma of the Esophagogastric junction. Cancer Control. (2022) 29:107327482211433. doi: 10.1177/10732748221143389

Crossref Full Text | Google Scholar

21. Doumbe-Mandengue, P, Beuvon, F, Belle, A, Dermine, S, Palmieri, L-J, Abou Ali, E, et al. Outcomes of endoscopic submucosal dissection for early esophageal and gastric cardia adenocarcinomas. Clin Res Hepatol Gastroenterol. (2021) 45:101700. doi: 10.1016/j.clinre.2021.101700

PubMed Abstract | Crossref Full Text | Google Scholar

22. Wang, J, Shi, L, Chen, J, Wang, B, Qi, J, Chen, G, et al. A novel risk score system for prognostic evaluation in adenocarcinoma of the oesophagogastric junction: a large population study from the SEER database and our center. BMC Cancer. (2021) 21:806. doi: 10.1186/s12885-021-08558-1

PubMed Abstract | Crossref Full Text | Google Scholar

23. Urakawa, N, Kanaji, S, Suzuki, S, Sawada, R, Harada, H, Goto, H, et al. Prognostic and Clinicopathological significance of lymph node metastasis in the Esophagogastric junction adenocarcinoma. Anticancer Res. (2022) 42:1051–7. doi: 10.21873/anticanres.15566

PubMed Abstract | Crossref Full Text | Google Scholar

24. Zheng, Z, Yin, J, Wu, H-W, Li, J, Cai, J, Qin, S-Q, et al. Explored risk factors for lymph node metastasis with Siewert II/III adenocarcinoma of the gastroesophageal junction. Anticancer Res. (2017) 37:4605–10. doi: 10.21873/anticanres.11860

PubMed Abstract | Crossref Full Text | Google Scholar

25. Chen, L, Tang, K, Wang, S, Chen, D, and Ding, K. Predictors of lymph node metastasis in Siewert type II T1 adenocarcinoma of the Esophagogastric junction: a population-based study. Cancer Control. (2021) 28:107327482110266. doi: 10.1177/10732748211026668

Crossref Full Text | Google Scholar

26. Feng, H, Zheng, J, Zheng, C, Deng, Z, Liao, Q, Wang, J, et al. The probability of lymph node metastasis with a tumor size larger than and smaller than 4 cm is different in stages T1-T3 of Siewert type II adenocarcinoma of esophagogastric junction: a population-based study. J Cancer. (2021) 12:6873–82. doi: 10.7150/jca.63392

PubMed Abstract | Crossref Full Text | Google Scholar

27. Zhu, M, Cao, B, Li, X, Li, P, Wen, Z, Ji, J, et al. Risk factors and a predictive nomogram for lymph node metastasis of superficial esophagogastric junction cancer. J Gastroenterol Hepatol. (2020) 35:1524–31. doi: 10.1111/jgh.15004

PubMed Abstract | Crossref Full Text | Google Scholar

28. Osman, MH, Mohamed, RH, Sarhan, HM, Park, EJ, Baik, SH, Lee, KY, et al. Machine learning model for predicting postoperative survival of patients with colorectal Cancer. Taehan Am Hakhoe Chi. (2022) 54:517–24. doi: 10.4143/crt.2021.206

PubMed Abstract | Crossref Full Text | Google Scholar

29. Arai, J, Aoki, T, Sato, M, Niikura, R, Suzuki, N, Ishibashi, R, et al. Machine learning-based personalized prediction of gastric cancer incidence using the endoscopic and histologic findings at the initial endoscopy. Gastrointest Endosc. (2022) 95:864–72. doi: 10.1016/j.gie.2021.12.033

PubMed Abstract | Crossref Full Text | Google Scholar

30. Nguyen, NH, Picetti, D, Dulai, PS, Jairath, V, Sandborn, WJ, Ohno-Machado, L, et al. Machine learning-based prediction models for diagnosis and prognosis in inflammatory bowel diseases: a systematic review. J Crohns Colitis. (2022) 16:398–413. doi: 10.1093/ecco-jcc/jjab155

PubMed Abstract | Crossref Full Text | Google Scholar

31. Zhu, H, Wang, G, Zheng, J, Zhu, H, Huang, J, Luo, E, et al. Preoperative prediction for lymph node metastasis in early gastric cancer by interpretable machine learning models: a multicenter study. Surgery. (2022) 171:1543–51. doi: 10.1016/j.surg.2021.12.015

PubMed Abstract | Crossref Full Text | Google Scholar

32. Daly, MC, and Paquette, IM. Surveillance, epidemiology, and end results (SEER) and SEER-Medicare databases: use in clinical research for improving colorectal Cancer outcomes. Clin Colon Rectal Surg. (2019) 32:061–8. doi: 10.1055/s-0038-1673355

PubMed Abstract | Crossref Full Text | Google Scholar

33. Engels, EA, Pfeiffer, RM, Ricker, W, Wheeler, W, Parsons, R, and Warren, JL. Use of surveillance, epidemiology, and end results-medicare data to conduct case-control studies of cancer among the US elderly. Am J Epidemiol. (2011) 174:860–70. doi: 10.1093/aje/kwr146

Crossref Full Text | Google Scholar

34. Miccio, JA, Oladeru, OT, Yang, J, Xue, Y, Choi, M, Zhang, Y, et al. Neoadjuvant vs. adjuvant treatment of Siewert type II gastroesophageal junction cancer: an analysis of data from the surveillance, epidemiology, and end results (SEER) registry. J Gastrointest Oncol. (2016) 7:403–10. doi: 10.21037/jgo.2015.10.06

PubMed Abstract | Crossref Full Text | Google Scholar

35. van Buuren, S, and Groothuis-Oudshoorn, K. Mice: multivariate imputation by chained equations in R. J Stat Softw. (2011) 45:1–67. doi: 10.18637/jss.v045.i03

Crossref Full Text | Google Scholar

36. Roulston, MS. Performance targets and the brier score. Meteorol Appl. (2007) 14:185–94. doi: 10.1002/met.21

Crossref Full Text | Google Scholar

37. Yamada, M, Oda, I, Nonaka, S, Suzuki, H, Yoshinaga, S, Taniguchi, H, et al. Long-term outcome of endoscopic resection of superficial adenocarcinoma of the esophagogastric junction. Endoscopy. (2013) 45:992–6. doi: 10.1055/s-0033-1344862

PubMed Abstract | Crossref Full Text | Google Scholar

38. Chen, Z, Liu, Y, Dou, L, Zhang, Y, He, S, and Wang, G. The efficacy of the application of the curative criteria of the 5rd edition Japanese gastric cancer treatment guidelines for early adenocarcinoma of the esophagogastric junction treated by endoscopic submucosal dissection. Saudi J Gastroenterol. (2021) 27:97–104. doi: 10.4103/sjg.SJG_403_20

Crossref Full Text | Google Scholar

39. Maatouk, M, Ben Safta, Y, Kbir, GH, Mabrouk, A, Ben Dhaou, A, Daldoul, S, et al. Can we predict mediastinal lymph nodes metastasis in esophagogastric junction cancer? Results of a systematic review and meta-analysis. Gen Thorac Cardiovasc Surg. (2021) 69:1165–73. doi: 10.1007/s11748-021-01665-7

PubMed Abstract | Crossref Full Text | Google Scholar

40. Harada, K, Hwang, H, Wang, X, Abdelhakeem, A, Iwatsuki, M, Blum Murphy, MA, et al. Frequency and implications of Paratracheal lymph node metastases in Resectable esophageal or gastroesophageal junction adenocarcinoma. Ann Surg. (2021) 273:751–7. doi: 10.1097/SLA.0000000000003383

Crossref Full Text | Google Scholar

41. Shiraishi, O, Yasuda, T, Kato, H, Iwama, M, Hiraki, Y, Yasuda, A, et al. Risk factors and prognostic impact of mediastinal lymph node metastases in patients with Esophagogastric junction Cancer. Ann Surg Oncol. (2020) 27:4433–40. doi: 10.1245/s10434-020-08579-3

PubMed Abstract | Crossref Full Text | Google Scholar

42. Bohanes, P, Yang, D, Chhibar, RS, Labonte, MJ, Winder, T, Ning, Y, et al. Influence of sex on the survival of patients with esophageal Cancer. J Clin Oncol. 30:2265–72. doi: 10.1200/JCO.2011.38.8751

PubMed Abstract | Crossref Full Text | Google Scholar

43. Xie, SH, and Lagergren, J. The male predominance in esophageal adenocarcinoma. Clin Gastroenterol Hepatol. (2016) 14:338–347.e1. doi: 10.1016/j.cgh.2015.10.005

PubMed Abstract | Crossref Full Text | Google Scholar

44. Corley, DA, and Buffler, PA. Oesophageal and gastric cardia adenocarcinomas: analysis of regional variation using the Cancer incidence in five continents database. Int J Epidemiol. (2001) 30:1415–25. doi: 10.1093/ije/30.6.1415

PubMed Abstract | Crossref Full Text | Google Scholar

45. Hidaka, H, Hotokezaka, M, Nakashima, S, Uchiyama, S, Maehara, N, and Chijiiwa, K. Sex difference in survival of patients treated by surgical resection for esophageal cancer. World J Surg. (2007) 31:1982–7. doi: 10.1007/s00268-007-9193-1

PubMed Abstract | Crossref Full Text | Google Scholar

46. Zheng, C, Feng, X, Zheng, J, Yan, Q, Hu, X, Feng, H, et al. Lymphovascular invasion as a prognostic factor in non-metastatic adenocarcinoma of Esophagogastric junction after radical surgery. Cancer Manag Res. (2020) 12:12791–9. doi: 10.2147/CMAR.S286512

PubMed Abstract | Crossref Full Text | Google Scholar

47. Nishiwaki, N, Noma, K, Matsuda, T, Maeda, N, Tanabe, S, Sakurama, K, et al. Risk factor of mediastinal lymph node metastasis of Siewert type I and II esophagogastric junction carcinomas. Langenbeck's Arch Surg. (2020) 405:1101–9. doi: 10.1007/s00423-020-02017-4

PubMed Abstract | Crossref Full Text | Google Scholar

48. Koyanagi, K, Kato, F, Kanamori, J, Daiko, H, Ozawa, S, and Tachimori, Y. Clinical significance of esophageal invasion length for the prediction of mediastinal lymph node metastasis in Siewert type II adenocarcinoma: a retrospective single-institution study. Ann Gastroenterol Surg. (2018) 2:187–96. doi: 10.1002/ags3.12069

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: machine learning, adenocarcinoma of the esophagogastric junction (AEG), lymph node metastasis (LNM), predictive model, Surveillance, Epidemiology, and End Results (SEER) program

Citation: Lu C, Liu L, Yin M, Lin J, Zhu S, Gao J, Qu S, Xu G, Liu L, Zhu J and Xu C (2024) The development and validation of automated machine learning models for predicting lymph node metastasis in Siewert type II T1 adenocarcinoma of the esophagogastric junction. Front. Med. 11:1266278. doi: 10.3389/fmed.2024.1266278

Received: 24 July 2023; Accepted: 15 March 2024;
Published: 03 April 2024.

Edited by:

Ajaz Bhat, Sidra Medicine, Qatar

Reviewed by:

Muzafar Ahmad Macha, Islamic University of Science and Technology, India
Ikhlak Ahmed, Sidra Medicine, Qatar

Copyright © 2024 Lu, Liu, Yin, Lin, Zhu, Gao, Qu, Xu, Liu, Zhu and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jinzhou Zhu, anp6aHVAemp1LmVkdS5jbg==; Chunfang Xu, eHVjaHVuZmFuZ0BzdWRhLmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

The development and validation of automated machine learning models for predicting lymph node metastasis in Siewert type II T1 adenocarcinoma of the esophagogastric junction

1 Introduction

2 Materials and methods

2.1 Data source

2.2 Including criteria

2.3 The development of models

2.3.1 Variable selection and data pre-processing

2.3.2 Modeling methods and evaluation

2.4 Statistical analysis

3 Results

3.1 Characteristics

3.2 Performance of the models

3.3 The performance of the deep learning model

3.3.1 Calibration curve in the datasets

3.3.2 Model visualization

4 Discussion

5 Conclusion

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

Footnotes

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good