Background

AUTHOR=Xu Jinye, Zhou Jianghui, Hu Junxi, Ren Qinglin, Wang Xiaolin, Shu Yusheng

TITLE=Development and validation of a machine learning model for survival risk stratification after esophageal cancer surgery

JOURNAL=Frontiers in Oncology

VOLUME=12

YEAR=2022

URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.1068198

DOI=10.3389/fonc.2022.1068198

ISSN=2234-943X

ABSTRACT=<sec><title>Background</title><p>Predicting the prognosis of patients with esophageal cancer (EC) supports postoperative clinical decision-making. This study aimed to develop a reliable machine learning (ML) model for predicting the prognosis of patients with EC after surgery.</p></sec><sec><title>Methods</title><p>The records of patients from China with thoracic esophageal squamous cell carcinoma (ESCC) who underwent radical surgery for EC were analyzed. The data were split into training and test sets, and prognostic risk factors were identified in the training set using univariate and multivariate Cox regression. Based on the selected features, five ML models were trained and validated using nested cross-validation (nCV). Model performance was evaluated using the area under the curve (AUC), accuracy (ACC), and F1-score, and the best-performing model was chosen as the final model for risk stratification and survival analysis.</p></sec><sec><title>Results</title><p>This study enrolled 810 patients with thoracic ESCC. Six variables were ultimately included in the modeling. Five ML models were trained and validated, and the XGBoost model was selected as the best-performing model. The final XGBoost model was trained, optimized, and tested (AUC = 0.855; 95% CI, 0.808-0.902). Patients were stratified into three risk groups. Statistically significant differences (p &lt; 0.001) were found among all three groups in both the training and test sets.</p></sec><sec><title>Conclusions</title><p>An ML model that is highly practical and reliable for predicting the prognosis of patients with EC after surgery was established, and an application was developed to facilitate its clinical use.</p></sec>
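
The nested cross-validation (nCV) step named in the Methods can be illustrated with a minimal sketch. This is not the authors' pipeline. It assumes a binary outcome label, uses randomly generated data in place of patient records, and substitutes scikit-learn's `GradientBoostingClassifier` for XGBoost so the example has no extra dependency. The fold counts and the hyperparameter grid are hypothetical choices made only for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score

# Synthetic stand-in data. The shape (810 samples, 6 features) mirrors the
# counts reported in the abstract, but the values are random and carry no
# clinical meaning.
X, y = make_classification(
    n_samples=810, n_features=6, n_informative=4, n_redundant=0, random_state=0
)

# Inner loop: selects hyperparameters. Outer loop: estimates performance on
# folds that were never used for that selection.
inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

# Hypothetical search grid (assumption, not taken from the paper).
param_grid = {"max_depth": [2, 3], "n_estimators": [50, 100]}

search = GridSearchCV(
    GradientBoostingClassifier(random_state=0),
    param_grid,
    scoring="roc_auc",
    cv=inner_cv,
)

# Each outer fold refits the whole grid search on its training part and
# scores the selected model on its held-out part.
outer_auc = cross_val_score(search, X, y, scoring="roc_auc", cv=outer_cv)

print("AUC per outer fold:", outer_auc.round(3))
print("Mean outer AUC:", round(float(outer_auc.mean()), 3))
```

Because hyperparameters are tuned only inside each outer training fold, the mean outer AUC is a less optimistic performance estimate than the best score reported by a single grid search over all the data.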