Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma

Wang, Di; Pan, Bing; Huang, Jin-Can; Chen, Qing; Cui, Song-Ping; Lang, Ren; Lyu, Shao-Cheng

doi:10.3389/fonc.2023.1106029

ORIGINAL RESEARCH article

Front. Oncol., 15 March 2023

Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1106029

Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma

Di Wang^†

Bing Pan^†

Jin-Can Huang

Qing Chen

Song-Ping Cui

Ren Lang^*

Shao-Cheng Lyu^*

Department of Hepatobiliary Surgery, Beijing Chao-Yang Hospital Capital Medical University, Beijing, China

Background: Distal cholangiocarcinoma (dCCA), originating from the common bile duct, is greatly associated with a dismal prognosis. A series of different studies based on cancer classification have been developed, aimed to optimize therapy and predict and improve prognosis. In this study, we explored and compared several novel machine learning models that might lead to an improvement in prediction accuracy and treatment options for patients with dCCA.

Methods: In this study, 169 patients with dCCA were recruited and randomly divided into the training cohort (n = 118) and the validation cohort (n = 51), and their medical records were reviewed, including survival outcomes, laboratory values, treatment strategies, pathological results, and demographic information. Variables identified as independently associated with the primary outcome by least absolute shrinkage and selection operator (LASSO) regression, the random survival forest (RSF) algorithm, and univariate and multivariate Cox regression analyses were introduced to establish the following different machine learning models and canonical regression model: support vector machine (SVM), SurvivalTree, Coxboost, RSF, DeepSurv, and Cox proportional hazards (CoxPH). We measured and compared the performance of models using the receiver operating characteristic (ROC) curve, integrated Brier score (IBS), and concordance index (C-index) following cross-validation. The machine learning model with the best performance was screened out and compared with the TNM Classification using ROC, IBS, and C-index. Finally, patients were stratified based on the model with the best performance to assess whether they benefited from postoperative chemotherapy through the log-rank test.

Results: Among medical features, five variables, including tumor differentiation, T-stage, lymph node metastasis (LNM), albumin-to-fibrinogen ratio (AFR), and carbohydrate antigen 19-9 (CA19-9), were used to develop machine learning models. In the training cohort and the validation cohort, C-index achieved 0.763 vs. 0.686 (SVM), 0.749 vs. 0.692 (SurvivalTree), 0.747 vs. 0.690 (Coxboost), 0.745 vs. 0.690 (RSF), 0.746 vs. 0.711 (DeepSurv), and 0.724 vs. 0.701 (CoxPH), respectively. The DeepSurv model (0.823 vs. 0.754) had the highest mean area under the ROC curve (AUC) than other models, including SVM (0.819 vs. 0.736), SurvivalTree (0.814 vs. 0.737), Coxboost (0.816 vs. 0.734), RSF (0.813 vs. 0.730), and CoxPH (0.788 vs. 0.753). The IBS of the DeepSurv model (0.132 vs. 0.147) was lower than that of SurvivalTree (0.135 vs. 0.236), Coxboost (0.141 vs. 0.207), RSF (0.140 vs. 0.225), and CoxPH (0.145 vs. 0.196). Results of the calibration chart and decision curve analysis (DCA) also demonstrated that DeepSurv had a satisfactory predictive performance. In addition, the performance of the DeepSurv model was better than that of the TNM Classification in C-index, mean AUC, and IBS (0.746 vs. 0.598, 0.823 vs. 0.613, and 0.132 vs. 0.186, respectively) in the training cohort. Patients were stratified and divided into high- and low-risk groups based on the DeepSurv model. In the training cohort, patients in the high-risk group would not benefit from postoperative chemotherapy (p = 0.519). In the low-risk group, patients receiving postoperative chemotherapy might have a better prognosis (p = 0.035).

Conclusions: In this study, the DeepSurv model was good at predicting prognosis and risk stratification to guide treatment options. AFR level might be a potential prognostic factor for dCCA. For the low-risk group in the DeepSurv model, patients might benefit from postoperative chemotherapy.

Introduction

Cholangiocarcinoma (CCA) includes intrahepatic CCA, perihilar CCA, and distal CCA (dCCA). Among them, dCCA, originating from the common biliary duct, is an aggressive tumor that probably accounts for 20%%–30% of all CCA cases (1, 2). Most patients with dCCA usually have advanced disease at presentation due to difficulties in early diagnosis (3). Surgical resection remains the primary treatment strategy for dCCA, such as pancreaticoduodenectomy (PD) with standard lymphadenectomy. The survival outcome of dCCA is still dismal because of the relatively low resection rate and the high relapse rate after the operation, and the 5-year overall survival rate is approximately between 20% and 50% (4, 5). Although reports have suggested that surgical resection combined with postoperative chemotherapy benefits patients with dCCA, an ideal medical model, which could lead to improvement in prediction accuracy and treatment options, remains of paramount importance (6).

The Union for International Cancer Control (UICC) TNM Classification is a globally recognized standard for classifying the extent of the spread of cancer, which records the primary and regional nodal extent of the tumor and the absence or presence of metastases. Nevertheless, the TNM Classification could not accurately predict patients’ prognosis once they received multimodality treatment. The TNM Classification only included anatomical prognostic factors and cannot incorporate non-anatomical factors associated with prognosis.

Machine learning is the name given to both the academic discipline and the collection of techniques that allow computers to undertake complex tasks. As an academic discipline, machine learning comprises elements of mathematics, statistics, and computer science. The application of machine learning mainly benefits diagnosis and outcome prediction in the medical field (7). Machine learning algorithms have been successfully applied to classify skin cancer by dermatologists (8) and to predict the progression from pre-diabetes to type II diabetes (9). Several machine learning models have been reported, including random survival forest (RSF) (10), support vector machine (SVM) (11), and DeepSurv (12), although inconsistency remains, and model-building approaches, effect estimates, and the overall accuracy and validation of these prediction models vary to the point that a consensus has not been reached.

In this study, we constructed several different machine learning models and canonical logistic Cox proportional hazards (CoxPH) and evaluated their predictive performance. The aim was to demonstrate the effectiveness of machine learning and guide individualized treatment options for dCCA patients.

Materials and methods

Patients and dataset

The clinical data of patients were collected through a retrospective review of medical records. Eligible patients were those who underwent PD for dCCA between October 2011 and December 2021. Patient data were retrieved from the hospital database. Patients were randomly divided into the training cohort and the validation cohort. The study was performed following the tenets of the Declaration of Helsinki (as revised in 2013) and was approved by the Ethics Committee of Beijing Chao-Yang Hospital Capital Medical University (No. 2020-D-301).

Data collection

We collected the demographic information of dCCA patients, such as age, gender, and history of smoking or diabetes. The preoperative blood values included white blood cell (WBC), hemoglobin (Hb), platelet (PLT), albumin (Alb), aspartate transaminase (AST), alanine transaminase (ALT), total bilirubin (TBIL), γ-glutamyl transpeptidase (GGT), plasma fibrinogen (Fib), carbohydrate antigen 19-9 (CA19-9), and carcinoembryonic antigen (CEA). Albumin-to-fibrinogen ratio (AFR) was calculated by dividing the Alb concentration by the Fib concentration. Peri-operative data included tumor differentiation, lymph node metastasis (LNM), T-stage, intraoperative blood loss, and operative duration. Postoperative chemotherapy and survival outcome were also recorded.

Flow chart of study

In this study, patients with dCCA were recruited and randomly divided into the training cohort and the validation cohort. Variables identified as independently associated with the primary outcome by least absolute shrinkage and selection operator (LASSO) regression, the RSF algorithm, and univariate and multivariate Cox regression analyses were introduced to establish the following different machine learning models and canonical regression model: SVM, SurvivalTree, Coxboost, RSF, DeepSurv, and CoxPH. The performance of models was measured and compared with mean AUC, IBS, and C-index following cross-validation. The machine learning model with the best performance was screened out and compared with the TNM Classification by mean AUC, IBS, and C-index. The DeepSurv model was further evaluated by calibration chart and decision curve analysis (DCA). Finally, patients were stratified based on the model with the best performance to assess whether they benefited from postoperative chemotherapy through the log-rank test (Supplementary Figure 1).

Modeling process

SVM is a supervised machine learning algorithm that is used for classification or regression. The objective of SVM is to find the hyperplane in high-dimensional space that best separates the data into classes. This hyperplane is called the maximum margin hyperplane and is selected based on the idea of maximizing the margin, which is the distance between the hyperplane and the closest training samples, called support vectors (13). The goal of SVM is to find the hyperplane that maximizes this margin while correctly separating the classes. Once the hyperplane is determined, new data can be classified by finding which side of the hyperplane it falls on. The formula is as follows:

b^{*} = y_{j} - \sum_{i = 1}^{1} α_{i}^{*} y_{i} (x_{i} x_{j})

SurvivalTree is a type of machine learning algorithm that is used to model and predict time-to-event data, also known as survival analysis. The main objective of SurvivalTree is to estimate the survival function, which represents the probability that an individual will survive past a given time point, given their specific characteristics or features (14). SurvivalTree is a tree-based algorithm, meaning it builds a tree-like structure to represent the relationships between different features and survival time. The algorithm splits the sample into different subgroups based on their features and predicts the survival function for each subgroup. By doing this, the algorithm can capture complex non-linear relationships between features and survival time and make accurate predictions for new individuals. The formula can be expressed as:

S (t) = e^{^} (- h (t) * γ (t))

Coxboost is a machine learning algorithm that combines the ideas of boosting and the Cox proportional hazards model, which is a popular method in survival analysis. The main objective of Coxboost is to model and predict time-to-event data, also known as survival analysis. Coxboost uses a combination of boosting and the Cox proportional hazards model to make predictions. Boosting is an ensemble learning technique that combines multiple weak learners to form a strong prediction model. In Coxboost, boosting is used to improve the performance of the Cox proportional hazards model by combining multiple models into a single model that has improved accuracy (15). The formula can be expressed as:

h (t) = h_{0} (t) * exp (\sum^{​} θ i * f (t, x i))

RSF is a machine learning algorithm that is used to model and predict time-to-event data, also known as survival analysis. The main objective of RSF is to estimate the survival function, which represents the probability that an individual will survive past a given time point, given their specific characteristics or features. RSF is an ensemble learning algorithm that combines multiple decision trees to form a random forest. Decision trees are tree-based algorithms that split the sample into different subgroups based on their features and predict the survival time for each subgroup. By combining multiple decision trees, RSF can capture complex non-linear relationships between features and survival time and make accurate predictions for new individuals (16). The test statistic function for the Z-value to accept the null hypothesis is as follows:

Z = \frac{\sum_{i = 1}^{k} (O_{1, i} - E_{1, i})}{\sqrt{\sum_{i = 1}^{k} V_{i}}} - N (0, 1)

CoxPH is a statistical method that is commonly used in survival analysis to model the relationship between covariates and time-to-event data. The main objective of CoxPH is to estimate the hazard function, which represents the instantaneous risk of an event occurring at a given time. CoxPH assumes that the hazard function is proportional for different individuals, meaning that the hazard ratio between two individuals is constant over time. This allows for the estimation of the hazard function using a simple linear regression model, where the hazard ratio between two individuals can be estimated as the exponential of the difference in their covariate values. The model can be written as follows (17):

I n h (t) = I n h_{0} (t) + b_{1} x_{1} + \dots + b_{p} x_{p}

The DeepSurv model was established based on the proposal by Katzman et al. to predict the prognosis of dCCA. DeepSurv is a multilayer neural network, including input, hidden, and output layers. This model imitates the actual clinical patients’ risk value and has tremendous generalization performance. DeepSurv contains weight decay regularization, batch normalization, and dropout, which can prevent overfitting to some extent. The loss function in the DeepSurv model is defined as Cox partial likelihood with constraints, the formula of which is as follows:

l (θ) = - \frac{1}{N_{E}} \sum_{i, E_{i} = 1} (\hat{h_{θ}} (x) - l o g \sum_{j \in R (T_{i})} e^{\hat{h_{θ}} (x)}) + α|| θ|| \begin{matrix} 2 \\ 2 \end{matrix}

We think that the smaller the value of loss function in this model, the better the stability. The Adam optimization algorithm was used and obtained current optimal parameters (18). Supplementary Table 1 shows the hyperparameters of DeepSurv. More detailed information about DeepSurv can be found on the website (https://github.com/jaredleekatzman/DeepSurv) (19).

Statistical analysis

Continuous variables were shown as mean ± standard deviation (SD) or median (interquartile range [IQR]). Categorical variables are presented as a percentage. Statistical methods are the t-test, Mann–Whitney U-test, and chi-squared test. Kaplan–Meier analysis and log-rank testing were operated using the lifelines module by Python. AFR was evaluated using “R-package Survminer” to obtain the best cutoff value. Eventually, the predictive ability was analyzed and compared between the machine learning model with the best performance and the CoxPH model. The accuracy of prognostic prediction models was assessed using the C-index, calibration chart, DCA, IBS, and AUC in the training and validation cohorts. Statistical analyses were performed using R software (version 4.0.4) and Python software (version 3.7.6). These tests of the proposed approaches were double-sided, and the standard of significance in the results was p < 0.05.

Results

The clinical characteristics of patients and selection of variables

We searched electronic medical records and identified 169 patients (105 men and 64 women, mean age 65 years, range 29–84 years) diagnosed with dCCA between October 2011 and December 2021 in this study. Among all patients, 113 (66.9%) had medium–high differentiation cancer, 141/169 (83.4%) were at T3/4 stage during presentation, 77 (45.6%) had LNM, and 41 (24.3%) received postoperative chemotherapy. Patients were randomly divided into the training cohort (n = 118) and the validation cohort (n = 51) by 7:3. There were no major differences in the demographic and clinical characteristics of patients between the two cohorts (Table 1). The cumulative incidence curves also had no significant difference in the two cohorts using the log-rank test (p = 0.21) (Supplementary Figure 2). Based on “R-package Survminer”, the best cutoff value for AFR was 11.26, and patients were divided into two groups: the high-AFR group (AFR > 11.26, n = 103, 60.9%) and the low-AFR group (AFR ≤ 11.26, n = 66, 39.1%). The univariate and multivariate Cox regression analyses showed that patients with low AFR had a dismal prognosis (HR 2.370 vs. HR 1.933, 95% CI 1.470–3.779 vs. 95% CI 1.144–3.266, respectively). LASSO analysis and RSF also demonstrated that AFR played an important role in prognosis. The Kaplan–Meier survival curve of overall survival (OS) showed that compared with the high-AFR group, patients with dCCA in the low-AFR group had worse OS (Supplementary Figure 3).

TABLE 1

Table 1 Clinical and pathologic characteristics of 169 patients with dCCA.

Selecting the variables and establishing the machine learning models

From the basic information, laboratory examinations, and peri-operative and postoperative data, 22 characteristics were reduced to five potential predictors. Algorithms, including LASSO regression (Figure 1A), RSF (Figure 1B), and univariate and multivariate Cox regression analyses (Tables 2, 3), showed that tumor differentiation, T-stage, LNM, AFR, and CA19-9 were prognostic variables identified as independently associated with the primary outcome. Six models, namely, SVM, SurvivalTree, Coxboost, DeepSurv, RSF, and CoxPH, were established based on the identified five prognostic variables. In the training cohort, the predictive performance of the SVM model (0.763) was better than that of the other models, with C-indexes of SurvivalTree (0.749), Coxboost (0.747), DeepSurv (0.746), RSF (0.745), and CoxPH (0.724). However, in the validation cohort, the C-index of DeepSurv (0.711) was better than that of the other models: SVM (0.686), SurvivalTree (0.692), Coxboost (0.690), and RSF (0.690). We also performed time-dependent ROC analysis in different models (Figure 2). The results in the training cohort showed that DeepSurv (0.823) had a higher mean AUC than other models, including SVM (0.819), SurvivalTree (0.814), Coxboost (0.816), RSF (0.813), and CoxPH (0.788). In the validation cohort, DeepSurv (0.754) also had a higher mean AUC than the other models, including SVM (0.736), SurvivalTree (0.737), Coxboost (0.734), RSF (0.730), and CoxPH (0.753). Regarding the IBS in the training and validation cohorts, DeepSurv (0.132 vs. 0.147) had a lower value than the other models, including SurvivalTree (0.135 vs. 0.236), Coxboost (0.141 vs. 0.207), RSF (0.140 vs. 0.225), and CoxPH (0.145 vs. 0.196).

FIGURE 1

Figure 1 The results of LASSO regression analysis and the RSF plot for models. (A) LASSO coefficient profiles of the expression of 22 variables. (B) The length of the horizontal axis where each variable is located represents the variable’s contribution to the outcome. LASSO, least absolute shrinkage and selection operator; RSF, random survival forest.

TABLE 2

Table 2 Univariate Cox regression analyses for predicting OS of patients with dCCA in the training cohort.

TABLE 3

Table 3 Multivariate Cox regression analyses for predicting OS of patients with dCCA in the training cohort.

FIGURE 2

Figure 2 The time-dependent ROC analysis in different models. The time-dependent ROC analysis in SVM, SurvivalTree, Coxboost, RSF, DeepSurv models, CoxPH, and DeepSurv had a higher mean AUC than other models in the training cohort. ROC, receiver operating characteristic; AUC, area under the ROC curve.

The agreement of DeepSurv between predictions and observations in prognosis was assessed using a calibration plot. The 1-, 2-, and 3-year calibration plots showed good agreement between the predictive value and the actual value in the training cohort (Figures 3A–C). DeepSurv had good performance in AUC of 1, 2, and 3 years in the training cohort (0.734, 0.824, and 0.844, respectively) (Figure 3D) and in the validation cohort (0.734, 0.760, and 0.799, respectively) (Figure 3E). DCA was applied to calculate a clinical “net benefit” for the prediction model, and the result of DCA indicated that the DeepSurv model had a better net benefit at most threshold probabilities (Figures 4A, B). After comprehensive consideration of the C-index, time-dependent ROC, and IBS, the DeepSurv model was found to have a better predictive performance than the other models, including the traditional Cox model, CoxPH (Table 4). We randomly selected three patients for the individual postoperative prognosis demonstration. It showed the individual survival probability of prognosis according to the DeepSurv model (Figure 4C).

FIGURE 3

Figure 3 Calibration plots and ROC curve for the DeepSurv model. Calibration plots in (A) 1 year, (B) 2 years, and (C) 3 years in the training cohort. The ROC of 1, 2, and 3 years between the DeepSurv model in the training cohort (D) and the validation cohort (E). ROC, receiver operating characteristic.

FIGURE 4

Figure 4 DCA of the DeepSurv model and the individual postoperative prognostic prediction. The 1-year (B) and 2-year (C) DCA of the DeepSurv model. (C) The estimated prognosis of patients in the training cohort. The blue line represents patient 2, the yellow line represents patient 35, and the red line represents patient 46. DCA, decision curve analysis.

TABLE 4

Table 4 Comparison of the bootstrapped C-indexes, mean AUC, and IBS of different models for dCCA patients in the training cohort and the validation cohort.

Comparison between the DeepSurv model and the TNM classification

As described previously, the TNM Classification is a unified standard and is a prerequisite for ensuring the quality of care, in which oncologists could communicate regarding the cancer extent for individual patients as a basis for decision making on treatment management and individual prognosis, but can also be used to inform and evaluate treatment guidelines, national cancer planning, and research. First, variables involved in the TNM Classification were applied to establish machine learning models. The result of the mean AUC showed that the predictive performance with the time-dependent ROC of DeepSurv (0.613) was better than that of SVM (0.594), SurvivalTree (0.590), Coxboost (0.594), RSF (0.591), and CoxPH (0.594) (Supplementary Figure 4). Then, the identified five prognostic variables and TNM Classification variables were employed to develop DeepSurv. Finally, the predictive performance between the DeepSurv model and the TNM Classification was estimated using the value of C-index, mean AUC, and IBS. The results in the training cohort showed that the DeepSurv model is better than the TNM Classification (0.746 vs. 0.589, 0.823 vs. 0.613, and 0.132 vs. 0.186, respectively) (Supplementary Table 2). In the validation cohort, the performance of the DeepSurv model was also better than that of the TNM Classification estimated by C-index, mean AUC, and IBS (0.711 vs. 0.568, 0.753 vs. 0.599, and 0.147 vs. 0.172, respectively). The results mentioned above indicated that the DeepSurv model was better at predictive performance and accuracy than the TNM Classification in this study.

Risk stratification and guidance of individualized chemotherapy for patients with dCCA

It is crucial to develop a stratified treatment recommendation to ensure individualized medicine. Patients in this study were stratified into the low-risk group and the high-risk group based on the DeepSurv model in the training cohort and the validation cohort, respectively (Supplementary Figure 5). The benefit from chemotherapy was estimated between the high-risk group and the low-risk group by the Kaplan–Meier analysis and log-rank test in the two cohorts. In the training cohort (Figures 5A, B), results showed that there was no statistical difference in the prognosis in the high-risk group, regardless of whether receiving chemotherapy or not (p = 0.519). In the low-risk group, patients who received postoperative chemotherapy had a better survival prognosis (p = 0.035). In the validation cohort (Figures 5C, D), there was no statistical difference in prognosis in the high-risk group (p = 0.643) and the low-risk group (p = 0.071), regardless of whether receiving chemotherapy or not. However, we still believe that risk stratification based on the DeepSurv model has potential clinical application in which postoperative chemotherapy might benefit patients with dCCA in the low-risk group.

FIGURE 5

Figure 5 Kaplan–Meier survival analysis in different risk groups. There was no significant difference in prognosis for high-risk patients in the training cohort (A) and the validation cohort (C). Patients who received chemotherapy had a better prognosis than those who did not in the training cohort (B) and the validation cohort (D).

Discussion

dCCA, originating from the common bile duct and arising from the biliary epithelium, is a heterogeneous and exceptionally aggressive malignant tumor with poor prognosis (20), and the incidence of dCCA is increasing globally. The clinical manifestations of dCCA are frequently non-specific and are related to the biliary obstruction caused by the tumor (21). The silent and asymptomatic nature of dCCA, particularly in its early stages, in combination with its high aggressiveness, intra- and inter-tumor heterogeneity, and chemoresistance, significantly compromises the efficacy of current therapeutic options, contributing to a dismal prognosis (22). Surgery is a potential curative option of reference for early stage tumors. Surgical strategies for dCCA usually require performing a pancreaticoduodenectomy, with the removal of the head of the pancreas, the first part of the duodenum, the gallbladder, and the bile duct (23). However, only 35% of patients are eligible for surgical treatment, and there is a very high rate of postoperative local recurrence (24). The 5-year OS of patients with dCCA is 23% and is slightly higher (27%) if R0 resection is achieved (the median survival after R0 resection is 25 months) (25). Therefore, it is important to increase awareness of this cancer. In the past decade, increasing efforts have been made to understand the complexity of these tumors and to develop new diagnostic tools and therapies that might help to improve patient outcomes (26). Moreover, specific prognostic models must be established for the early diagnosis, prevention, and targeted and personalized treatment options for patients with dCCA (3).

Machine learning is the name given to both the academic discipline and the collection of techniques that allow computers to undertake complex tasks. As an academic discipline, machine learning comprises elements of mathematics, statistics, and computer science. Machine learning techniques are attracting substantial interest from medical researchers and clinicians, which could accommodate different configurations of raw data, assign context weighting, and calculate the predictive power of every combination of variables available to assess diagnostic and prognostic elements. Machine learning algorithms could handle risk profiles that are highly individualized, allowing analysis of disorders with multiple etiologies and incomplete data, as is typical in real clinical settings. Using decision trees, medical researchers could then extract the minimum data necessary to make a diagnosis or therapeutic recommendations. For instance, a feature selection algorithm reduced the number of features (from 29 to 8) necessary for diagnosing autism spectrum disorder (ASD) with 100% accuracy among 612 patients with ASD (27). In this study, the number of patients’ clinical features was also reduced from 21 to 5 after the analysis of the algorithm, which would reduce the time needed to make an accurate diagnosis and improve patient outcomes. We then established algorithms on data from the training cohort and predicted the diagnostic outcome in the validation cohort. We compared the predictive performance between machine learning modes and the traditional Cox model in the training cohort and the validation cohort and found that the DeepSurv model was good at predicting prognosis and risk stratification to guide treatment options.

There is no widely used staging system for dCCA, although it can be staged according to the TNM Classification, which has become the benchmark for classifying patients with cancer, defining prognosis, and determining the best treatment approaches (28, 29). Despite providing a clinically meaningful classification correlated with prognosis (30), the current TNM Classification has some limitations. First, it has limited discriminatory ability between T2 and T3 tumors in surgically resected dCCA (31). T2 tumors include multifocal disease or disease with a vascular invasion that probably reflects disseminated disease, and the OS in patients with these tumors does not differ from the OS in patients with metastatic disease. Second, although size has been included as a prognostic factor for dCCA in the 8th edition of the American Joint Committee on Cancer (AJCC) Cancer Staging Manual, the only cutoff size considered is 5 cm in T1 tumors. Several reports have shown that a 2-cm cutoff value might identify very early tumors with a very low likelihood of dissemination and potentially long-term survival with low recurrence rates (32). Finally, the TNM Classification misses relevant prognostic factors, such as the presence of cancer-related symptoms (abdominal pain or malaise) or the degree of liver function impairment. Notably, Chaiteerakij R et al. proposed a new staging system for dCCA based on tumor size and number, vascular encasement, lymph node and peritoneal metastasis, Eastern Cooperative Oncology Group performance status, and CA19-9 level, which has shown a better performance in predicting survival than the TNM Classification (33). After the analysis in this study, we found that 1) the predictive performance of DeepSurv was better than that of other machine learning models and CoxPH by assessing C-index, mean AUC, and IBS; 2) the identified five prognostic variables by algorithms, including AFR, tumor differentiation, T stage, LNM, and CA19-9, were better than TNM Classification variables at predicting prognosis; and 3) the predictive performance of DeepSurv was better than that of the TNM Classification.

Presently, numerous studies have found that the nutritional status of the tumor patient is one of the key factors influencing the progression of the tumor (34). Malnutrition in cancer patients is a well-recognized phenomenon, especially for patients with digestive system tumors, driven by a combination of reduced food intake, decreased physical activity, and abnormal catabolic–metabolic balance caused by tumors (35). At the same time, inflammation also plays an important role in the pathogenesis of malignant tumors, which is considered to be the seventh characteristic of tumors (36). In daily practice, serum Alb has been used as a simple and reproducible parameter to assess nutrition status, an independent predictor of survival outcome in cancer patients, such as in gastric cancer in which lower serum Alb concentration was associated with worse patient prognosis (37). Xifeng Xu et al. showed that hypoalbuminemia is an independent poor prognostic indicator in patients with non-metastatic breast cancer (38). Inflammation promotes the release of Fib, which further promotes tumor cell proliferation and metastasis by participating in extracellular matrix formation and inducing epithelial–mesenchymal transition (39, 40). Guoying Wang et al. confirmed that elevated levels of Fib predicted poor outcomes in patients with hepatocellular carcinoma (41). Juan Zhao et al. found that high levels of Fib were related to poor prognosis in patients with early stage resectable extrahepatic CCA (42). AFR, which takes both Alb and Fib into account, has been indicated as a prognostic factor for various malignancies, including non-small cell lung cancer (43), chronic lymphocytic leukemia (44), and breast cancer (45). In this study, we also found that AFR was an independent prognostic variable, and the dCCA patients with low AFR might have poor OS.

Even in clinical settings where cancer patients undergo uniform therapeutic regimens, the response and OS rates are highly variable (46). Risk stratification is essential in the evaluation and management of cancer patients. Jianzhen Lin et al. (47) demonstrated that patients with refractory biliary tract carcinomas can derive considerable benefit from receiving personalized therapy guided by molecular profiling. Consequently, it is important to formulate a simple and powerful risk stratification system to identify patients with aggressive cancer courses and assist in optimizing treatment strategies. Machine learning has helped refine risk stratification and triage patients for treatment options. Therefore, we next stratified patients in this study according to the DeepSurv model to validate the prognostic value of this risk stratification model through a retrospective analysis of patients in our hospital that reflected real-world clinical conditions. For low-risk groups, patients with dCCA would benefit from postoperative chemotherapy and have a better survival outcome.

Study limitation

This study had potential limitations. First, this was a small study with a small sample size in each cohort or group, which could provide results quickly. However, it might also not yield reliable or precise estimates. Another limitation of this small study is that many of the nuances and complexities of machine learning analyses, such as sparsity or high dimensionality, are not well represented in the data. Second, this study represented the data of our single center only, which made it difficult to determine whether a particular outcome was a true finding. Additionally, this was a retrospective study, and an inferior level of evidence, selection bias, and information bias is inevitable. Future studies, preferably with larger patient cohorts from multi-centers and prospective design, should be encouraged to further confirm our preliminary outcomes.

Conclusions

We constructed accurate prediction models for the survival of patients with dCCA using a novel machine learning platform based on medical data. In this study, the DeepSurv model was good at predicting prognosis and risk stratification to guide treatment options. AFR level might be a potential prognostic factor for dCCA. For the low-risk group in the DeepSurv model, patients might benefit from postoperative chemotherapy. Machine learning has the potential to transform the way that medicine works. We look toward a future of medical research and practice greatly enhanced by the power of machine learning.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The authors are responsible for all matters of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki and was reviewed and approved by the Ethics Committee of Beijing Chao-Yang Hospital Capital Medical University (No. 2020-D-301).

Author contributions

(I) Conception and design: DW and BP. (II) Administrative support: RL and S-CL. (III) Provision of study materials: S-CL and RL. (IV) Collection and assembly of data: J-CH, QC and S-PC. (V) Data analysis and interpretation: DW and BP. (VI) Manuscript writing: DW and BP. (VII) Final approval of manuscript: all authors. All authors made substantial contributions to the conception and design, acquisition of data, or analysis and interpretation of data; took part in drafting the article or revising it critically for important intellectual content; agreed to submit to the current journal; gave final approval of the version to be published; and agreed to be accountable for all aspects of the work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1106029/full#supplementary-material

Supplementary Figure 1 | Flow chart of this study.

Supplementary Figure 2 | Cumulative incidence curves of the training cohort and the validation cohort. Cumulative incidence curve in the training cohort and the validation cohort. There was no statistically significant difference between the survival of the two cohorts using the log-rank test (p = 0.21).

Supplementary Figure 3 | The Kaplan–Meier analysis and log-rank test in different AFR groups. The Kaplan–Meier analysis in different AFR groups. There was a statistically significant difference between the survival of the two groups using the log-rank test (p = 0.0013).

Supplementary Figure 4 | The time-dependent ROC analysis in TNM Classification. The time-dependent ROC analysis in TNM Classification and the DeepSurv algorithm had the highest mean AUC.

Supplementary Figure 5 | The Kaplan–Meier analysis and log-rank test in different risk group according to DeepSurv model. (A) The DeepSurv risk stratification of patients in the training cohort. (B) The DeepSurv risk stratification of patients in the validation cohort.

Abbreviations

AFR, albumin-to-fibrinogen ratio; Alb, albumin; ALT, alanine transaminase; AST, aspartate transaminase; ASD, autism spectrum disorder; AUC, area under the ROC curve; CA19-9, carbohydrate antigen 19-9; CCA, cholangiocarcinoma; CEA, carcinoembryonic antigen; C-index, concordance index; CoxPH, Cox proportional hazards; DCA, decision curve analysis; dCCA, distal cholangiocarcinoma; Fib, fibrinogen; GGT, glutamyl transpeptidase; Hb, hemoglobin; IQR, interquartile range; IBS, integrated Brier score; LNM, lymph node metastasis; OS, overall survival; PD, pancreaticoduodenectomy; PLT, platelet; ROC, receiver operating characteristic; RSF, random survival forest; SD, standard deviation; SVM, support vector machine; TNM, tumor–node–metastasis; TBIL, total bilirubin; WBC, white blood cell.

References

1. Moeini A, Haber PK, Sia D. Cell of origin in biliary tract cancers and clinical implications. JHEP Rep (2021) 3(2):100226. doi: 10.1016/j.jhepr.2021.100226

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Xu W, Yu S, Xiong J, Long J, Zheng Y, Sang X. CeRNA regulatory network-based analysis to study the roles of noncoding RNAs in the pathogenesis of intrahepatic cholangiocellular carcinoma. Aging (Albany NY). (2020) 12(2):1047–86. doi: 10.18632/aging.102634

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Brindley PJ, Bachini M, Ilyas SI, Khan SA, Loukas A, Sirica AE, et al. Cholangiocarcinoma. Nat Rev Dis Primers. (2021) 7(1):65. doi: 10.1038/s41572-021-00300-2

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Clements O, Eliahoo J, Kim JU, Taylor-Robinson SD, Khan SA. Risk factors for intrahepatic and extrahepatic cholangiocarcinoma: A systematic review and meta-analysis. J Hepatol (2020) 72(1):95–103. doi: 10.1016/j.jhep.2019.09.007

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Valle JW, Kelley RK, Nervi B, Oh DY, Zhu AX. Biliary tract cancer. Lancet (2021) 397(10272):428–44. doi: 10.1016/S0140-6736(21)00153-7

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Montal R, Sia D, Montironi C, Leow WQ, Esteban-Fabró R, Pinyol R, et al. Molecular classification and therapeutic targets in extrahepatic cholangiocarcinoma. J Hepatol (2020) 73(2):315–27. doi: 10.1016/j.jhep.2020.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Sidey-Gibbons JAM, Sidey-Gibbons CJ. Machine learning in medicine: A practical introduction. BMC Med Res Methodol (2019) 19(1):64. doi: 10.1186/s12874-019-0681-4

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature (2017) 542(7639):115–8. doi: 10.1038/nature21056

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Anderson JP, Parikh JR, Shenfeld DK, Ivanov V, Marks C, Church BW, et al. Reverse engineering and evaluation of prediction models for progression to type 2 diabetes: An application of machine learning using electronic health records. J Diabetes Sci Technol (2015) 10(1):6–18. doi: 10.1177/1932296815620200

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Jardillier R, Koca D, Chatelain F, Guyon L. Optimal microRNA sequencing depth to predict cancer patient survival with random forest and cox models. Genes (Basel). (2022) 13(12):2275. doi: 10.3390/genes13122275

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Huang S, Cai N, Pacheco PP, Narrandes S, Wang Y, Xu W. Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics Proteomics. (2018) 15(1):41–51. doi: 10.21873/cgp.20063

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Yin M, Lin J, Liu L, Gao J, Xu W, Yu C, et al. Development of a deep learning model for malignant small bowel tumors survival: A SEER-based study. Diagnostics (Basel) (2022) 12(5):1247. doi: 10.3390/diagnostics12051247

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wu G, Zheng R, Tian Y, Liu D. Joint ranking SVM and binary relevance with robust low-rank learning for multi-label classification. Neural Netw (2020) 122:24–39. doi: 10.1016/j.neunet.2019.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Lorenzo D, Ochoa M, Piulats JM, Gutiérrez C, Arias L, Català J, et al. Prognostic factors and decision tree for long-term survival in metastatic uveal melanoma. Cancer Res Treat (2018) 50(4):1130–9. doi: 10.4143/crt.2017.171

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Park J, Hwang IC, Yoon YE, Park JB, Park JH, Cho GY. Predicting long-term mortality in patients with acute heart failure by using machine learning. J Card Fail (2022) 28(7):1078–87. doi: 10.1016/j.cardfail.2022.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Akai H, Yasaka K, Kunimatsu A, Nojima M, Kokudo T, Kokudo N, et al. Predicting prognosis of resected hepatocellular carcinoma by radiomics analysis with random survival forest. Diagn Interv Imaging. (2018) 99(10):643–51. doi: 10.1016/j.diii.2018.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Yang CH, Chen YS, Moi SH, Chen JB, Wang L, Chuang LY. Machine learning approaches for the mortality risk assessment of patients undergoing hemodialysis. Ther Adv Chronic Dis (2022) 13:20406223221119617. doi: 10.1177/20406223221119617

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Huang C, Dai Y, Chen Q, Chen H, Lin Y, Wu J, et al. Development and validation of a deep learning model to predict survival of patients with esophageal cancer. Front Oncol (2022) 12:971190. doi: 10.3389/fonc.2022.971190

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Katzman JL, Shaham U, Cloninger A, Bates J, Jiang T, Kluger Y. DeepSurv: Personalized treatment recommender system using a cox proportional hazards deep neural network. BMC Med Res Methodol (2018) 18(1):24. doi: 10.1186/s12874-018-0482-1

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Rodrigues PM, Olaizola P, Paiva NA, Olaizola I, Agirre-Lizaso A, Landa A, et al. Pathogenesis of cholangiocarcinoma. Annu Rev Pathol (2021) 16:433–63. doi: 10.1146/annurev-pathol-030220-020455

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Banales JM, Cardinale V, Carpino G, Marzioni M, Andersen JB, Invernizzi P, et al. Expert consensus document: Cholangiocarcinoma: current knowledge and future perspectives consensus statement from the European network for the study of cholangiocarcinoma (ENS-CCA). Nat Rev Gastroenterol Hepatol (2016) 13(5):261–80. doi: 10.1038/nrgastro.2016.51

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Banales JM, Marin JJG, Lamarca A, Rodrigues PM, Khan SA, Roberts LR, et al. Cholangiocarcinoma 2020: The next horizon in mechanisms and management. Nat Rev Gastroenterol Hepatol (2020) 17(9):557–88. doi: 10.1038/s41575-020-0310-z

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Valle JW, Borbath I, Khan SA, Huguet F, Gruenberger T, Arnold D. Biliary cancer: ESMO clinical practice guidelines for diagnosis, treatment and follow-up. Ann Oncol (2016) 27(suppl 5):v28–37. doi: 10.1093/annonc/mdw324

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Rizvi S, Khan SA, Hallemeier CL, Kelley RK, Gores GJ. Cholangiocarcinoma - evolving concepts and therapeutic strategies. Nat Rev Clin Oncol (2018) 15(2):95–111. doi: 10.1038/nrclinonc.2017.157

PubMed Abstract | CrossRef Full Text | Google Scholar

25. DeOliveira ML, Cunningham SC, Cameron JL, Kamangar F, Winter JM, Lillemoe KD, et al. Cholangiocarcinoma: Thirty-one-year experience with 564 patients at a single institution. Ann Surg (2007) 245(5):755–62. doi: 10.1097/01.sla.0000251366.62632.d3

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Li H, Long J, Xie F, Kang K, Shi Y, Xu W, et al. Transcriptomic analysis and identification of prognostic biomarkers in cholangiocarcinoma. Oncol Rep (2019) 42(5):1833–42. doi: 10.3892/or.2019.7318

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Wall DP, Kosmicki J, Deluca TF, Harstad E, Fusaro VA. Use of machine learning to shorten observation-based screening and diagnosis of autism. Transl Psychiatry (2012) 2(4):e100. doi: 10.1038/tp.2012.10

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Amin MB, Greene FL, Edge SB, Compton CC, Gershenwald JE, Brookland RK, et al. The eighth edition AJCC cancer staging manual: Continuing to build a bridge from a population-based to a more "personalized" approach to cancer staging. CA Cancer J Clin (2017) 67(2):93–9. doi: 10.3322/caac.21388

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Forner A, Vidili G, Rengo M, Bujanda L, Ponz-Sarvisé M, Lamarca A. Clinical presentation, diagnosis and staging of cholangiocarcinoma. Liver Int (2019) 39 Suppl 1:98–107. doi: 10.1111/liv.14086

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Spolverato G, Bagante F, Weiss M, Alexandrescu S, Marques HP, Aldrighetti L, et al. Comparative performances of the 7th and the 8th editions of the American joint committee on cancer staging systems for intrahepatic cholangiocarcinoma. J Surg Oncol (2017) 115(6):696–703. doi: 10.1002/jso.24569

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Kang SH, Hwang S, Lee YJ, Kim KH, Ahn CS, Moon DB, et al. Prognostic comparison of the 7th and 8th editions of the American joint committee on cancer staging system for intrahepatic cholangiocarcinoma. J Hepatobiliary Pancreat Sci (2018) 25(4):240–8. doi: 10.1002/jhbp.543

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Sapisochin G, Rodríguez de Lope C, Gastaca M, Ortiz de Urbina J, Suarez MA, Santoyo J, et al. "Very early" intrahepatic cholangiocarcinoma in cirrhotic patients: Should liver transplantation be reconsidered in these patients? Am J Transplant (2014) 14(3):660–7. doi: 10.1111/ajt.12591

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Chaiteerakij R, Harmsen WS, Marrero CR, Aboelsoud MM, Ndzengue A, Kaiya J, et al. A new clinically based staging system for perihilar cholangiocarcinoma. Am J Gastroenterol (2014) 109(12):1881–90. doi: 10.1038/ajg.2014.327

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Mantzorou M, Koutelidakis A, Theocharis S, Giaginis C. Clinical value of nutritional status in cancer: What is its impact and how it affects disease progression and prognosis? Nutr Cancer (2017) 69(8):1151–76. doi: 10.1080/01635581.2017.1367947

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Arends J, Bachmann P, Baracos V, Barthelemy N, Bertz H, Bozzetti F, et al. ESPEN guidelines on nutrition in cancer patients. Clin Nutr (2017) 36(1):11–48. doi: 10.1016/j.clnu.2016.07.015

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hanahan D, Weinberg RA. Hallmarks of cancer: The next generation. Cell (2011) 144(5):646–74. doi: 10.1016/j.cell.2011.02.013

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Toiyama Y, Yasuda H, Ohi M, Yoshiyama S, Araki T, Tanaka K, et al. Clinical impact of preoperative albumin to globulin ratio in gastric cancer patients with curative intent. Am J Surg (2017) 213(1):120–6. doi: 10.1016/j.amjsurg.2016.05.012

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Liu X, Meng QH, Ye Y, Hildebrandt MA, Gu J, Wu X. Prognostic significance of pretreatment serum levels of albumin, LDH and total bilirubin in patients with non-metastatic breast cancer. Carcinogenesis (2015) 36(2):243–8. doi: 10.1093/carcin/bgu247

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Zhang F, Wang Y, Sun P, Wang ZQ, Wang DS, Zhang DS, et al. Fibrinogen promotes malignant biological tumor behavior involving epithelial-mesenchymal transition via the p-AKT/p-mTOR pathway in esophageal squamous cell carcinoma. J Cancer Res Clin Oncol (2017) 143(12):2413–24. doi: 10.1007/s00432-017-2493-4

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Shu YJ, Weng H, Bao RF, Wu XS, Ding Q, Cao Y, et al. Clinical and prognostic significance of preoperative plasma hyperfibrinogenemia in gallbladder cancer patients following surgical resection: A retrospective and in vitro study. BMC Cancer. (2014) 14:566. doi: 10.1186/1471-2407-14-566

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Dai T, Deng M, Ye L, Liu R, Lin G, Chen X, et al. Prognostic value of combined preoperative gamma-glutamyl transpeptidase to platelet ratio and fibrinogen in patients with HBV-related hepatocellular carcinoma after hepatectomy. Am J Transl Res (2020) 12(6):2984–97.

PubMed Abstract | Google Scholar

42. Li S, Zhang X, Lou C, Gu Y, Zhao J. Preoperative peripheral blood inflammatory markers especially the fibrinogen-to-lymphocyte ratio and novel FLR-n score predict the prognosis of patients with early-stage resectable extrahepatic cholangiocarcinoma. Front Oncol (2022) 12:1003845. doi: 10.3389/fonc.2022.1003845

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Chen S, Yan H, Du J, Li J, Shen B, Ying H, et al. Prognostic significance of pre-resection albumin/fibrinogen ratio in patients with non-small cell lung cancer: A propensity score matching analysis. Clin Chim Acta (2018) 482:203–8. doi: 10.1016/j.cca.2018.04.012

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Zou YX, Qiao J, Zhu HY, Lu RN, Xia Y, Cao L, et al. Albumin-to-Fibrinogen ratio as an independent prognostic parameter in untreated chronic lymphocytic leukemia: A retrospective study of 191 cases. Cancer Res Treat (2019) 51(2):664–71. doi: 10.4143/crt.2018.358

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Hwang KT, Chung JK, Roh EY, Kim J, Oh S, Kim YA, et al. Prognostic influence of preoperative fibrinogen to albumin ratio for breast cancer. J Breast Cancer. (2017) 20(3):254–63. doi: 10.4048/jbc.2017.20.3.254

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Durie BGM, Hoering A, Sexton R, Abidi MH, Epstein J, Rajkumar SV, et al. Longer term follow-up of the randomized phase III trial SWOG S0777: Bortezomib, lenalidomide and dexamethasone vs. lenalidomide and dexamethasone in patients (Pts) with previously untreated multiple myeloma without an intent for immediate autologous stem cell transplant (ASCT). Blood Cancer J (2020) 10(5):53. doi: 10.1038/s41408-020-0311-8

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Lin J, Cao Y, Yang X, Li G, Shi Y, Wang D, et al. Mutational spectrum and precision oncology for biliary tract carcinoma. Theranostics (2021) 11(10):4585–98. doi: 10.7150/thno.56539

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: distal cholangiocarcinoma, DeepSurv, AFR, machine learning, risk stratification, individualized treatment, post-operative chemotherapy

Citation: Wang D, Pan B, Huang J-C, Chen Q, Cui S-P, Lang R and Lyu S-C (2023) Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma. Front. Oncol. 13:1106029. doi: 10.3389/fonc.2023.1106029

Received: 23 November 2022; Accepted: 27 February 2023;
Published: 15 March 2023.

Edited by:

Wenqing Cao, New York University, United States

Reviewed by:

Junyu Long, Peking Union Medical College Hospital (CAMS), China
Xing Niu, China Medical University, China

Copyright © 2023 Wang, Pan, Huang, Chen, Cui, Lang and Lyu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ren Lang, ZHJfbGFuZ3JlbkAxMjYuY29t; Shao-Cheng Lyu, c2hhb2NoZW5nMDUwMkAxNjMuY29t

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.