Construction of a risk prediction model for postoperative deep vein thrombosis in colorectal cancer patients based on machine learning algorithms

Liu, Xin; Shu, Xingming; Zhou, Yejiang; Jiang, Yifan

doi:10.3389/fonc.2024.1499794

ORIGINAL RESEARCH article

Front. Oncol., 27 November 2024

Sec. Gastrointestinal Cancers: Colorectal Cancer

Volume 14 - 2024 | https://doi.org/10.3389/fonc.2024.1499794

This article is part of the Research Topic: Application of Bioinformatics, Machine Learning, and Artificial Intelligence to Improve Diagnosis, Prognosis and Treatment of Cancer

Xin Liu¹

Xingming Shu¹

Yejiang Zhou²

Yifan Jiang²*

¹Department of Clinical Medicine, Southwest Medical University, Luzhou, China
²Department of Gastrointestinal Surgery, The Affiliated Hospital of Southwest Medical University, Luzhou, Sichuan, China

Background: Colorectal cancer is a prevalent malignancy of the digestive system, with an increasing incidence. Lower extremity deep vein thrombosis (DVT) is a frequent postoperative complication, occurring in up to 40% of cases.

Objective: This research aims to develop and validate a machine learning (ML) model to predict the risk of lower limb deep vein thrombosis in patients with colorectal cancer, facilitating preventive and therapeutic measures to enhance recovery and ensure safety.

Methods: In this retrospective cohort study, we collected data from 429 colorectal cancer patients from January 2021 to January 2024. The medical records included age, blood test results, body mass index, underlying diseases, clinical staging, histological typing, surgical methods, and postoperative complications. We employed the Synthetic Minority Oversampling Technique to address imbalanced data and split the dataset into training and validation sets in a 7:3 ratio. Feature selection was performed using Random Forest (RF), XGBoost, and Least Absolute Shrinkage and Selection Operator algorithms (LASSO). We then trained six machine learning models: Logistic Regression (LR), Naive Bayes (NB), Gaussian Process (GP), Random Forest, XGBoost, and Multilayer Perceptron (MLP). The model’s performance was evaluated using metrics such as area under the Receiver Operating Characteristic curve, accuracy, sensitivity, specificity, F1 score, and confusion matrix. Additionally, SHAP and LIME were used to enhance the interpretability of the results.

Results: The study combined Random Forest, XGBoost algorithms, and LASSO regression with univariate regression analysis to identify significant predictive factors, including age, preoperative prealbumin, preoperative albumin, preoperative hemoglobin, operation time, PIKVA2, CEA, and preoperative neutrophil count. The XGBoost model outperformed other ML algorithms, achieving an AUC of 0.996, an accuracy of 0.9636, a specificity of 0.9778, and an F1 score of 0.9576. Moreover, the SHAP method identified age and preoperative prealbumin as the primary determinants influencing ML model predictions. Finally, the study employed LIME for more precise prediction and interpretation of individual predictions.

Conclusion: The machine learning algorithms effectively predicted postoperative lower limb deep vein thrombosis in colorectal cancer patients. The XGBoost model demonstrated strong potential for improving early detection and treatment in clinical settings.

1 Introduction

Colorectal cancer is among the most prevalent malignant tumors of the digestive system globally, ranking third in both incidence and mortality rates among malignant tumors (1). Currently, surgical treatment is the primary approach for colorectal cancer. However, postoperative lower limb deep vein thrombosis has consistently been an issue that cannot be overlooked. The literature reports that the incidence of lower limb deep vein thrombosis after abdominal surgery is 15%-19%. Alarmingly, the incidence in colorectal cancer patients post-surgery has been reported to be as high as 40% (2). Additionally, since only 50% of patients with lower limb deep vein thrombosis exhibit symptoms and signs such as swelling and tenderness, many cases are overlooked postoperatively (3). Without timely diagnosis and intervention, the clot may detach and move through the veins to the lungs, leading to a life-threatening pulmonary embolism (4). However, lower limb deep vein thrombosis can be prevented. Research suggests that prophylactic anticoagulant treatment can be suitably applied to bedridden patients in the perioperative phase (5, 6). Currently, the Caprini risk assessment model is the most widely used model in surgery. However, all colorectal cancer patients stratified postoperatively according to the Caprini model are considered high risk. Therefore, the Caprini model may not be a completely accurate indicator for DVT occurrence and intervention in colorectal cancer patients (7).

Additionally, most existing studies utilize traditional statistical methods rather than advanced machine learning algorithms, which often limits the models’ ability to handle nonlinear relationships and multivariable interactions, thereby affecting their predictive performance and applicability (8). The purpose of this study is to integrate these common high-risk factors using machine learning by selecting shared features through three different machine learning algorithms and constructing multiple models to identify the optimal deep vein thrombosis risk prediction model for colorectal cancer patients. This model will assist clinicians in more accurately identifying high-risk patients and providing personalized, precise guidance for the prevention and treatment of deep vein thrombosis.

2 Materials and methods

2.1 Study design

The aim of this research is to develop a machine learning-based model to predict the risk of lower limb deep vein thrombosis in postoperative colorectal cancer patients. A retrospective study was conducted, including 429 colorectal cancer patients who underwent surgical treatment. Data were extracted from the hospital’s electronic medical record system, which included demographic details, medical history, treatment information, disease severity, blood test results, and postoperative complications. The SMOTE algorithm was employed to address the issue of class imbalance. LASSO regression, Xgboost, and random forest were applied for feature selection to identify the features most associated with the risk of lower limb deep vein thrombosis. Following this, a range of ML models, such as LR, RF, GB, MLP, XGB, and KNN, were developed and optimized using the 10-fold cross-validation approach. The performance of these models was assessed through a range of metrics, including accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, Kappa score, AUC, calibration curve, clinical impact curve, and confusion matrix. To enhance the transparency and interpretability of the model, SHAP and LIME methods were used to explain the prediction results, clarifying the impact of each feature on the predictions and thereby offering useful references for clinicians. Figure 1 illustrates the overall workflow of the proposed system more clearly.

Figure 1. Research process.

2.2 Study data

We retrospectively selected 429 colorectal cancer patients who visited the Department of Gastrointestinal Surgery at the First Affiliated Hospital of Southwest Medical University from January 2022 to January 2024. Exclusion criteria include: patients with a history of prolonged bed rest or restricted activity; patients with a history of venous thrombosis; patients with a history of coagulation disorders; patients using drugs affecting coagulation function; patients with malignancies outside the gastrointestinal tract; and patients preoperatively diagnosed with lower extremity deep vein thrombosis. (Exclusion criteria are shown in Figure 1). As this study is retrospective, patients are exempt from providing informed consent according to the ethics review board’s policy. The ethics committee has encrypted all personal information of patients involved in this study to prevent any leaks.

2.3 Study variables

The study includes 44 variables related to demographic factors (gender, age), medical history (history of diabetes, hypertension, coronary artery disease, chronic obstructive pulmonary disease), physical characteristics (BMI), disease severity (clinical stage, histological grade, presence of cancer embolus, nerve invasion, vascular invasion), treatment information (surgical method, surgery duration, use of specific cancer treatments), laboratory values (white blood cell count, neutrophil count, lymphocyte count, monocyte count, neutrophil-to-lymphocyte ratio (NLR), hemoglobin, prealbumin, albumin, creatinine clearance, platelet count, prothrombin time (PT), fibrinogen, thrombin time (TT), D-dimer), and postoperative complications (postoperative high fever, anastomotic leak). Venous blood samples were collected within 24 hours of admission.

2.4 Diagnosis

Patients were tested within 14 days postoperatively according to the diagnostic criteria for lower limb deep vein thrombosis. Specifically, color Doppler ultrasound showed an uneven echo solid mass in the lower limb, reduced or absent color blood flow and spectral signals, non-collapse of the venous lumen after compression, and venous incompressibility (9).

2.5 Data preprocessing

The structured database initially included 44 clinical variables. First, clinical variables with more than 30% missing data (n = 2) were excluded. The remaining missing data were handled using 10-fold cross-validation combined with the KNN imputation method. Subsequently, to prevent bias during later model training and improve interpretability, the Variance Inflation Factor (VIF) was employed to examine multicollinearity among the chosen features, ensuring that all features’ VIF values were less than 10. We also removed variables with nearly zero variance to simplify the model and enhance its robustness. In the end, 39 clinical features of patients were chosen to construct the predictive model. The SMOTE algorithm was used to address the class imbalance issue, balancing the dataset and avoiding bias. Subsequently, patient data were randomly divided into two datasets: (1) a training dataset (70%) for feature selection and model training, and (2) a testing dataset (30%) for model performance evaluation.
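The oversampling step can be illustrated with a minimal sketch of the SMOTE idea: each synthetic minority sample is placed at a random point on the segment joining a minority sample and one of its nearest minority neighbours. The sketch below is written in Python for illustration only; the study itself used the SMOTE function from the DMwR2 package in R, whose settings are not reported here. The function name `smote`, the toy rows, and the choice of `k` are assumptions and are not taken from the study data.

```python
import random

def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between a minority row
    and one of its k nearest minority neighbours (Euclidean distance)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority rows by squared distance to the base row.
        others = [row for row in minority if row is not base]
        others.sort(key=lambda row: sum((a - b) ** 2 for a, b in zip(base, row)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic

# Hypothetical toy rows in the form [age, prealbumin]; not study data.
minority_rows = [[70.0, 150.0], [74.0, 140.0], [68.0, 160.0], [80.0, 120.0]]
new_rows = smote(minority_rows, n_new=6, k=2)
```

Because every synthetic row is a convex combination of two observed rows, its feature values always stay within the range observed in the minority class.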

2.6 Feature selection

For predicting postoperative DVT occurrence in colorectal cancer patients, features were selected using training group samples through three machine learning models: LASSO regression, random forest, and XGBoost. The three models selected 29, 15, and 15 features, respectively. Ultimately, we selected 8 common feature variables from the three models: age, preoperative prealbumin, preoperative albumin, preoperative hemoglobin, CEA, PIKVA2, surgery time, and preoperative white blood cell count.
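Taking the features common to the three selectors amounts to a set intersection. The following Python sketch shows the operation on shortened, purely illustrative lists; the variable names and list contents are hypothetical and do not reproduce the 29, 15, and 15 features actually selected.

```python
def common_features(*selections):
    """Return the features present in every selection, sorted by name."""
    common = set(selections[0])
    for selected in selections[1:]:
        common &= set(selected)
    return sorted(common)

# Hypothetical, abbreviated selections for illustration only.
lasso_selected = ["age", "prealbumin", "albumin", "diabetes"]
rf_selected = ["age", "prealbumin", "albumin", "CA724"]
xgb_selected = ["age", "prealbumin", "albumin", "BMI"]

shared = common_features(lasso_selected, rf_selected, xgb_selected)
```

Requiring agreement among all three methods is a conservative choice: a feature is retained only if a linear penalized model and two tree-based models all rank it as informative.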

2.7 Model development and evaluation

The machine learning task is to predict the probability distribution of patients developing lower extremity deep vein thrombosis based on these clinical variables. Model development involves experimenting with six machine learning algorithms: Logistic Regression (LR), Multilayer Perceptron (MLP), Extreme Gradient Boosting (XGBoost), Gaussian Process (GP), Random Forest (RF), and Naive Bayes (NB). During the training phase, we employed the 10-fold cross-validation method to train the models in order to achieve optimal predictive performance. To evaluate the predictive performance of each model, we primarily measured the Receiver Operating Characteristic (ROC) curve. In addition, we calculated sensitivity, specificity, accuracy, false positive (FP) rate, positive predictive value (PPV), negative predictive value (NPV), Brier score, F1 score, Decision Curve Analysis (DCA) curve, calibration curve, and Clinical Impact Curve (CIC) for a comprehensive assessment of the model’s performance.
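As a rough illustration of the 10-fold cross-validation scheme, the Python sketch below partitions sample indices into ten folds and yields, for each fold, the indices used for fitting and for validation. The function name `k_fold_indices` and the random seed are assumptions; the study performed this step in R, and whether stratification was used is not stated. The sample count of 796 matches the training group size reported in the Results.

```python
import random

def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_indices, validation_indices) for each of k folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of nearly equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, validation

splits = list(k_fold_indices(796, k=10))
```

Each sample appears in exactly one validation fold, so every observation contributes once to the out-of-fold performance estimate used for parameter tuning.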

2.8 Statistical analysis

All data analyses in this study were carried out using SPSS (27.0) and R language (version 4.3.3). Preliminary analysis of the dataset used descriptive statistics. Data points that followed a normal distribution were represented by mean ± standard deviation, whereas data points deviating from a normal distribution were shown as median (interquartile range). Subsequently, an independent samples t-test was employed to compare two groups of normally distributed data. In contrast, the Mann-Whitney U test was used for comparing two groups of non-normally distributed data. We resolved the sample imbalance problem by oversampling the minority classes using the SMOTE function from the DMwR2 package in R. To build the predictive model, the dataset was randomly split into a training subset comprising 70% of the total data and a testing subset making up 30% of the total data. Subsequently, various machine learning methods were executed using R, including logistic regression (glm package), Gaussian model (e1071 package), random forest (randomForest package), XGBoost (XGBoost package), feedforward neural network (nnet package), and naive Bayes model (e1071 package). Models were trained using the training subset data with these six ML algorithms. During the model training, a 10-fold cross-validation method was adopted to optimize the model parameters, aiming to prevent overfitting. Statistical significance was defined at the level of P<0.05.

2.9 Feature interpretation

We used the Shapley Additive Explanations (SHAP) algorithm and the Local Interpretable Model-Agnostic Explanations (LIME) algorithm to interpret the main feature contributions after machine learning model training. In particular, the SHAP algorithm assesses the average contribution of each feature value by computing its Shapley value within all possible combinations of features. By taking the weighted average of each feature value’s Shapley value, we can assess the impact of that feature on the overall prediction. Meanwhile, the LIME algorithm analyzes the model from a local perspective to explain the feature importance of specific predictions, providing an additional layer of interpretation and transparency. The combination of these two methods provides us with a multidimensional understanding of model interpretability.
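The Shapley value underlying SHAP can be computed exactly for a small number of features by averaging each feature's marginal contribution over all coalitions of the other features. The Python sketch below does this for a hypothetical three-feature value function with one interaction term; `toy_value` and its coefficients are invented for illustration and are not the fitted XGBoost model. Practical SHAP implementations use faster, model-specific approximations rather than this brute-force enumeration.

```python
from itertools import combinations
from math import factorial

def shapley_values(value_fn, features):
    """Exact Shapley values: weighted average marginal contribution of each
    feature over all subsets of the remaining features."""
    n = len(features)
    phi = {}
    for f in features:
        rest = [g for g in features if g != f]
        total = 0.0
        for size in range(n):
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(rest, size):
                coalition = set(subset)
                total += weight * (value_fn(coalition | {f}) - value_fn(coalition))
        phi[f] = total
    return phi

def toy_value(coalition):
    """Hypothetical risk score contributed by the features in the coalition."""
    score = 0.0
    if "age" in coalition:
        score += 0.30
    if "prealbumin" in coalition:
        score += 0.20
    if "age" in coalition and "prealbumin" in coalition:
        score += 0.10  # interaction term, shared equally by both features
    if "hemoglobin" in coalition:
        score += 0.05
    return score

phi = shapley_values(toy_value, ["age", "prealbumin", "hemoglobin"])
```

In this toy case the interaction of 0.10 is split evenly, giving 0.35 for age, 0.25 for prealbumin, and 0.05 for hemoglobin; the values sum to the score of the full feature set, which is the additivity property that SHAP summary plots rely on.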

3 Results

3.1 Characteristics of patients

This study encompassed 429 colorectal cancer patients who underwent surgical treatment. The median age of the patients was 67 years (range: 16-91), with 258 males (60.24%) and 171 females (39.76%). The original data from 429 cases includes 267 cases without lower extremity deep vein thrombosis (62.23%) and 162 cases with lower extremity deep vein thrombosis (37.77%). The baseline characteristics comparison of the two patient groups in the original data reveals that age, preoperative white blood cell count, preoperative lymphocyte count, preoperative hemoglobin, preoperative albumin, preoperative prealbumin count, preoperative glomerular filtration rate, gender, preoperative acute complete intestinal obstruction, and surgical method are all statistically significant (refer to Table 1).

Table 1. Raw data in Three-Baseline table.

3.2 Prediction factor screening

After class balancing with SMOTE, the dataset comprised 1134 samples from colorectal cancer patients who received surgical treatment. These were split into a training group of 796 cases and a test group of 338 cases in a 7:3 ratio. LASSO regression, as a shrinkage estimation method, achieves variable selection and complexity adjustment by formulating an optimization objective function with a penalty term. This study utilized LASSO regression to identify features including age, surgical procedure, acute intestinal obstruction, nerve invasion, preoperative lymphocyte count, preoperative fibrinogen, preoperative prothrombin time, coronary artery disease, and diabetes (Figure 2A). Random forest builds multiple decision trees through the random selection of data subsets and features. Each feature’s importance score reflects its contribution to the model’s predictions, allowing the extraction of the most predictive features and the identification of characteristic factors. Features including age, preoperative prealbumin, preoperative albumin, preoperative hemoglobin, CA724, CEA, and CA242 were selected (Figure 2B). XGBoost improves prediction performance by constructing multiple weak learners and using an additive model approach. The importance of features is assessed by calculating gain, coverage, and frequency for each one, identifying factors like age, preoperative prealbumin, preoperative white blood cell count, preoperative hemoglobin, preoperative glomerular filtration rate, BMI, and preoperative prothrombin time (Figure 2C). By comparing the selection results of LASSO regression, the XGBoost algorithm, and the random forest algorithm, we identified the common subset of features selected by these three methods. These selected features were eventually used to construct the model, including age, preoperative prealbumin, preoperative albumin, preoperative hemoglobin, operation time, PIKVA2, CEA, and preoperative neutrophil count (Figure 2D).
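How a penalty term leads to variable selection can be seen in a small worked example. The Python sketch below fits a LASSO by cyclic coordinate descent for a linear model with squared-error loss, which is a simplification: the study's outcome is binary and its LASSO implementation is not described, so this is not the authors' procedure. The function names, the toy data, and the penalty value `lam=0.1` are assumptions chosen so that the weakly related second feature is shrunk exactly to zero.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become exactly 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_iter=100):
    """Minimise (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 (no intercept;
    the toy data below are centred)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0      # correlation of feature j with the partial residual
            col_sq = 0.0   # squared norm of feature j
            for i in range(n):
                partial = sum(X[i][k] * beta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - partial)
                col_sq += X[i][j] ** 2
            beta[j] = soft_threshold(rho / n, lam) / (col_sq / n)
    return beta

# Hypothetical centred data: y depends strongly on column 0, barely on column 1.
X = [[-1.5, 1.0], [-0.5, -1.0], [0.5, -1.0], [1.5, 1.0]]
y = [-2.95, -1.05, 0.95, 3.05]
beta = lasso_coordinate_descent(X, y, lam=0.1)
```

The soft-thresholding step is what distinguishes LASSO from ridge-type shrinkage: coefficients whose signal falls below the penalty are set exactly to zero and the corresponding features drop out of the model.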

Figure 2. (A) AUC curve, path diagram, and importance ranking of selected feature variables from univariate combined with LASSO regression. 1. Penalization process of variables in LASSO. 2. Evaluation of predictive performance of LASSO model in testing set. 3. Feature importance ranking in LASSO model. (B) AUC curve, OOB plot, and importance ranking of selected feature variables from random forest. 1. Evaluation of predictive performance of RF model in testing set. 2. Feature importance ranking in RF model. 3. Relationship between number of trees and OOB (Out-of-Bag) error. (C) AUC curve, feature importance ranking, and SHAP visualization for XGBOOST model evaluation. 1. Evaluation of predictive performance of XGBOOST model in testing set. 2. Feature importance ranking in XGBOOST model. 3. SHAP value visualization for XGBOOST variables. (D) Eight common feature variables selected by three predictive models.

3.3 Model performance

In the training dataset, the RF model demonstrated excellent predictive performance with an AUC of 1.00, indicating very high prediction accuracy. In comparison, the AUC values for the remaining five models are as follows: XGB’s AUC is 0.996 (95% CI [0.994, 0.999]), GP’s AUC is 0.950 (95% CI [0.935, 0.966]), MLP’s AUC is 0.938 (95% CI [0.918, 0.958]), NB’s AUC is 0.882 (95% CI [0.859, 0.905]), and LR’s AUC is 0.814 (95% CI [0.785, 0.844]) (Figure 3A). The F1 scores of these models are as follows: RF 1.0, XGB 0.976, GP 0.878, MLP 0.889, NB 0.740, LR 0.720. In the testing dataset, the AUC values for XGB, GP, MLP, NB, LR, and RF are 0.936 (95% CI [0.907, 0.966]), 0.919 (95% CI [0.890, 0.949]), 0.884 (95% CI [0.843, 0.925]), 0.826 (95% CI [0.781, 0.871]), 0.806 (95% CI [0.760, 0.853]), and 0.973 (95% CI [0.959, 0.986]), respectively (Figure 3B). The F1 scores for XGB, GP, MLP, NB, LR, and RF are respectively 0.853, 0.816, 0.825, 0.693, 0.696, and 0.881. In this research, the accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and kappa value of each model were computed and compared (Figures 3C, D). The RF model performed excellently in the training dataset. Due to concerns about potential overfitting, the XGB model was ultimately selected as the optimal model.
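For readers less familiar with these metrics, the Python sketch below shows one way the AUC and F1 score can be computed from labels and predicted probabilities. The AUC is obtained as the proportion of positive-negative pairs in which the positive case receives the higher score (ties count one half). The function names, the toy labels and scores, and the 0.5 classification threshold are assumptions for illustration and are unrelated to the study data.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    concordant = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                concordant += 1.0
            elif p == n:
                concordant += 0.5
    return concordant / (len(positives) * len(negatives))

def f1_score(labels, scores, threshold=0.5):
    """F1 score after classifying scores >= threshold as positive."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    return 2 * tp / (2 * tp + fp + fn)

# Hypothetical labels (1 = DVT) and predicted probabilities.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
toy_auc = roc_auc(toy_labels, toy_scores)
toy_f1 = f1_score(toy_labels, toy_scores)
```

The AUC is threshold-free, whereas the F1 score depends on the chosen cut-off, which is one reason the two metrics can rank models differently.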

Figure 3. (A) Comparison of model AUC values in the training set. (B) Comparison of model AUC values in the testing set. (C) Comparison of F1 score, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and kappa value in the training set. (D) Comparison of F1 score, accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and kappa value in the testing set.

3.4 Model performance evaluation

In our study, we evaluated the predictive accuracy and calibration of the model by analyzing calibration curves for the training and test sets. The calibration curve results showed that the model in the training set had high predictive accuracy, with a Somers’ D coefficient of 0.992 and an area under the ROC curve of 0.996, indicating good discriminatory power (Figure 4A). Additionally, the regression calibration slope of the training set model is 0.9934, close to the ideal value of 1.000, and the intercept is -0.0175, demonstrating excellent calibration ability. The Brier score is 0.038, reflecting the high reliability of the model’s predictions. In contrast, the model’s discriminatory power in the test set decreased but still maintained a high level, with an area under the ROC curve of 0.936 and a Somers’ D coefficient of 0.873 (Figure 4B). Decision curves for the training set (Figure 4C) indicate that the model’s net benefit is significantly above the baseline strategy. On the test set (Figure 4D), the model likewise exhibits good net benefit, particularly in the threshold probability range of 0.1 to 0.95, where it maintains a high level of net benefit. The confusion matrix results show the performance differences of the model across different datasets. In the training set (Figure 4E), the model correctly identified 440 true negatives and 327 true positives, with 10 false positives and 19 false negatives, corresponding to a true positive rate of 94.5% (327/346) and a true negative rate of 97.8% (440/450). In the test set (Figure 4F), the model correctly identified 119 true negatives and 178 true positives, with 20 false positives and 21 false negatives, and a reported true positive rate of 85.0% and true negative rate of 89.7%. During the model development process, we considered applying a penalty to the confusion matrix to reduce Type II errors (false negatives).
Specifically, we explored methods such as adjusting the classification threshold and using weighted loss functions to impose a higher penalty on false negatives during model training. However, after several experiments, we found that while these adjustments could reduce false negatives, they also led to an increase in false positives, which in turn affected the overall performance metrics of the model (such as AUC and accuracy). Therefore, we ultimately decided not to apply such penalties to maintain the overall balanced performance of the model. Finally, we plotted Clinical Impact Curves (CICs) to evaluate the net benefit of the model with the highest diagnostic value in terms of clinical utility and applicability. Clinical Impact Curves (Figures 4G, H) offer insights into the model’s capability to predict high-risk patients at various cost-benefit ratio thresholds. The test set’s curve indicates that when prediction score probabilities exceed 65%, the model’s predictions for postoperative colorectal cancer patients align closely with those who actually develop lower extremity deep vein thrombosis, confirming the model’s high clinical efficacy.
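The quantities discussed in this section are linked by simple formulas, sketched below in Python. The confusion-matrix counts are those reported in the text for Figures 4E and 4F; the function names are illustrative, and the threshold of 0.5 used in the net-benefit call is an assumption, since the classification threshold behind the confusion matrices is not stated. Rates recomputed from the counts in this way can be used to cross-check the percentages quoted in the text.

```python
def confusion_metrics(tp, fp, tn, fn):
    """Standard rates derived from the four cells of a 2x2 confusion matrix."""
    total = tp + fp + tn + fn
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / total,
        "f1": 2 * tp / (2 * tp + fp + fn),
    }

def net_benefit(tp, fp, n, threshold):
    """Decision-curve net benefit of acting on predictions at `threshold`:
    true positives per patient minus false positives weighted by the odds."""
    return tp / n - (fp / n) * threshold / (1 - threshold)

def somers_d(auc):
    """For a binary outcome, Somers' D equals 2 * AUC - 1."""
    return 2 * auc - 1

# Counts as reported in the text for the XGBoost model.
train = confusion_metrics(tp=327, fp=10, tn=440, fn=19)
test = confusion_metrics(tp=178, fp=20, tn=119, fn=21)

# Assumed threshold of 0.5, for illustration only.
test_net_benefit = net_benefit(tp=178, fp=20, n=338, threshold=0.5)
train_somers_d = somers_d(0.996)
```

With these counts, the training-set accuracy and specificity evaluate to about 0.9636 and 0.9778, and the Somers' D implied by an AUC of 0.996 is 0.992, in line with the values reported. Raising the penalty on false negatives, as the authors explored, corresponds to lowering the threshold, which increases the weight of true positives relative to false positives in the net-benefit expression.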

Figure 4. (A) XGBOOST model calibration curve in the training set. (B) XGBOOST model calibration curve in the testing set. (C) XGBOOST model clinical decision curve in the training set. (D) XGBOOST model clinical decision curve in the testing set. (E) XGBOOST model confusion matrix in the training set. (F) XGBOOST model confusion matrix in the testing set. (G) XGBOOST model Clinical Impact Curve (CIC) in the training set. (H) XGBOOST model Clinical Impact Curve (CIC) in the testing set.

3.5 Model-based interpretability analysis

This study evaluated the relative importance of various factors affecting the susceptibility of colorectal cancer patients to developing lower extremity deep vein thrombosis post-surgery. Figure 5A visually represents this ranking, with each point indicating a sample and the color gradient from purple to yellow indicating the magnitude of the sample's feature value. The vertical axis shows the importance ranking of features alongside the correlation and distribution of feature values with SHAP values. Figure 5B illustrates the hierarchical significance of features in the XGB model. The vertical axis shows individual features ranked in descending order of importance, and the horizontal axis represents the average SHAP values. The analysis shows that age, preoperative albumin, preoperative white blood cell count, surgery duration, and preoperative hemoglobin are the top five ranked features in terms of importance, indicating their critical impact on the occurrence of DVT. To better understand the model’s decision-making process at the individual level, we performed detailed interpretability analyses using LIME on two representative samples (as illustrated in Figures 5C, D). Through model visualization, we can discern the impact of each feature on the model predictions for these specific instances.

Figure 5. (A) SHAP interpretability analysis. The color gradient from purple to yellow represents the magnitude of the sample feature values. The vertical axis displays the importance ranking of features, along with the correlation and distribution of feature values with SHAP values. (B) Hierarchical importance ranking of features in the XGBOOST model. (C, D) Detailed interpretability analysis of two representative samples using LIME.

4 Discussion

The migration of deep vein thrombosis from the lower extremities into the pulmonary artery through the circulatory system is a major trigger for fatal pulmonary embolism (10). The differences in disease onset and progression characteristics across various specialties result in varying incidence rates of lower extremity DVT (11). Literature reports indicate that the incidence of lower extremity deep vein thrombosis in colorectal cancer patients post-surgery is 40% (2). At present, there is a lack of effective evidence-based research on the risk factors, clinical characteristics, and targeted prevention and treatment measures for lower extremity DVT following gastrointestinal surgery. The American College of Chest Physicians Guidelines define cancer surgery as a high-risk factor for venous thromboembolism and recommend the use of intermittent pneumatic compression and certain medications (such as low molecular weight heparin, low-dose unfractionated heparin, and Xa inhibitors) to prevent the occurrence of venous thromboembolism (7). Caprini, Geneva, and RAPT scores are commonly used tools for assessing DVT, but they are limited in their applicability to colorectal cancer patients. The Caprini assessment rates all colorectal cancer patients undergoing abdominal surgery as high-risk; current risk assessment models are therefore insufficient to identify patients truly at risk of DVT post-surgery. Many studies have examined the risk factors for postoperative DVT in colorectal cancer patients, such as open surgery, age, D-dimer, pulmonary disease, and hemoglobin (12, 13). Although many risk factors have been identified, the available assessment systems are still limited and unable to accurately predict the occurrence of postoperative DVT.

With the continuous advancement of surgical techniques for colorectal cancer, the differences in intraoperative factors are becoming less apparent. Therefore, we aim to develop a preoperative risk assessment tool similar to the Caprini score to facilitate early diagnosis and prevention of postoperative DVT in colorectal cancer patients.

Traditional approaches to identifying risk factors usually depend on developing risk models through univariate or multivariate regression, yet these models often ignore the interactions among variables and nonlinear relationships. In contrast, machine learning models are flexible enough to handle nonlinear and complex data structures, and can effectively address the challenges of high dimensional data and missing values. By training models on large datasets and continuously optimizing their performance, they improve prediction and classification accuracy (14–18). The SHAP algorithm utilizes the Shapley value concept from game theory, calculating the average contribution of each feature to the prediction. This approach enables us to thoroughly quantify each feature’s influence on the model’s overall predictions, thus providing a deeper understanding of the model’s workings (19). On the other hand, the LIME algorithm provides localized and transparent explanations by analyzing the feature importance of individual predictions. This local interpretability allows us to understand the reasons behind specific predictions in detail (20). The combination of these two approaches provides us a multidimensional model interpretation framework, capable of capturing global feature impacts and providing thorough insights into specific predictions.

In this study, we first used three machine learning models to select features for a DVT prediction model in postoperative colorectal cancer patients. LASSO, XGBoost, and Random Forest selected 29, 15, and 15 features, respectively. In the end, we selected the 8 feature variables common to the three models. During the feature selection process, we adopted a model-based feature selection method. This approach selects the most relevant features by evaluating each feature’s contribution to the model’s performance. Specifically, we employed LASSO regression, XGBoost, and Random Forest, which effectively handle high-dimensional data and identify the features that most significantly impact the prediction results. Existing studies have shown that feature selection plays an important role in cancer prediction models; for example, Sun Tao employed LASSO regression combined with the Boruta algorithm for feature selection, thereby enhancing the accuracy of predicting the risk of pulmonary infection in lung cancer patients post-chemotherapy (21). The ROC curves constructed from these features indicate that the AUC values for XGBoost and Random Forest are both greater than 0.900, and the AUC value for LASSO regression is 0.823. The findings indicate that the LASSO, XGBoost, and Random Forest models have high clinical value in predicting postoperative DVT occurrence in colorectal cancer patients. In contrast, in the research conducted by Xiuying L et al. (22), the DVT model developed through the Caprini Risk Assessment Model exhibited an AUC value of merely 0.701, with a sensitivity of 80.6% and specificity of 56.3%.
These comparative results highlight the superiority of the machine learning models in this study, providing powerful tools for accurately predicting postoperative DVT in colorectal cancer patients and indicating that machine learning technology has high potential for application in clinical research. We utilized six machine learning models to build and compare prediction models, from which we selected the optimal model. Through comparison, we found that the XGBoost model has extremely high prediction accuracy, with an area under the ROC curve larger than 0.99. Additionally, the internally validated DCA and calibration curve confirmed the model’s consistency in net clinical benefit and prediction probability, indicating its high predictive value. The literature has shown that the XGBoost model has a higher predictive value for DVT prediction in gastrointestinal tumors, with an AUC value significantly higher than that of nomograms (23). Additionally, Ruifeng D et al. (24) used XGBoost to predict early postoperative DVT in patients after hip surgery. In their study, the XGBoost model achieved an AUC of 0.991 ± 0.012 in the training cohort and an AUC of 0.982 in the validation cohort, with a sensitivity of 0.913 and a specificity of 0.998. The calibration and DCA curves in the validation cohort indicated good performance by the XGBoost model. Our study showed similar performance on these evaluation metrics, validating the model’s effectiveness and reliability.

Consistent with previous studies (25), advanced age was a predictive factor for VTE occurrence. In our predictive model, the SHAP feature importance ranking identified advanced age as the most important predictor, indicating that age plays a crucial role in predicting VTE risk. As age increases, reduced vascular elasticity and changes in coagulation mechanisms can increase the risk of thrombosis. Additionally, reduced activity and the presence of multiple comorbidities in the elderly also increase the likelihood of VTE occurrence.
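A SHAP importance ranking of this kind is commonly obtained by averaging the absolute SHAP values of each feature across patients. The sketch below illustrates that calculation under this assumption. The SHAP matrix is hypothetical and is not taken from our model, and the function name is illustrative.

```python
# Hypothetical per-patient SHAP values (rows = patients, columns = features).
feature_names = ["age", "prealbumin", "wbc"]
shap_values = [
    [0.30, -0.10, 0.05],
    [-0.20, 0.15, -0.05],
    [0.40, -0.05, 0.10],
]


def rank_by_mean_abs_shap(values, names):
    """Rank features by mean absolute SHAP value, largest first."""
    n = len(values)
    importance = {
        name: sum(abs(row[j]) for row in values) / n
        for j, name in enumerate(names)
    }
    return sorted(importance.items(), key=lambda item: item[1], reverse=True)


ranking = rank_by_mean_abs_shap(shap_values, feature_names)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

The mean absolute value measures only the size of a feature's contribution. The sign of each individual SHAP value indicates whether the feature raised or lowered the predicted risk for that patient.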

Prealbumin is a protein synthesized in the liver, commonly used to assess nutritional status and liver function. Its levels can reflect a person’s nutritional state and inflammatory response (26, 27). Low levels of prealbumin are often associated with malnutrition, which may increase the risk of DVT (28). Malnutrition can lead to increased blood viscosity and endothelial dysfunction, thereby promoting thrombosis. Meanwhile, prealbumin levels decrease during acute inflammation or infection. The inflammatory response is a crucial mechanism in thrombosis as it can lead to endothelial damage and activation of coagulation factors (29, 30).

Studies have shown that there is a complex relationship between leukocyte activity and venous thrombosis, and the activity of inflammatory cells may play an important role in the natural history of thrombosis (31). Furthermore, research points out that when hematocrit is controlled, an increased white blood cell count (>12) is significantly correlated with the risk of thrombotic events (32). These discoveries highlight the significance of including white blood cell count as a factor in managing VTE, particularly among high-risk groups like surgical and cancer patients.

Our prediction tool includes several additional features, including preoperative hemoglobin, preoperative albumin, CEA, and PIVKA-II, all of which are essential preoperative laboratory tests. We also included surgery duration as a history-related feature. Some features in the tool have SHAP values that are inconsistent with clinical knowledge. However, these features contribute differently to the overall model, and their effects should be interpreted in the context of the model as a whole.
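The step of retaining only the features shared by the three selection algorithms can be sketched as a set intersection. In the minimal Python example below, the eight labels correspond to the variables discussed in this section and are assumed here to form the shared set. The extra entries (bmi, platelets) and the per-algorithm lists are hypothetical, since only the counts of selected features are reported.

```python
# Variables discussed in this section, used here as illustrative labels.
core = {
    "age", "prealbumin", "wbc", "hemoglobin",
    "albumin", "cea", "pivka_ii", "surgery_duration",
}

# Hypothetical per-algorithm selections; the real lists are longer.
selected = {
    "lasso": core | {"bmi", "platelets"},
    "xgboost": core | {"bmi"},
    "random_forest": core | {"platelets"},
}


def common_features(selections):
    """Return the features chosen by every selector, sorted for stable output."""
    return sorted(set.intersection(*selections.values()))


shared = common_features(selected)
print(len(shared), shared)
```

A feature selected by only one or two algorithms, such as the hypothetical bmi above, is dropped by the intersection.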

Our study has some limitations. Because of its retrospective design, we were unable to include some potentially valuable variables that may be closely related to colorectal cancer. Although extensive literature indicates that D-dimer (DD) values may be closely linked to the occurrence of postoperative DVT (6, 33), preoperative DD had a large number of missing values and was removed during preprocessing. We anticipate that, with advances in genetics and bioinformatics, more predictive biomarkers will be identified and used, such as the tumor genomic features in the TiC-Onco model (34). Additionally, owing to constraints of the data system, we could not perform extended observation of patients who were moved to rehabilitation facilities approximately 10 days after surgery. Finally, because external validation was lacking, it is unclear whether our results apply to other populations, and further research in additional cohorts is needed. In summary, these limitations hinder the clinical application of this predictive model, and further prospective studies with larger samples and rigorous design are required. As an initial exploration of this research theme, we hope this study offers some guidance for future prospective research.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the Medical Ethics Committee of the Affiliated Hospital of Southwest Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because according to national legislation and institutional requirements, participants and their legal guardians/next of kin are not required to provide written informed consent.

Author contributions

XL: Conceptualization, Data curation, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. XS: Investigation, Project administration, Resources, Writing – original draft, Writing – review & editing. YZ: Conceptualization, Data curation, Formal analysis, Resources, Writing – original draft, Writing – review & editing. YJ: Conceptualization, Data curation, Formal analysis, Funding acquisition, Resources, Validation, Visualization, Writing – review & editing, Writing – original draft.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA A Cancer J Clin. (2021) 71:7–33. doi: 10.3322/caac.21654

2. Bikdeli B, Caraballo C, Trujillo-Santos J, Galanaud JP, di Micco P, Rosa V, et al. Clinical presentation and short- and long-term outcomes in patients with isolated distal deep vein thrombosis vs proximal deep vein thrombosis in the RIETE registry. JAMA Cardiol. (2022) 7:857–65. doi: 10.1001/jamacardio.2022.1988

3. Behrendt CA, Twerenbold R, Blankenberg S. The everlasting challenge to identify deep vein thrombosis in both clinical practice and research. Eur Heart J. (2022) 43:1882–3. doi: 10.1093/eurheartj/ehac164

4. Mrinalini Tadigiri M, Imam A J, Martins R, Daruwala F. A rare case of rectal carcinoma with pulmonary artery thrombosis. Cureus. (2024) 16:e56095. doi: 10.7759/cureus.56095

5. Lyman GH, Bohlke K, Falanga A. Venous thromboembolism prophylaxis and treatment in patients with cancer: American society of clinical oncology clinical practice guideline update. JOP. (2015) 11:e442–4. doi: 10.1200/JOP.2015.004473

6. Zhang W, Sun R, Hu X, Chen Z, Lai C. Caprini risk assessment model combined with D-dimer to predict the occurrence of deep vein thrombosis and guide intervention after laparoscopic radical resection of colorectal cancer. World J Surg Onc. (2023) 21:299. doi: 10.1186/s12957-023-03183-7

7. Gould MK, Garcia DA, Wren SM, Arcelus JI, Heit JA, Samama CM. Prevention of VTE in nonorthopedic surgical patients. Chest. (2012) 141:e227S–77S. doi: 10.1378/chest.11-2297

8. Rafique R, Islam SR, Kazi JU. Machine learning in the prediction of cancer therapy. Comput Struct Biotechnol J. (2021) 19:4003. doi: 10.1016/j.csbj.2021.07.003

9. Needleman L, Cronan JJ, Lilly MP, Merli GJ, Adhikari S, Hertzberg BS, et al. Ultrasound for lower extremity deep venous thrombosis: multidisciplinary recommendations from the society of radiologists in ultrasound consensus conference. Circulation. (2018) 137:1505–15. doi: 10.1161/circulationaha.117.030687

10. Barrosse-Antle ME, Patel KH, Kramer JA, Baston CM. Point-of-care ultrasound for bedside diagnosis of lower extremity DVT. CHEST. (2021) 160:1853–63. doi: 10.1016/j.chest.2021.07.010

11. Panpikoon T, Chuntaroj S, Treesit T, Chansanti O, Bua-ngam C. Lower-extremity venous ultrasound in DVT-unlikely patients with positive D-dimer test. Acad Radiology. (2022) 29:1058–64. doi: 10.1016/j.acra.2020.06.028

12. Wei Q, Wei ZQ, Jing CQ, Li YX, Zhou DB, Lin MB, et al. Incidence, prevention, risk factors, and prediction of venous thromboembolism in Chinese patients after colorectal cancer surgery: a prospective, multicenter cohort study. Int J Surg. (2023) 109:3003–12. doi: 10.1097/JS9.0000000000000553

13. Moghadamyeghaneh Z, Hanna MH, Carmichael JC, Nguyen NT, Stamos MJ. A nationwide analysis of postoperative deep vein thrombosis and pulmonary embolism in colon and rectal surgery. J Gastrointestinal Surgery. (2014) 18:2169–77. doi: 10.1007/s11605-014-2647-5

14. Chen K, Shiomi A, Kagawa H, Matsuda T, Tanaka Y, Yamamoto S, et al. Efficacy of a robotic stapler on symptomatic anastomotic leakage in robotic low anterior resection for rectal cancer. Surg Today. (2024) 54:1–10. doi: 10.1007/s00595-021-02313-6

15. Mponponsuo K, Leal J, Spackman E, Somayaji R, Gregson D, Rennert-May E. Mathematical model of the cost-effectiveness of the BioFire FilmArray Blood Culture Identification (BCID) Panel molecular rapid diagnostic test compared with conventional methods for identification of Escherichia coli bloodstream infections. J Antimicrobial Chemotherapy. (2022) 77:507–16. doi: 10.1093/jac/dkab398

16. Johnson PM, Lin DJ, Zbontar J, Zitnick CL, Sriram A, Muckley M, et al. Deep learning reconstruction enables prospectively accelerated clinical knee MRI. Radiology. (2023) 307:e220425. doi: 10.1148/radiol.220425

17. Aromolaran O, Aromolaran D, Isewon I, Oyelade J. Machine learning approach to gene essentiality prediction: a review. Briefings Bioinf. (2021) 22:1–10. doi: 10.1093/bib/bbab128

18. Garriga R, Mas J, Abraha S, Harrison O, Tadros G, Matic A. Machine learning model to predict mental health crises from electronic health records. Nat Med. (2022) 28:1240–8. doi: 10.1038/s41591-022-01811-5

19. Sun J, Sun CK, Tang YX, Liu TC, Lu CJ. Application of SHAP for explainable machine learning on age-based subgrouping mammography questionnaire data for positive mammography prediction and risk factor identification. Healthcare (Basel). (2023) 11:2000. doi: 10.3390/healthcare11142000

20. Mateussi N, Rogers MP, Grimsley EA, Read M, Parikh R, Pietrobon R, et al. Clinical applications of machine learning. Ann Surg Open. (2024) 5:e423. doi: 10.1097/AS9.0000000000000423

21. Sun T, Liu J, Yuan H, Li X, Yan H. Construction of a risk prediction model for lung infection after chemotherapy in lung cancer patients based on the machine learning algorithm. Front Oncol. (2024) 14:1403392. doi: 10.3389/fonc.2024.1403392

22. Lu X, Zeng W, Zhu L, Liu L, Du F, Yang Q. Application of the Caprini risk assessment model for deep vein thrombosis among patients undergoing laparoscopic surgery for colorectal cancer. Med (Baltimore). (2021) 100:e24479. doi: 10.1097/MD.0000000000024479

23. Zhang Y, Ma Y, Wang J, Guan Q, Yu B. Construction and validation of a clinical prediction model for deep vein thrombosis in patients with digestive system tumors based on a machine learning. Am J Cancer Res. (2024) 14:155–68. doi: 10.62347/LNDL8700

24. Ding R, Ding Y, Zheng D, Huang X, Dai J, Jia H, et al. Machine learning-based screening of risk factors and prediction of deep vein thrombosis and pulmonary embolism after Hip arthroplasty. Clin Appl Thromb Hemost. (2023) 29:10760296231186145. doi: 10.1177/10760296231186145

25. Tan WJ, Chen L, Yang SJ, Zhang BY, Sun ML, Lin YB, et al. Development and validation of a prediction model for venous thrombus embolism (VTE) in patients with colorectal cancer. Technol Cancer Res Treat. (2023) 22:1–10. doi: 10.1177/15330338231186790

26. Wang QR, Long J, Wang CC, Hu JL, Lin N, Tang SH. Case report of atypical undernutrition of hypoproteinemia type. Open Life Sci. (2023) 18:20220766. doi: 10.1515/biol-2022-0766

27. Loftus TJ, Brown MP, Slish JH, Rosenthal MD. Serum levels of prealbumin and albumin for preoperative risk stratification. Nutr Clin Pract. (2019) 34:340–8. doi: 10.1002/ncp.10271

28. Truong A, Hanna MH, Moghadamyeghaneh Z, Stamos MJ. Implications of preoperative hypoalbuminemia in colorectal surgery. World J Gastrointest Surg. (2016) 8:353–62. doi: 10.4240/wjgs.v8.i5.353

29. López B, Castañón-Apilánez M, Molina-Gil J, Fernández-Gordón Sánchez S, González G, Reguera Acuña A, et al. Serum prealbumin levels on admission as a prognostic marker in stroke patients treated with mechanical thrombectomy. Cerebrovasc Dis Extra. (2022) 12:102–7. doi: 10.1159/000526354

30. Bittar LF, da Silva LQ, Orsi FL de A, Zapponi KCS, Mazetto de B M, de Paula EV, et al. Increased inflammation and endothelial markers in patients with late severe post-thrombotic syndrome. PloS One. (2020) 15:e0227150. doi: 10.1371/journal.pone.0227150

31. Saha P, Humphries J, Modarai B, Mattock K, Waltham M, Evans CE, et al. Leukocytes and the natural history of deep vein thrombosis: current concepts and future directions. Arterioscler Thromb Vasc Biol. (2011) 31:506–12. doi: 10.1161/ATVBAHA.110.213405

32. Cutsem EV, Mahé I, Felip E, Agnelli G, Awada A, Cohen A, et al. Treating cancer-associated venous thromboembolism: A practical approach. Eur J Cancer. (2024) 209:1–10. doi: 10.1016/j.ejca.2024.114263

33. Wu Y, Wang L, Yin Q, Deng L, Ma J, Tian X. Establishment and validation of a postoperative VTE prediction model in patients with colorectal cancer undergoing radical resection: CRSPOT nomogram. Clin Appl Thromb Hemost. (2023) 29:10760296231216966. doi: 10.1177/10760296231216966

34. Muñoz Martín AJ, Ortega I, Font C, Pachón V, Castellón V, Martínez-Marín V, et al. Multivariable clinical-genetic risk model for predicting venous thromboembolic events in patients with cancer. Br J Cancer. (2018) 118:1056–61. doi: 10.1038/s41416-018-0027-8

Keywords: colorectal cancer, venous thrombosis, machine learning, prediction model, postoperative complications

Citation: Liu X, Shu X, Zhou Y and Jiang Y (2024) Construction of a risk prediction model for postoperative deep vein thrombosis in colorectal cancer patients based on machine learning algorithms. Front. Oncol. 14:1499794. doi: 10.3389/fonc.2024.1499794

Received: 21 September 2024; Accepted: 05 November 2024;
Published: 27 November 2024.

Edited by:

Mohsin Saleet Jafri, George Mason University, United States

Reviewed by:

Eric Munger, United States Department of Veterans Affairs, United States
Soukaina Amniouel, National Center for Advancing Translational Sciences (NIH), United States

Copyright © 2024 Liu, Shu, Zhou and Jiang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yifan Jiang, anlmMDE2MDIzQDE2My5jb20=
