Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 04 March 2022
Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers
This article is part of the Research Topic Identification of Novel Biomarkers for Pancreatic and Hepatocellular Cancers View all 20 articles

A Promising Preoperative Prediction Model for Microvascular Invasion in Hepatocellular Carcinoma Based on an Extreme Gradient Boosting Algorithm

Weiwei Liu&#x;Weiwei Liu1†Lifan Zhang&#x;Lifan Zhang2†Zhaodan XinZhaodan Xin1Haili ZhangHaili Zhang3Liting YouLiting You1Ling BaiLing Bai1Juan Zhou*Juan Zhou1*Binwu Ying*Binwu Ying1*
  • 1Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, China
  • 2Department of Gastroenterology and Hepatology, West China Hospital, Sichuan University, Chengdu, China
  • 3Department of Liver Surgery & Liver Transplantation Center, West China Hospital, Sichuan University, Chengdu, China

Background: The non-invasive preoperative diagnosis of microvascular invasion (MVI) in hepatocellular carcinoma (HCC) is vital for precise surgical decision-making and patient prognosis. Herein, we aimed to develop an MVI prediction model with valid performance and clinical interpretability.

Methods: A total of 2160 patients with HCC without macroscopic invasion who underwent hepatectomy for the first time in West China Hospital from January 2015 to June 2019 were retrospectively included, and randomly divided into training and a validation cohort at a ratio of 8:2. Preoperative demographic features, imaging characteristics, and laboratory indexes of the patients were collected. Five machine learning algorithms were used: logistic regression, random forest, support vector machine, extreme gradient boosting (XGBoost), and multilayer perception. Performance was evaluated using the area under the receiver operating characteristic curve (AUC). We also determined the Shapley Additive exPlanation value to explain the influence of each feature on the MVI prediction model.

Results: The top six important preoperative factors associated with MVI were the maximum image diameter, protein induced by vitamin K absence or antagonist-II, α-fetoprotein level, satellite nodules, alanine aminotransferase (AST)/aspartate aminotransferase (ALT) ratio, and AST level, according to the XGBoost model. The XGBoost model for preoperative prediction of MVI exhibited a better AUC (0.8, 95% confidence interval: 0.74–0.83) than the other prediction models. Furthermore, to facilitate use of the model in clinical settings, we developed a user-friendly online calculator for MVI risk prediction based on the XGBoost model.

Conclusions: The XGBoost model achieved outstanding performance for non-invasive preoperative prediction of MVI based on big data. Moreover, the MVI risk calculator would assist clinicians in conveniently determining the optimal therapeutic remedy and ameliorating the prognosis of patients with HCC.

1 Introduction

Hepatocellular carcinoma (HCC) is one of the most common malignancies and the third leading cause of cancer-related death worldwide (1). Surgical resection is one of the predominant treatments for early-stage HCC; however, the high incidence of postoperative recurrence and metastasis largely threatens the long-term survival of patients (2). Microvascular invasion (MVI), the embolus of cancer cells with micro-metastasis in liver vessels, is an independent prognostic factor for recurrence and metastasis in HCC (3). Recently, an increasing number of studies (4, 5) have shown that a precise surgical approach and timely postoperative adjuvant therapy for patients with HCC and MVI could reduce recurrence and improve survival.

Patients with HCC and MVI have been demonstrated to achieve better prognosis through anatomical resection than through non-anatomical resection (6). In addition, scholars have suggested that patients with HCC and MVI should be treated with wide margin resection rather than narrow margin resection, as it achieves better relapse-free survival (7). Besides, surgical resection provides better tumor control than radiofrequency ablation (RFA) treatment in patients with small HCC, especially those with a high risk of MVI (8). MVI status is crucially important for clinicians to choose the optimal therapy, while MVI can only be confirmed by postoperative histopathological examination; therefore, preoperative prediction of MVI is urgent.

Early studies have focused on blood biomarkers that can predict MVI; among these, α-fetoprotein (AFP) is considered one of the most notable biomarkers (9). However, its predictive efficacy for MVI was poor in univariate analysis (10, 11), while the combination of multiple biomarkers showed greater potential (12, 13). With the development of big-data-driven approaches, machine learning (ML) has been extensively used in various diseases, such as cardiac abnormalities (14), pulmonary diseases (15), neurological disorders (16), and oncology (1720), showing great ability in prediction model construction. ML algorithms demonstrate the advantages of robust feature selection and the ability to identify clinically important risks among patients, are dedicated to finding complex patterns in big data with high accuracy and are suitable for constructing predictive models from numerous multidimensional factors, especially non-linear complex data.

However, different ML algorithms have their own advantages and disadvantages. Recently, Deng et al. combined the neutrophil-to-lymphocyte ratio, tumor size, and AFP to establish a nomogram for predicting MVI in 513 patients with HCC, but the sensitivity and specificity of the model were only 61.64% and 71.53%, respectively (21). Lei et al. constructed a nomogram to predict MVI in 1004 patients with HCC, but the high false-positive (23.4%) and false-negative rates (26.5%) still need to be considered when it is applied in clinical decision-making (22). These models were based on logistic regression algorithms; although simple to construct, they were prone to underfitting, and the clinical application accuracy was not ideal. Therefore, more ML algorithms have been applied to predict the occurrence of MVI and achieve better performance. Chen et al. proposed an MVI prediction model by integrating blood tests based on a deep learning method with concordance indexes of 0.9341 and 0.9052 in the training and validation cohorts, respectively (23). Additionally, the inclusion of radiomic features and multi-omics data improved the model’s predictive performance of MVI. Xu et al. integrated radiomics, clinical features, and liver and renal function indicators and developed a multivariate logistic regression MVI prediction model in 495 patients with HCC, with an area under the receiver operating characteristic curve (AUC) of 0.889 (24). However, these radiomic features require sophisticated techniques and experts and are not easy to popularize. Overall, the sample size of previous studies was small, ranging from 150 to 1004 patients (24, 25), and it was demonstrated that the robustness of the model based on big data was better than that based on small data (26). Therefore, it is necessary to establish an MVI prediction model with reliable and excellent performance using conveniently available clinical indicators and big data.

The noninvasive preoperative diagnosis of MVI in HCC is vital for precise surgical decision-making and patient prognosis. In this study, we attempted to use multiple ML algorithms to develop a preoperative MVI prediction model and select an optimal one, based on the big data of patients with HCC at the West China Hospital, from multidimensional and conveniently available variables. Simultaneously, we quantified and explained the important variables related to MVI, visually exhibiting them using the Shapley Additive exPlanation (SHAP) algorithm. Remarkably, to make it more convenient in clinical situations, we created an MVI risk online calculator for clinicians to assist in precise HCC treatment visually and operationally.

2 Methods

2.1 Ethics and Statements

This study was approved by the Institutional Ethics Committee of the West China Hospital, Sichuan University [number: 2019 (203)].

2.2 Study Design and Patients

Patients with HCC who underwent surgery at the West China Hospital between January 2015 and June 2019 were retrospectively enrolled (Figure 1). The inclusion criteria were as follows: 1) patients who had undergone hepatectomy or liver transplantation for the first time and were pathologically diagnosed with HCC alone, regardless of whether they had received transcatheter arterial chemoembolization (TACE) before; and 2) the Guidelines for Diagnosis and Treatment of Primary Liver Cancer in China (2017 Edition) (27) were used as diagnostic criteria. The exclusion criteria were as follows: 1) Patients with HCC with recurrence who had previously undergone hepatectomy or RFA; 2) HCC with macroscopic invasion; 3) Patients with HCC and tumors at other sites; and 4) Patients with HCC and other tumors, such as bile duct adenocarcinoma, intrahepatic cholangiocarcinoma, and sarcoma.

FIGURE 1
www.frontiersin.org

Figure 1 Flow chart of patient selection.

MVI diagnosis has relied on the judgment of two or more experienced pathologists based on seven-point sampling (28) to ensure MVI detection ability since 2015 in our hospital; hence, we used data from 2015.

2.3 Preoperative Examination and Clinicopathologic Variables

In total, 88 indicators were extracted from Electronic Health Records (EHR) of all patients. All of the indicators were all collected within 1 month before surgery, including three categories of characteristics: 1) demographic data: age, gender, height, weight, body mass index, ethnicity, preoperative TACE (yes or no), hepatitis B, and hepatitis C; 2) imaging features: single or multiple tumors, maximum diameter of the tumors, imaging cirrhosis, tumor margin, etc.; 3) laboratory examination results: routine blood test results, biochemical test results (hepatic function indexes, kidney function indexes, glucose, etc.), routine coagulation test results (activated partial thromboplastin time [APTT], prothrombin time, etc.), hepatitis B virus (HBV)-DNA load and tumor markers (AFP, carbohydrate antigen, carcinoembryonic antigen, cancer antigen 125, protein induced by vitamin K absence or antagonist-II [PIVKA-II]). The details of all indicators are listed in Supplemental Table 1.

2.4 Statistical Analysis

Continuous variables with normal distribution are expressed as mean ± standard deviation and were compared between the MVI and non-microvascular invasion (NMVI) groups using Student’s t-test. Non-normal variables were analyzed using the Kruskal-Wallis rank sum test. Categorical variables are expressed as frequency (%), and chi-square tests or Fisher exact tests were applied to these data, as appropriate. All statistical analyses were performed using Python (version 3.7.9), and p < 0.05 indicated statistical significance.

2.5 Machine Learning Models Establishment

The patients with HCC in this study were randomly divided into a training cohort and a validation cohort at a ratio of 8:2. To overcome the imbalance in the data, an under-sampling approach was applied (29). We attempted to develop MVI predictive models using five ML algorithms: logistic regression (LR), random forest (RF), support vector machine (SVM), multilayer perception (MLP), and extreme gradient boosting (XGBoost). LR involves modeling the relationship between explanatory variables and the log odds of a binary outcome by employing the maximum likelihood algorithm (30). RF, a tree-like model, integrates multiple decision trees through major voting, reducing variance, and increasing robustness (31). SVM is a computational algorithm that separates binary labeled data based on a line to realize the maximum distance between the labeled data using hinge loss to calculate the empirical risk (32). An MLP is typically built into structured node groups with activation functions and connection weights that mimic the behavior of biological neural networks and processes distributed and parallel information (33). XGBoost is an optimized distributed gradient boosting algorithm that uses a second-order Taylor expansion to approximate the loss function, which efficiently avoids overfitting problems by adding a regularization term to the objective function, providing excellent predictions by transforming a set of weak learners into strong learners (34).

In this study, we randomly divided the data into five equal subsets. Four subsets were used to train the model and were then validated in the remaining subsets. In this process, hyperparameter adjustment was performed for the higher area under the receiver operating characteristic curve (AU-ROC), which could evaluate the prediction ability of the model. The hyperparameters were determined using a grid search, which can be tuned and scored in a loop. Changing the subset ratio to display the learning curve of the AU-ROC model helps prevent overfitting and underfitting. After adjusting the hyperparameters, the final model of the entire training set was obtained, and then the model was evaluated on the test set. The LR, RF, SVM, and MLP models were implemented in Python (version 3.7.9) using the scikit-learn package. The XGBoost model was implemented using the Python XGBoost package.

2.6 Hyperparameters Adjustment of Microvascular Invasion-Predicting Model

Hyperparameters were fully optimized since the training log-loss decreased as the number of trees increased; when the test log-loss was <0.693(-log0.5 = 0.693, the test log-loss of blind guess was 0.693; a lower log-loss means a better prediction) or only slightly larger than the training log-loss, the hyperparameters were fully adjusted. We ran 100 bootstrap iterations to determine the number of trees in the final model, as recommended in previous literature. Based on the grid search, the hyperparameters used in XGBoost were set as follows: learning rate = 0.13, minimum child weight = 1, maximum tree depth = 6, and number of rounds = 100. The hyperparameters used in the other models are presented in Supplemental Table 2.

2.7 Model Performance Evaluation

To evaluate the prediction performance of the various ML models, the AUC was measured and compared. We also used precision recall curve (PRC) to measure the number of positive examples that were correctly classified, which better reflecting the predictive performance when an imbalance between the groups exists. The confusion matrix was used to visually describe the accuracy of XGBoost in identifying the MVI and NMVI statuses, including true positive (TP), false positive (FP), true negative (TN), and false negative (FN).

Accuracy=(TP+TN)/(TP+FN+TN+FP)
Specificity=TN/(TN+FP)

2.8 Interpretation of the Model by the Shapley Additive exPlanation

It is critical to correctly interpret the prediction model. Thus, the SHAP algorithm (35), a game-theoretic approach to explain the output of any ML model, was employed to obtain accurate attribution values for each feature within the prediction model. The SHAP value can be considered a quantified contribution. We can easily determine the contribution of all features and which contribution is the most.

3 Results

3.1 Basic Characteristics

The characteristics of the 2160 patients with HCC are summarized in Table 1; 575 (27%) had MVI and 1585 (73%) had NMVI. The mean age of the patients was 53.2 years. The Han ethnic group accounted for 94.7% of the population. HBV positivity was found in 1773 (82%) cases. Hepatitis C virus positivity was observed in 23 cases (1.1%). HCC with cirrhosis was observed in 911 (42.2%) patients. We randomly divided these patients and allocated 80% of them to the training set and the remaining 20% to the test set. For all variables, the differences between the training and test sets were not significant. Details are presented in Supplemental Table 3.

TABLE 1
www.frontiersin.org

Table 1 The participant baseline characteristics data.

3.2 Clinical Characteristic Differences Between the Study Groups

The preoperative clinical characteristics of all the patients are shown in Table 2. Overall, in terms of imaging features, the MVI group had a larger maximum tumor diameter than the NMVI group (7.1 cm ± 3.7 cm versus [vs.] 4.9 cm ± 3.1 cm, p < 0.001). The occurrence frequency of satellite nodules (19.5% vs. 7.4%, p < 0.001) and intra-tumoral artery (29.1% vs. 14.4%, p < 0.001) were higher in the MVI group than in the NMVI group. Regarding laboratory examinations, the MVI group had a higher PLT count (158.9 × 109/L ± 77.9 × 109/L vs. 136.2 × 109/L ± 67.6 × 109/L, p < 0.001), aspartate aminotransferase (AST) level (55.3 IU/L ± 49.1 IU/L vs. 43.7 IU/L ± 38.3 IU/L, p < 0.001), AST/alanine aminotransferase (ALT) ratio (1.3 ± 0.9 vs. 1.1 ± 0.5, p < 0.001), γ-glutamyl transferase (GGT) level (116.7 IU/L ± 127.2 IU/L vs. 85.6 IU/L ± 122.8 IU/L, p < 0.001), lactate dehydrogenase level (215.2 IU/L ± 118.9 IU/L vs. 186.8 IU/L ± 69.5 IU/L, p < 0.001), hydroxybutyrate dehydrogenase (HBDH) level (160.6 IU/L ± 83.1 IU/L vs. 146.1 IU/L ± 54.7 IU/L, p < 0.001), AFP level >400 ng/mL (50.0% vs. 28.9%, p < 0.001), and PIVKA-II level (11905 mAU/mL ± 21680.9 mAU/mL vs. 3009.1 mAU/mL ± 9716.8 mAU/mL, p < 0.001) than the NMVI group. Moreover, the MVI group had more abnormal imaging and laboratory examination results than the NMVI group (Supplemental Table 1).

TABLE 2
www.frontiersin.org

Table 2 The clinical characteristic differences between MVI and NMVI group.

3.3 Development and Validation of the MVI-Predicting Model

3.3.1 Development of the MVI-Predicting Model

All models were parameterized with these hyperparameters, and bootstrap validation training log-loss decreased as the number of integration trees increased. The bootstrap validation testing log-loss was <0.693, which was only slightly higher than the training log-loss when the number of rounds increased. Here, we only show the training curve of the XGBoost model as an example, which indicated a good fitting (Figure 2A). When the sample size reached 200 rounds, the log-loss of the training and test sets gradually tended to be stable, indicating that the model was well turned.

FIGURE 2
www.frontiersin.org

Figure 2 Development and validation of MVI-prediction model (A) The training process of XGBoost model. Train-log-loss-mean value for the training datasets is shown in the vertical axis. The horizontal axis represents the number of times iterative cross-validation. (B) The learning curve of the score of training cohort and testing cohort. The score for training and test cohorts is shown in the vertical axis. The horizontal axis represents the number of samples trained.

The learning curve showed that as the score of the training cohort decreased, the score of the validation cohort increased with an increase in training samples. We also used the XGBoost model for demonstration (Figure 2B). This revealed that as the sample size increased, the model had not been overfitted or underfitted, indicating a robust predictive performance.

3.3.2 Model Performance

We used RF, SVM, LR, XGBoost, and MLP algorithms to construct and optimize the MVI prediction model, and found that the XGBoost model achieved the highest AUC (0.8, 95% confidence interval [CI]: 0.74–0.83) (Figure 3A), followed by the RF (0.77, 95% CI: 0.73–0.81), LR (0.73, 95% CI: 0.70–0.77), SVM (0.66, 95% CI: 0.61–0.71), and MLP models (0.65, 95% CI: 0.60–0.70). Since the positive and negative samples were highly skewed datasets in our study, we used PRC to reflect the performance of the classifier more effectively. The area under the precision recall curve (AUPRC) value of the XGBoost model was much higher (0.71, 95% CI: 0.64–0.78) than that of the other models (Figure 3B). Other algorithms showed the following AUPRC values: RF model, 0.7, 95% CI: 0.65–0.77; LR model, 0.65, 95% CI: 0.62–0.71; SVM model, 0.53, 95% CI: 0.41–0.60; and MLP model, 0.53, 95% CI: 0.47–0.61. Additionally, the confusion matrix showed that the accuracy and specificity of the XGBoost model were 73% and 84%, respectively (Figure 3C).

FIGURE 3
www.frontiersin.org

Figure 3 Performance of the predictive models. (A) The ROC curve analysis of various prediction model. (B) The PRC curve of different models. The confusion matrix of XGBoost model in the validation cohort. (C) The confusion matrix of XGBoost model. The confusion matrix was composed of the True negative in the first quadrant, the false negative samples in the second quadrant, the true positive example in the third quadrant and the false positive example in the fourth quadrant.

3.4 Model Interpretation

The feature importance matrix plot sorted the most important variables, revealing the contribution of each variable to MVI versus NMVI. The top six factors associated with MVI were the maximum image diameter, PIVKA-II level, AFP level, satellite nodules, AST/ALT ratio, and AST level (Figure 4A). To illustrate the influence of each feature on model prediction, we drafted the SHAP value summary chart, and only the top 15 variables of the model are shown (Figure 4B). The chart shows the correlation between the high or low SHAP values and the prediction model. We observed that the red dots, which represent the high values of the maximum image diameter, AFP level, satellite nodules, AST/ALT ratio, and AST level, appeared more on the side of the higher probability risk of MVI. This indicates that the SHAP values of these indicators were positively correlated with the possibility of the occurrence of MVI. The red dots representing the high values of the PIVKA-II level were covered by blue dots, indicating lower values of the PIVKA-II level. This indicated that predication was affected by the extreme values, and even if the PIVKA-II values were high, some points were still blue and tended to predict the occurrence of NMVI. Furthermore, the SHAP values of the above features are displayed, which show a clear distinction between MVI and NMVI. The cutoff values of the maximum image diameter, PIVKA-II level, AFP level, satellite nodules, AST/ALT ratio, and AST level were 5 cm, 500 mAU/mL, 200 ng/mL, one nodule, 1, and 50 U/L, respectively (Figure 4C). By integrating the SHAP value summary chart and the SHAP scatter plot, the sensitivity of PIVKA-II was found to be good, while the specificity of PIVKA-II was not. Other indicators showed good specificity and sensitivity.

FIGURE 4
www.frontiersin.org

Figure 4 Model interpretation. (A) Feature importance matrix plot derived from XGBoost model. (B) SHAP summary plot of the XGBoost model. The higher the SHAP value for each feature, the higher risk of MVI development. A dot represents each feature contribution for each patient in the model. Red indicates a high SHAP value, blue indicates a low SHAP value. (C) SHAP dependance plot of XGBoost model. The SHAP dependance plot represents the contribution of each feature that we care about to the output of the XGBoost model. If the SHAP value of the feature we care about is exceeds zero, the higher the risk of MVI will be.

3.5 Online Calculator Based on the Extreme Gradient Boosting Microvascular Invasion Prediction Model

We established a website based on the XGBoost model to predict the risk of MVI (https://260147169.github.io/MVI-calculator/MVI-calculator.html). We only needed to fill in the corresponding parameters of each indicator, including the maximum image diameter, PIVKA-II level, AFP level, satellite nodules, AST/ALT ratio, AST levels, HBV, total bile acid level, PLT count, GGT level, APTT, HBDH level, mean red blood cell volume, and white blood cell count. The online calculator automatically and promptly converted the MVI risk score (Supplemental Figure 1).

4 Discussion

In this study, we developed an XGBoost model for the preoperative prediction of MVI based on the EHR information of 2160 patients with HCC at the West China Hospital, and it exhibited the best AUC in the validation set compared with the other ML algorithms and showed good interpretability, as well as the importance of MVI-related factors. Furthermore, we built an online calculator based on this model to make prediction of MVI more practical. Overall, we provide a valuable and reliable preoperative MVI prediction model, which may be effective in optimizing surgical treatment and further improving the survival of patients with HCC.

Notably, the lack of specific and effective preoperative indicators is one of the bottlenecks in the diagnosis of MVI. In this retrospective cohort study, we developed an XGBoost model using 88 objective preoperative demographic, imaging, and laboratory indicators to predict the possibility of MVI. AUC analysis alone is often insufficient for comparing predictive models, particularly in an imbalanced dataset. Therefore, we used both the AUC and AUPRC to evaluate the performance of the five ML methods. Compared with other models, the XGBoost model exhibited a better performance with an AUC of 0.8 and an AUPRC of 0.71.

Several ML methods have been developed to predict the risk of MVI. Some methods were based on radiomics or included only a few vital tumor biomarkers to build ML algorithms to predict MVI (36). Dong et al. (37) established a radiomic algorithm to make preoperative predictions of MVI based on grayscale ultrasonograms. The radiomic signatures based on the features of the gross tumor region (GTR), peri-tumoral region, and gross peritumoral region (GPTR) showed AUCs of 0.708, 0.710, and 0.726, respectively. After incorporating some important tumor biomarkers, the AUC of the GPTR radiomic signature was 0.744, and the AUC of the GTR radiomic signature was 0.806. This might ignore useful information, especially some serum biomarkers. It is worth noting that our model was built based on three multidimensional preoperative indicators, including patient clinical characteristics, imaging examination features, and laboratory examination results, without invasive examination. Compared with the MVI prediction models using radiomics as the only predictor (25, 38), the potential significance of this model with multivariable predictors was that we could predict the possibility of MVI using routine clinical information before surgery.

Our work also has the advantages of convenient data collection, ready availability, and objectivity, which are suitable for use in the evaluation of MVI in most clinical situations. Remarkably, all the variables in our model have a short detection time, which can help clinicians quickly obtain reference diagnostic information for patients with no immediate access to pathological diagnosis. Additionally, previous studies constructed an MVI prediction model based on a small sample size (25). Studies have shown that the models with a large sample size have higher robustness than those with a small sample size (26); the sample size of our study was much bigger than that of previous studies as far as we know. Furthermore, our model showed consistent performance between the observed and predicted MVI risks by SHAP values, implying the interpretation and robustness of the model.

The importance ranking of the correlation between the top 15 conventional imaging and laboratory variables with the occurrence of MVI was identified through XGBoost model learning. Specifically, the maximum image diameter, PIVKA-II level, AFP level, satellite nodules, AST/ALT ratio, and AST level showed the top six significant contributions to the prediction of MVI, whether using the importance matrix plot or SHAP summary plot of the XGBoost model. Among them, the maximum image diameter was ranked first. In addition, satellite nodules, one of the imaging indicators, were in the top six importance rankings. Roayaie et al. defined satellite nodules as tumors ≤2 cm in size located ≤2 cm from the main tumor (39). In our SHAP scatter plot of satellite nodules, most of the scatter point values were 1. Our SHAP scatter plot of the maximum image diameter showed that the cutoff value was 5 cm. Recently, Zhang et al. (40) demonstrated that the maximum image diameter and emergence of satellite nodules aggravated the MVI of HCC. Their studies set the cutoff value of the maximum image diameter to 5 cm by referring to different guidelines. They also indicated that the presence of satellite nodules might be a risk factor for predicting the occurrence of MVI. This finding is consistent with our results.

In addition to imaging indicators, laboratory indicators, such as the PIVKA-II level, AFP level, AST/ALT ratio, and AST level, also appeared in the top six SHAP value rankings. This observation was consistent with that of a previous report in which high levels of AFP and PIVKA-II were found to be closely related to MVI (41, 42). Meanwhile, in the SHAP scatter plot, the cutoff of our PIVKA-II level was >500 mAU/L. There were some patients with NMVI even though PIVKA-II values were high; our data showed that the specificity of the PIVKA-II level was relatively poor, and we thought that the cutoff value should be improved. In previous studies, Fumitoshi et al. (43) performed a univariate analysis of 167 patients, revealing that a PIVKA-II level ≥150 mAU/mL on preoperative examination was a high risk factor for MVI. In a study by Pote et al., a PIVKA-II level >90 mAU/mL was an independent predictor of MVI (10). However, this finding is in line with the clinical expectation that a larger PIVKA-II value is more strongly correlated with HCC.

In our research, a consistent tendency was found in that the SHAP values with scattered points above 0 was almost always >200 ng/mL on the SHAP plot of the AFP level. You et al. (44) analyzed 215 patients who underwent liver resection using univariate and multivariate analyses, and showed that an AFP cutoff level of 400 ng/mL was an independent risk factor associated with MVI. The cutoff value of the AFP level in the present study was slightly smaller than that reported in previous studies.

As indicators of impaired liver function, the AST/ALT ratio and AST level were also ranked in importance. In our study, patients with a higher AST/alkaline phosphatase (ALP) ratio and AST level were more likely to develop MVI than those with a lower AST/ALP ratio and AST level, and the cutoff value of the AST/ALT ratio was almost higher than 1 on the SHAP scatter plot. The SHAP-scattered points of serum AST levels were almost greater than 50 U/L. A previous study reported that ALT is mainly present in the cytoplasm of hepatocytes; whereas, AST mainly exists in the mitochondria of hepatocytes, and an increase in its level indicates that hepatocytes have damaged organelles. Therefore, an increased AST/ALT ratio could generally be considered indicative of the deterioration of liver cell damage in patients with cirrhosis and HCC (45). Yang et al. (46). reported that the AST/ALT ratio is often >1 due to the invasion of hepatic carcinoma cells. Dong et al. revealed that an AST level >40 U/L was an independent factor for overall survival in HCC (47). Our SHAP scatter plot confirmed this. Overall, the variables selected for our prediction model were the most clinically common, readily available, and short-duration imaging and laboratory indicators, and they showed good interpretability and consistency with clinical experience, further proving the reliability of the model. This also shows the possibility that our model can be applied to countries and regions with relatively limited medical resources.

The strengths of this study are as follows. First, we used a large dataset to build an ML model for the preoperative prediction of MVI. This could contribute to improving the effective training and rational explanation of the prediction model so that the model was closer to the real situation of the prediction power. Second, we used multiple dimensional indicators to build the prediction model, thus improving its performance. All predictors have the advantages of convenient data collection, ready availability, and objectivity. Third, we used a variety of ML algorithms to select the optimal model that best fits this dataset. Finally, we transformed the model into a visual software based on the selected 15 common clinical indicators that facilitate rapid detection. Thus, the prediction model can be easily applied to countries and regions with relatively limited medical resources.

Despite these advantages, our study also has some limitations. First, this was a retrospective study, and the findings need to be validated in prospective studies. Second, our model was developed based on a single center, so its generalizability needs to be verified in multiple centers. Third, we only constructed a preoperative MVI prediction model; therefore, the clinical benefit of precise surgical choice based on the model needs to be evaluated in the future.

5 Conclusion

In conclusion, our study constructed and validated different ML algorithm models for the preoperative diagnosis of MVI by utilizing preoperative readily available, short-duration, and general noninvasive preoperative indicators. In the final model, we chose the XGBoost algorithm because it had the best performance in predicting MVI. The maximum image diameter, PIVKA-II level, AFP level, satellite nodules, AST/ALT ratio, and AST level were found to be important for predicting the occurrence of MVI. Further, development of the MVI risk-scoring web-calculator based on this model is convenient for clinical application. Meanwhile, the developed model is helpful in preoperatively predicting MVI and assists clinicians in conveniently determining the optimal therapeutic remedy and ameliorating the prognosis of patients with HCC.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Ethics Statement

The studies involving human participants were reviewed and approved by This study was approved by the Institutional Ethics Committee of the West China Hospital, Sichuan University [No. 2019 (203)]. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

Research conception: BY, JZ, and WL. Data processing: LZ and WL. Drafting of the manuscript: WL, LZ, ZX, LY, and LB. Data acquisition: HZ and JZ. Revision of the manuscript: BY and JZ. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Natural Science Foundation of China (NO. 81873979), the Project of Sichuan Provincial Department of Science and Technology (NO. 2020YFS0214, 2020YJ0106).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We would like to thank all the researchers who participated in this study.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.852736/full#supplementary-material

Glossary

www.frontiersin.org

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: A Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

CrossRef Full Text | Google Scholar

2. He W, Peng B, Tang Y, Yang J, Zheng Y, Qiu J, et al. Nomogram to Predict Survival of Patients With Recurrence of Hepatocellular Carcinoma After Surgery. Clin Gastroenterol Hepatol (2018) 16(5):756–64.e10. doi: 10.1016/j.cgh.2017.12.002

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Erstad DJ, Tanabe KK. Prognostic and Therapeutic Implications of Microvascular Invasion in Hepatocellular Carcinoma. Ann Surg Oncol (2019) 26(5):1474–93. doi: 10.1245/s10434-019-07227-9

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lu L, Wei W, Huang C, Li S, Zhong C, Wang J, et al. A New Horizon in Risk Stratification of Hepatocellular Carcinoma by Integrating Vessels That Encapsulate Tumor Clusters and Microvascular Invasion. Hepatol Int (2021) 15(3):651–62. doi: 10.1007/s12072-021-10183-w

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Chen Z, Zhang X, Feng J, Li L, Zhang F, Hu Y, et al. Actual Long-Term Survival in Hepatocellular Carcinoma Patients With Microvascular Invasion: A Multicenter Study From China. Hepatol Int (2021) 15(3):642–50. doi: 10.1007/s12072-021-10174-x

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Jiao S, Li G, Zhang D, Xu Y, Liu J, Li G. Anatomic Versus Non-Anatomic Resection for Hepatocellular Carcinoma, Do We Have an Answer? A Meta-Analysis. Int J Surg (2020) 80:243–55. doi: 10.1016/j.ijsu.2020.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Yamashita Y, Tsuijita E, Takeishi K, Fujiwara M, Kira S, Mori M, et al. Predictors for Microinvasion of Small Hepatocellular Carcinoma ≤ 2 Cm. Ann Surg Oncol (2012) 19(6):2027–34. doi: 10.1245/s10434-011-2195-0

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Lee S, Kang TW, Song KD, Lee MW, Rhim H, Lim HK, et al. Effect of Microvascular Invasion Risk on Early Recurrence of Hepatocellular Carcinoma After Surgery and Radiofrequency Ablation. Ann Surg (2021) 273(3):564–71. doi: 10.1097/sla.0000000000003268

PubMed Abstract | CrossRef Full Text | Google Scholar

9. McHugh PP, Gilbert J, Vera S, Koch A, Ranjan D, Gedaly R. Alpha-Fetoprotein and Tumour Size Are Associated With Microvascular Invasion in Explanted Livers of Patients Undergoing Transplantation With Hepatocellular Carcinoma. HPB (Oxford) (2010) 12(1):56–61. doi: 10.1111/j.1477-2574.2009.00128.x

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Poté N, Cauchy F, Albuquerque M, Voitot H, Belghiti J, Castera L, et al. Performance of PIVKA-II for Early Hepatocellular Carcinoma Diagnosis and Prediction of Microvascular Invasion. J Hepatol (2015) 62(4):848–54. doi: 10.1016/j.jhep.2014.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Zheng J, Seier K, Gonen M, Balachandran VP, Kingham TP, D’Angelica MI, et al. Utility of Serum Inflammatory Markers for Predicting Microvascular Invasion and Survival for Patients With Hepatocellular Carcinoma. Ann Surg Oncol (2017) 24(12):3706–14. doi: 10.1245/s10434-017-6060-7

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Liu J, Kuang S, Zheng Y, Liu M, Wang L. Prognostic and Predictive Significance of the Tumor Microenvironment in Hepatocellular Carcinoma. Cancer Biomarkers: Section A Dis Markers (2021) 32(1):99–100. doi: 10.3233/cbm-203003

CrossRef Full Text | Google Scholar

13. Qi L, Ma L, Wu F, Chen Y, Xing W, Jiang Z, et al. S100P as a Novel Biomarker of Microvascular Invasion and Portal Vein Tumor Thrombus in Hepatocellular Carcinoma. Hepatol Int (2021) 15(1):114–26. doi: 10.1007/s12072-020-10130-1

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Diller G-P, Kempny A, Babu-Narayan SV, Henrichs M, Brida M, Uebing A, et al. Machine Learning Algorithms Estimating Prognosis and Guiding Therapy in Adult Congenital Heart Disease: Data From a Single Tertiary Centre Including 10 019 Patients. Eur Heart J (2019) 40(13):1069–77. doi: 10.1093/eurheartj/ehy915

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Moll M, Qiao D, Regan EA, Hunninghake GM, Make BJ, Tal-Singer R, et al. Machine Learning and Prediction of All-Cause Mortality in COPD. Chest (2020) 158(3):952–64. doi: 10.1016/j.chest.2020.02.079

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Heo J, Yoon JG, Park H, Kim YD, Nam HS, Heo JH. Machine Learning–Based Model for Prediction of Outcomes in Acute Stroke. Stroke (2019) 50(5):1263–5. doi: 10.1161/STROKEAHA.118.024293

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Mobadersany P, Yousefi S, Amgad M, Gutman DA, Barnholtz-Sloan JS, Velázquez Vega JE, et al. Predicting Cancer Outcomes From Histology and Genomics Using Convolutional Networks. Proc Natl Acad Sci USA (2018) 115(13):E2970–E9. doi: 10.1073/pnas.1717139115

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Tao K, Bian Z, Zhang Q, Guo X, Yin C, Wang Y, et al. Machine Learning-Based Genome-Wide Interrogation of Somatic Copy Number Aberrations in Circulating Tumor DNA for Early Detection of Hepatocellular Carcinoma. EBioMedicine (2020) 56:102811–. doi: 10.1016/j.ebiom.2020.102811

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Ji G-W, Zhu F-P, Xu Q, Wang K, Wu M-Y, Tang W-W, et al. Machine-Learning Analysis of Contrast-Enhanced CT Radiomics Predicts Recurrence of Hepatocellular Carcinoma After Resection: A Multi-Institutional Study. EBioMedicine (2019) 50:156–65. doi: 10.1016/j.ebiom.2019.10.057

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Leung WK, Cheung KS, Li B, Law SYK, Lui TKL. Applications of Machine Learning Models in the Prediction of Gastric Cancer Risk in Patients After Helicobacter Pylori Eradication. Alimentary Pharmacol Ther (2021) 53(8):864–72. doi: 10.1111/apt.16272

CrossRef Full Text | Google Scholar

21. Deng G, Yao L, Zeng F, Xiao L, Wang Z. Nomogram for Preoperative Prediction of Microvascular Invasion Risk in Hepatocellular Carcinoma. Cancer Manage Res (2019) 11:9037–45. doi: 10.2147/CMAR.S216178

CrossRef Full Text | Google Scholar

22. Lei Z, Li J, Wu D, Xia Y, Wang Q, Si A, et al. Nomogram for Preoperative Estimation of Microvascular Invasion Risk in Hepatitis B Virus–Related Hepatocellular Carcinoma Within the Milan Criteria. JAMA Surg (2016) 151(4):356–63. doi: 10.1001/jamasurg.2015.4257

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Chen G, Wang R, Zhang C, Gui L, Xue Y, Ren X, et al. Integration of Pre-Surgical Blood Test Results Predict Microvascular Invasion Risk in Hepatocellular Carcinoma. Comput Struct Biotechnol J (2021) 19:826–34. doi: 10.1016/j.csbj.2021.01.014

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Xu X, Zhang HL, Liu QP, Sun SW, Zhang J, Zhu FP, et al. Radiomic Analysis of Contrast-Enhanced CT Predicts Microvascular Invasion and Outcome in Hepatocellular Carcinoma. J Hepatol (2019) 70(6):1133–44. doi: 10.1016/j.jhep.2019.02.023

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Feng S, Jia Y, Liao B, Huang B, Zhou Q, Li X, et al. Preoperative Prediction of Microvascular Invasion in Hepatocellular Cancer: A Radiomics Model Using Gd-EOB-DTPA-Enhanced MRI. Eur Radiol (2019) 29(9):4648–59. doi: 10.1007/s00330-018-5935-8

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Wu C, Zhou M, Liu P, Yang M. Analyzing COVID-19 Using Multisource Data: An Integrated Approach of Visualization, Spatial Regression, and Machine Learning. GeoHealth (2021) 5(8):e2021GH000439. doi: 10.1029/2021gh000439

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Zhou J, Sun HC, Wang Z, Cong WM, Wang JH, Zeng MS, et al. Guidelines for Diagnosis and Treatment of Primary Liver Cancer in China (2017 Edition). Liver Cancer (2018) 7(3):235–60. doi: 10.1159/000488035

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Sheng X, Ji Y, Ren G-P, Lu C-L, Yun J-P, Chen L-H, et al. A Standardized Pathological Proposal for Evaluating Microvascular Invasion of Hepatocellular Carcinoma: A Multicenter Study by LCPGC. Hepatol Int (2020) 14(6):1034–47. doi: 10.1007/s12072-020-10111-4

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Imbalanced-Learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning . Available at: https://jmlr.org/papers/v18/16-365.

Google Scholar

30. Richardson AM, Joshy G, D’Este CA. Understanding Statistical Principles in Linear and Logistic Regression. Med J Aust (2018) 208(8):332–4. doi: 10.5694/mja17.00222

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Janßen R, Zabel J, von Lukas U, Labrenz M. An Artificial Neural Network and Random Forest Identify Glyphosate-Impacted Brackish Communities Based on 16S Rrna Amplicon Miseq Read Counts. Marine Pollution Bull (2019) 149:110530. doi: 10.1016/j.marpolbul.2019.110530

CrossRef Full Text | Google Scholar

32. Almansour NA, Syed HF, Khayat NR, Altheeb RK, Juri RE, Alhiyafi J, et al. Neural Network and Support Vector Machine for the Prediction of Chronic Kidney Disease: A Comparative Study. Comput Biol Med (2019) 109:101–11. doi: 10.1016/j.compbiomed.2019.04.017

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Su F, Yuan P, Wang Y, Zhang C. The Superior Fault Tolerance of Artificial Neural Network Training With a Fault/Noise Injection-Based Genetic Algorithm. Protein Cell (2016) 7(10):735–48. doi: 10.1007/s13238-016-0302-5

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Bi Y, Xiang D, Ge Z, Li F, Jia C, Song J. An Interpretable Prediction Model for Identifying N(7)-Methylguanosine Sites Based on Xgboost and SHAP. Mol Ther Nucleic Acids (2020) 22:362–72. doi: 10.1016/j.omtn.2020.08.022

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Hathaway QA, Roth SM, Pinti MV, Sprando DC, Kunovac A, Durr AJ, et al. Machine-Learning to Stratify Diabetic Patients Using Novel Cardiac Biomarkers and Integrative Genomics. Cardiovasc Diabetol (2019) 18(1):78–. doi: 10.1186/s12933-019-0879-0

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Chen Y, Xia Y, Tolat PP, Long L, Jiang Z, Huang Z, et al. Comparison of Conventional Gadoxetate Disodium-Enhanced MRI Features and Radiomics Signatures With Machine Learning for Diagnosing Microvascular Invasion. AJR Am J Roentgenol (2021) 216(6):1510–20. doi: 10.2214/ajr.20.23255

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Dong Y, Zhou L, Xia W, Zhao X, Zhang Q, Jian J, et al. Preoperative Prediction of Microvascular Invasion in Hepatocellular Carcinoma: Initial Application of a Radiomic Algorithm Based on Grayscale Ultrasound Images. Front Oncol (2020) 10:353. doi: 10.3389/fonc.2020.00353

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Chen Y, Xia Y, Tolat PP, Long L, Jiang Z, Huang Z, et al. Comparison of Conventional Gadoxetate Disodium–Enhanced MRI Features and Radiomics Signatures With Machine Learning for Diagnosing Microvascular Invasion. Am J Roentgenology (2021) 216(6):1510–20. doi: 10.2214/AJR.20.23255

CrossRef Full Text | Google Scholar

39. Roayaie S, Blume IN, Thung SN, Guido M, Fiel MI, Hiotis S, et al. A System of Classifying Microvascular Invasion to Predict Outcome After Resection in Patients With Hepatocellular Carcinoma. Gastroenterology (2009) 137(3):850–5. doi: 10.1053/j.gastro.2009.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Zhang K, Tao C, Siqin T, Wu J, Rong W. Establishment, Validation and Evaluation of Predictive Model for Early Relapse After R0 Resection in Hepatocellular Carcinoma Patients With Microvascular Invasion. J Transl Med (2021) 19(1):293. doi: 10.1186/s12967-021-02940-0

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Hwang S, Song GW, Lee YJ, Kim KH, Ahn CS, Moon DB, et al. Multiplication of Tumor Volume by Two Tumor Markers Is a Post-Resection Prognostic Predictor for Solitary Hepatocellular Carcinoma. J Gastrointest Surg (2016) 20(11):1807–20. doi: 10.1007/s11605-016-3187-y

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Qi F, Zhou A, Yan L, Yuan X, Wang D, Chang R, et al. The Diagnostic Value of PIVKA-II, AFP, AFP-L3, CEA, and Their Combinations in Primary and Metastatic Hepatocellular Carcinoma. J Clin Lab Anal (2020) 34(5):e23158. doi: 10.1002/jcla.23158

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Hirokawa F, Hayashi M, Miyamoto Y, Asakuma M, Shimizu T, Komeda K, et al. Outcomes and Predictors of Microvascular Invasion of Solitary Hepatocellular Carcinoma. Hepatol Res (2014) 44(8):846–53. doi: 10.1111/hepr.12196

PubMed Abstract | CrossRef Full Text | Google Scholar

44. You Z, Chen LP, Ye H. Predictors of Microvascular Invasion in Patients With Solitary Small Hepatitis B Related Hepatocellular Carcinoma. Pak J Med Sci (2014) 30(2):331–4. doi: 10.12669/pjms.302.4652

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Åberg F, Danford CJ, Thiele M, Talbäck M, Rasmussen DN, Jiang ZG, et al. A Dynamic Aspartate-to-Alanine Aminotransferase Ratio Provides Valid Predictions of Incident Severe Liver Disease. Hepatol Commun (2021) 5(6):1021–35. doi: 10.1002/hep4.1700

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Yang J, He X, Huang B, Zhang H, He Y. Rule of Changes in Serum GGT Levels and GGT/ALT and AST/ALT Ratios in Primary Hepatic Carcinoma Patients With Different AFP Levels. Cancer biomarkers: section A Dis Markers (2018) 21(4):743–6. doi: 10.3233/cbm-170088

CrossRef Full Text | Google Scholar

47. Dong W, Yan K, Yu H, Huo L, Xian Z, Zhao Y, et al. Prognostic Nomogram for Sorafenib Benefit in Hepatitis B Virus-Related Hepatocellular Carcinoma After Partial Hepatectomy. Front Oncol (2020) 10:605057. doi: 10.3389/fonc.2020.605057

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: microvascular invasion, non-invasive predictive models, machine learning, extreme gradient boosting (XGBoost), hepatocellular carcinoma

Citation: Liu W, Zhang L, Xin Z, Zhang H, You L, Bai L, Zhou J and Ying B (2022) A Promising Preoperative Prediction Model for Microvascular Invasion in Hepatocellular Carcinoma Based on an Extreme Gradient Boosting Algorithm. Front. Oncol. 12:852736. doi: 10.3389/fonc.2022.852736

Received: 11 January 2022; Accepted: 11 February 2022;
Published: 04 March 2022.

Edited by:

Qingfeng Zhu, Johns Hopkins Medicine, United States

Reviewed by:

Andrea Laurenzi, University Hospital of Bologna Policlinico S. Orsola-Malpighi, Italy
Yang Deng, Shandong First Medical University, China

Copyright © 2022 Liu, Zhang, Xin, Zhang, You, Bai, Zhou and Ying. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Juan Zhou, zhoujuan39@qq.com; Binwu Ying, docbwy@126.com

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.