Development of prognostic models for advanced multiple hepatocellular carcinoma based on Cox regression, deep learning and machine learning algorithms

Shen, Jie; Zhou, Yu; Pei, Junpeng; Yang, Dashuai; Zhao, Kailiang; Ding, Youming

doi:10.3389/fmed.2024.1452188

ORIGINAL RESEARCH article

Front. Med., 27 September 2024

Sec. Hepatobiliary Diseases

Volume 11 - 2024 | https://doi.org/10.3389/fmed.2024.1452188

This article is part of the Research TopicHepatocellular Carcinoma: From Diagnostic Approaches to Surgical and Systemic TherapiesView all 18 articles

Development of prognostic models for advanced multiple hepatocellular carcinoma based on Cox regression, deep learning and machine learning algorithms

Jie Shen¹^†

Yu Zhou¹^†

Junpeng Pei²

Dashuai Yang¹

Kailiang Zhao¹^*

Youming Ding¹^*

¹Department of Hepatobiliary Surgery, Renmin Hospital of Wuhan University, Wuhan, China
²Department of Hepatobiliary Surgery, 521 Hospital of Norinco Group, Xi’an, China

Background: Most patients with multiple hepatocellular carcinoma (MHCC) are at advanced stage once diagnosed, so that clinical treatment and decision-making are quite tricky. The AJCC-TNM system cannot accurately determine prognosis, our study aimed to identify prognostic factors for MHCC and to develop a prognostic model to quantify the risk and survival probability of patients.

Methods: Eligible patients with HCC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database, and then prognostic models were built using Cox regression, machine learning (ML), and deep learning (DL) algorithms. The model’s performance was evaluated using C-index, receiver operating characteristic curve, Brier score and decision curve analysis, respectively, and the best model was interpreted using SHapley additive explanations (SHAP) interpretability technique.

Results: A total of eight variables were included in the follow-up study, our analysis identified that the gradient boosted machine (GBM) model was the best prognostic model for advanced MHCC. In particular, the GBM model in the training cohort had a C-index of 0.73, a Brier score of 0.124, with area under the curve (AUC) values above 0.78 at the first, third, and fifth year. Importantly, the model also performed well in test cohort. The Kaplan–Meier (K-M) survival analysis demonstrated that the newly developed risk stratification system could well differentiate the prognosis of patients.

Conclusion: Of the ML models, GBM model could predict the prognosis of advanced MHCC patients most accurately.

1 Introduction

Hepatocellular carcinoma (HCC), the sixth most common cancer in the world, has an insidious onset, rapid progression and poor prognosis, making it more difficult to treat (1). Accurately assessing the prognosis of HCC patients may provide clinicians reference values to develop more effective treatment plans. AJCC-TNM and Barcelona Clinic Liver Cancer (BCLC) staging are the most commonly used staging systems for HCC, but they are unable to take into account the effects of treatment, age, and other important factors, thus, they seem to have poor accuracy (2). Recently, a large number of scholars have used nomograms to study cancer (3, 4), which were built based on multifactorial Cox regression analyses with fixed weights assigned, and the accuracy is sometimes unsatisfactory (5). Machine Learning (ML) enables computers to learn from large-scale, disparate healthcare data and then make decisions or predictions without being explicitly programmed. ML models offer considerable advantages over traditional statistical models for tasks such as diagnosis, classification and survival prediction (6, 7). DL is a branch of ML that uses a ML technique called artificial neural networks to extract patterns and make predictions from large datasets, and is particularly well suited to solving complex computational problems (8).

MHCC is classified into two types, one is intrahepatic metastasis, which is the result of intrahepatic metastasis of solitary tumor nodule, and the other is multicentric origin, which is the primary HCC (9). There are few studies on the prognosis of MHCC, making treatment more difficult (10). Previous studies have revealed that tumor size, Alpha-Fetal Protein (AFP) level, surgical treatment, microvascular invasion and hepatic functional status were important risk factors affecting patients’ recurrence or Overall Survival (OS) (11–13). As for surgery, although many studies have consistently shown that surgery is beneficial in MHCC (11, 14, 15), there are no studies that indicate whether patients with advanced MHCC can benefit from it.

The aim of this study is to construct prognostic models based on Cox regression, ML and DL algorithms using a large dataset from the SEER database, to predict the prognosis of patients with advanced MHCC, thus helping clinicians to optimize their decisions.

2 Methods

2.1 Selection of patients and study variables

Data on patients, who were diagnosed with HCC between 2000 and 2020, were obtained with SEER*Stat software (version 8.4.2). The SEER database is publicly accessible and does not require approval by the ethics institutional review board. External validation data were obtained from Renmin Hospital of Wuhan University. Variables included in the study were age, sex, race, tumor size, tumor primary site and regional lymph surgery information, months from diagnosis to treatment, AJCC-TNM stage, histological grade, radiotherapy, chemotherapy, AFP, sequence of systemic therapy and surgery, number and sequence of malignant tumors, a total of 15 variables. Patient inclusion criteria: (1) Patients diagnosed with HCC between 2000 and 2020 (histologic type International Classification of Disease for Oncology third edition = 8,170–8,175) (ICD-O3); (2) CS extension records as multiple nodules; (3) TNM stage was stage III or IV. The exclusion criteria are as follows: (1) Data missing or not clearly recorded, grouping disputed data; (2) Survival time is not recorded or less than 1 month; (3) A patient has two or more medical records, the last one shall prevail. The detailed selection process was shown in Figure 1.

Figure 1

Figure 1. Flow chart of patients’ selection in the training and test cohorts from the SEER database.

2.2 Variable selection and construction of prognostic models

We randomly divided 1707 advanced MHCC patients into a training cohort and test cohort in a 7:3 ratio. Univariate and multivariate Cox were successively used to screen variables with prognostic significance, that is, Variables with hazard ratio (HR) more or less than 1 and statistically significant were retained. Use R software (version 4.2.1), open-source Python library scikit-survival (version 0.21.0) and PyTorch (Python version 3.11.4) to build prediction models (16).

2.3 Evaluation and selection of the best prediction model

Calculating C-index and Brier score to assess the accuracy of model prediction, receiver operating characteristic (ROC) curves and decision curve analysis (DCA) curves for the 1st, 3rd, and 5th year were then continued to be plotted to compare the accuracy of the models and potential clinical benefit (17). We determined the best cut off value for risk grouping by X-tile software, then K-M curves were used to compare the differences in OS of advanced MHCC patients in different risk stratification groups.

2.4 Interpretation of GBM model

The explanation of the model was divided into two parts: SHAP plot and the prediction website based on JAVA. SHAP is a model interpretation package developed in Python, for each prediction sample, the SHAP value is assigned to each feature. The larger the absolute value of SHAP, the greater the influence of the feature, and the sign of the value indicates whether the feature has a positive or negative effect on the result (18, 19). In order to better present the results and make it easier for the reader to use the model, an interactive website was established. By entering the required clinical information, 1-, 3-, and 5-year survival probability and risk score can be automatically calculated.

2.5 Statistical analysis

All statistical analyses were performed by R software (version 4.2.1.) and Python (version 3.11.4.) The “survival” package and “survminer” package were used for univariate and multivariate Cox regression analysis, forest mapping. Hazard ratio (HR) > 1 indicates that the factor is a risk factor, while HR < 1 indicates it is a protective factor. The “rms” package was used to draw the nomogram. Survival distributions were compared using the log-rank test. All tests were two-sided and p values less than 0.05 were considered statistically significant.

3 Results

3.1 Baseline characteristics in the training and test cohorts

A total of 1707 advanced MHCC patients were enrolled in our study, including 1,195 (70%) in the training cohort and 512 (30%) in the internal test cohort, the information for the external cohort can be obtained from Supplement Sheet 1. Most of these patients only had HCC, and their histological grading was in grades I to III, with a predominance of grade II. The vast majority of patients were at stage IIIA in the AJCC-TNM staging, i.e., there were multiple lesions in liver and any one of the lesions was more than 5 cm in size without lymph node or major vascular invasion. Because of this, the vast majority of tumor size was greater than 5 cm, but dimensions greater than 10 cm were rare. AFP is often considered as a marker for HCC, although the sensitivity and specificity are not satisfactory. In this study, AFP was abnormal in more than 70% of patients with advanced MHCC. In terms of treatment, not many patients were treated immediately after being diagnosed, they were more likely to choose to receive treatment after one to 2 months, and, of course, more than 10% of patients still went for treatment in the fourth month or later. There is a gap in research regarding surgery in advanced MHCC. Nearly 70% of the patients in this study did not undergo surgical treatment, still more than 20% underwent partial hepatectomy, in addition, almost all patients did not undergo lymph node dissection. Table 1 detailed the baseline information of the patients with advanced MHCC in this study.

Table 1

Table 1. Demographic and clinical characteristics of patients with advanced MHCC.

3.2 Screening for statistically significant prognostic factors

A total of 15 variables were included in the study, after univariate Cox analysis, as shown in Supplementary Table S1, four variables: age, race, number of malignant tumors and radiotherapy were excluded. A multivariate analysis was conducted immediately afterward, the results showed that TNM stage, histological grade, months from diagnosis to treatment, primary site surgery, tumor size, regional lymph surgery, AFP and sequence of malignant tumors were independent prognostic factors for patients with advanced MHCC, therefore, a total of 8 prognostic factors with statistical significance (Figure 2A).

Figure 2

Figure 2. Demonstration of multivariate Cox regression analysis and analysis of patients in different months from diagnosis to treatment. (A) Forest plot based on multivariate Cox regression analysis. (B) Bar plot of important features of advanced MHCC patients in different months from diagnosis to treatment. The vertical coordinate is the percentage of the feature subgroup in the group.

As shown in the figure, it is clear that patients with AJCC-TNM staging at stage IV had a higher risk of death than those at stage IIIA, and stage IIIB may be a false positive because the proportion of patients was too small. The results regarding histologic grade were consistent with popular knowledge that the lower the degree of differentiation, the correspondingly lower the OS of the patient. Surprisingly, “time interval from diagnosis to treatment” was not the factor that patients who were treated immediately had a better prognosis, patients who were treated immediately after diagnosis or who received treatment a month later had a significantly higher risk of death than those who were delayed for 4 months or more. We tried to analyze whether it was influenced by other factors, selecting some of the important ones. Figure 2B showed that “zero” or “one” group had a significantly lower proportion of patients in stage IIIA than the “4 or more” group, and a significantly higher proportion in stage IV (Figure 2B). The same trend was observed in the factor histological grade, so they may have influenced the significance of the factor “months from diagnosis to treatment” on prognosis. As for surgery for tumor lesions, liver transplantation (LT) remained the best treatment modality, greatly reducing the risk of death, and failure to undergo surgery appeared to be the highest risk. In this study, we did not find significant variability between subgroups of tumor size and subgroups of regional lymph surgery. The risk of death was significantly higher in the AFP-positive group than in the negative group, and surprisingly, the risk of death in MHCC patients who recurred other primary tumors was instead lower than that of MHCC only, which we discussed in the Discussion section.

3.3 Evaluation and comparison of prognostic models

Based on the training cohort, we first constructed a nomogram model using R software (Figure 3A), which is a visualization of multivariate Cox regression analysis with the same performance as Cox proportional hazards (CPH) model (20). Nomogram is convenient to use, but it is not hard to notice from Supplementary Table S2 that although its Brier score is not high, its C-index is 0.71, which is unsatisfactory. So based on ML and DL algorithms, we constructed CPH, survival tree, random survival forest (RSF), GBM and DeepSurv model, a total of five models, and optimized the parameters of models with five-fold cross-validation (Supplementary Table S3; Supplement Sheet 1). We first calculated their C-index, Brier score to evaluate the models as shown in Supplementary Table S2. Obviously, the GBM model performed the best with a high C-index of 0.73 and a low Brier score of 0.111. We then plotted the ROC curves for the 1st, 3rd, and 5th year of the five models (Figures 3B–D), and we can note that the GBM model always had the highest area under the curve (AUC) values, followed by the DeepSurv model. Interestingly, the AUC values gradually increased with time, suggesting that the GBM model is more accurate in predicting long-term prognosis. DCA curves showed (Figures 3E–G) that using our models to guide treatment can bring benefits to patients, with the GBM and DeepSurv models leading to more benefits for patients with advanced MHCC. In summary, it is not difficult to conclude that the GBM model outperformed the other models, so we selected the GBM model for subsequent evaluation and research.

Figure 3

Figure 3. Nomogram of patients with advanced MHCC and evaluation of the performance of the five models. (A) Nomogram of patients with advanced MHCC. (B–D) ROC curves for prognostic models predicting 1-, 3-, and 5-year OS in the training cohort. (E–G) DCA curves of prognostic models for 1-year, 3-year, and 5-year OS prediction in the training cohort.

3.4 Validation of GBM performance and development of a risk stratification system

We performed internal and external tests. Internal and external test cohorts consisted of 512 and 41 patients, respectively. The mean AUC values of GBM model over the period from the 1st to the 72nd month was 0.772 and increased over time (Figure 4A). In external cohort, the average AUC value was 0.771, which is surprisingly high in the first year (Figure 4B) its C-index was 0.702 and Brier score was 0.129 in internal test cohort, with C-index of 0.691 and Brier score of 0.136 in external test cohort (Supplementary Table S4). Calibration curves revealed that the model’s predictions were highly consistent with the actual situation (Figures 4C–E). Therefore, the GBM model still performed well in the test cohort. Figure 4F showed the poor ability of TNM stage to differentiate patients’ prognosis (Figure 4F), to assess the model’s ability to differentiate patients’ OS, we developed a risk stratification system based on the total risk score of each patient in the training cohort and determined the optimal cut off value using X-tile software (Figure 4G). Patient risk scores were determined from the GBM model’s predictions and they ranged from between −1.7 and 2.1, with lower than −0.1 being low risk, higher than 1.0 being high risk, and in between being intermediate risk. Following this, we plotted the K-M survival curves for the three risk subgroups (Figure 4H), which showed significant differences in prognosis among the different subgroups, with the high-risk group having the worst prognosis and the low-risk group having a better prognosis. The prognosis of external test cohort was similarly well differentiated (Figure 4I).

Figure 4

Figure 4. Validation of the GBM model and development of new risk stratification system. (A, B) Time-dependent AUC for the GBM model in internal test cohort (A) and external test cohort (B). (C-E) Calibration curves of first (C), third (D) and fifth (E) year in the internal test cohort. (F) Survival curves based on AJCC-TNM stage. (G) Cut off values for optimal grouping determined using X-tile. (H) K-M survival curves based on new risk stratification system. (I) K-M survival curves of external test cohort based on new risk stratification system (Only one of these patients was high risk and was merged into the intermediate risk group).

3.5 Interpretation of GBM model and feature importance

Features with higher mean Shapley values are more important for prognosis, and in the SHAP plot (Figure 5A), the features were listed in descending order of importance. Among them, whether the tumor primary site was operated on was the most important. In addition, a positive SHAP value increases the probability of death, i.e., the higher the value, the higher the risk of death, and vice versa. The results suggested that histological grade of grade III and TNM stage of stage IV increased the probability of death. As for “tumor primary site surgery,” no surgery generally increased the probability of death, but it is not difficult to find that in a considerable number of cases, no surgery would increase the probability of survival. Three patients from the training cohort were selected for the prognostic demonstration (Figures 5B–D). The first patient underwent partial hepatectomy, which increased the probability of death, while the next two patients had the opposite effect, with an increase in the probability of death due to no surgical intervention. Therefore, many patients with advanced MHCC may have lost the opportunity for surgery at the time of diagnosis, and it is necessary to strictly grasp the indications for surgery in order to make the patients benefit from surgery. To facilitate the use of our prognostic model by clinicians, we built a website,¹ which allows users to directly input their own data for prediction of OS and risk score. Controlling for the same other features and then inputting a different treatment to determine if the prediction improves or decreases, by which they can also preliminarily assess whether a treatment is beneficial.

Figure 5

Figure 5. The SHAP plot of the GBM model. (A) SHAP beeswarm summary plot on the impact of input variables on the GBM model’s prediction. (B) The local SHAP plot of patient #1. Patient #1: 74-year-old male, survival time was 96 months, alive. AJCC TNM stage was IIIA, Histological grade was II, tumor size = 6.0 cm, AFP was positive. She was treated 2 months after diagnosis, underwent partial hepatectomy and regional lymph surgery, only had HCC in his life. (C) The local SHAP plot of patient #2. Patient #2: 42-year-old male, survival time was 2 months, died. AJCC TNM stage was IV, Histological grade was III, tumor size = 13.0 cm, AFP was negative. She was treated 2 months after diagnosis, no tumor site and regional lymph surgery, only had HCC in his life. (D) The local SHAP plot of patient #3. Patient #3: 82-year-old male, survival time was 7 months, died. AJCC TNM stage was IIIA, Histological grade was II, tumor size = 5.9 cm, AFP was negative. She was treated 1 month after diagnosis, no tumor site and regional lymph surgery. Only had HCC in his life. The red ribbons in the local SHAP plot represent risk factors that lead to a poor prognosis, whereas the blue ribbons are the relatively protective factors.

4 Discussion

The morbidity and mortality rates of HCC are increasing annually, and the treatment of MHCC is more complicated than that of solitary HCC, and once it reaches an advanced stage, the prognosis of the patient is quite dismal (21). Importantly, clinicians need to balance commonly used treatments at this stage, and there is a lack of effective predictive models to the extent that some patients are not treated rationally enough (22). Our research is an attempt to build predictive models for advanced MHCC patients using well-established Cox regression, ML and DL algorithms.

Our results indicated that the GBM model had the best prediction accuracy with a C-index of 0.730 and a Brier score of 0.111, and the AUCs for the 1st year, 3rd year, and 5th year were higher than 0.78 with an increasing trend. In addition, the GBM model still performed well in the test cohort, which demonstrated that our model is quite reliable in terms of prediction accuracy. The DCA curves indicated that the use of our GBM model maximized the survival benefit for patients with advanced MHCC. DeepSurv model uses a DL neural network to integrate Cox proportional hazards, which performed slightly weaker with the GBM model in this study.

According to Cox regression analysis, our model included 8 variables, which were shown in Table 1. The higher the histological grade, the worse the differentiation, the later the TNM stage, and the worse the OS of HCC patients, which has been recognized by the public. AFP is currently the most commonly used tumor marker for HCC, and according to the Asian HCC guidelines, the serum biomarker AFP is recommended as one of the monitoring and diagnostic tools for HCC (23, 24), however, many non-cancer sources involving liver and other organs may also lead to elevated AFP and thus have lower sensitivity and specificity (25). Limited literatures addressed the clinical significance of regional lymph node dissection during surgery in patients with HCC, a study by Yang et al. based on the SEER database reported that regional lymph node dissection was not an independent prognostic factor for OS (26). Another report showed a significantly higher incidence of postoperative ascites and a significantly lower overall tumor recurrence rate for liver surgery combined with regional lymph node dissection versus no lymph node dissection, although there was no difference in OS rates (27). The clinical significance of regional lymph node dissection in advanced MHCC remains to be studied. As for surgery for primary tumor sites, multivariate Cox results demonstrated that liver surgery improved OS for patients with advanced MHCC in general, and liver transplantation in particular. However, the subsequent SHAP figure indicated that a considerable number of patients with advanced MHCC were not suitable for surgical treatment, and no surgery was a kind of protection. Although a number of studies on MHCC have shown that hepatectomy (28, 29), LT, and even combined ablation therapy were effective treatment strategies for MHCC (30), Bartolini et al. (11) reported that surgery should be subject to strict indications in order to benefit specific patients, especially for patients with advanced MHCC. Our model may help clinicians make decisions, but further test is needed. For solitary HCC, tumor size often affects treatment and prognosis (31), but in our study, for advanced MHCC, there did not appear to be a significant difference in risk of death between different tumor sizes.

Interestingly, sequence of malignant tumors and interval from diagnosis to treatment showed results that seemed to differ from popular perception. Patients with HCC alone had worse OS than those who developed other primary tumors after HCC, and we found that other researchers have reported similar results (32, 33). They noted that patients with only one cancer may die prematurely due to poor health or a higher degree of malignancy of the tumor, with no chance of getting other tumors, and that re-emergence of other tumors occurs only in patients who have been survival for a long time. Secondly, patients with HCC only may have defective immune surveillance, leading to “immune escape,” while reoccurrence of other tumors may activate cancer-related immune mechanisms. Finally, patients who re-emerge with other tumors will inevitably receive additional anti-tumor treatments, and these subsequent treatments may act as concurrent anti-HCC therapies. There is no uniformity in the literature regarding the impact of the time interval between diagnosis and treatment on prognosis. One study reported that time delay from diagnosis to treatment did not significantly affect OS in HCC patients (34), but Tsai et al. reported that the longer the time interval between diagnosis and treatment of early liver cancer, the lower the OS was (35). Therefore, randomized controlled trials may be needed to clarify the clinical significance of this factor.

Our research is advanced. We first applied multiple algorithms to construct prognostic models for patients with advanced MHCC, which were evaluated by multiple methods, and ultimately the more superior GBM model was selected among five models. To our knowledge, this is the first model for advanced MHCC. Visualization and application promotion of ML models are difficult problems, we used SHAP technique for model interpretation and built a prediction website to solve this problem well. Despite a substantial amount of published research indicating that AI-based systems demonstrate significant advantages in improving the accuracy and efficiency of HCC screening, diagnosis, and tumor characterization, there is still a need for rigorous multicenter prospective validation studies and the validation of standardized multimodal datasets (36, 37). Secondly, the SEER database only covers cancer data in the U.S. Our study would be more convincing if more data were obtained. Due to the limitations of the SEER database, some variables that may be important, such as BCLC stage, genetic factors and targeted therapies, are not available. Having access to these variables may improve the performance of the model.

5 Conclusion

A total of eight variables were independent prognostic factors, which were included in the model to predict the prognosis of patients with advanced MHCC, and the GBM model could provide a more accurate prediction of patients’ OS.

Data availability statement

Publicly available datasets were analyzed in this study. This data can be found at: the Surveillance, Epidemiology, and End Results (SEER) database (https://seer.cancer.gov/mortality/).

Ethics statement

The studies involving humans were approved by the Clinical Research Ethics Committee, Renmin Hospital of Wuhan University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

JS: Writing – review & editing, Writing – original draft, Visualization, Validation, Software, Methodology, Investigation, Data curation, Conceptualization. YZ: Writing – original draft, Resources, Methodology, Formal analysis. JP: Writing – review & editing, Resources, Data curation. DY: Writing – review & editing, Validation, Methodology. KZ: Writing – review & editing, Project administration, Funding acquisition. YD: Writing – review & editing, Project administration, Funding acquisition.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was funded by the national key research and development program of China (2022YFC2407304), Natural Science Foundation of Hubei Province (2022CFB122) and National Natural Science Foundation of China (82370654).

Acknowledgments

Thanks to Yang Hao for his contributions in building online websites using JAVA.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2024.1452188/full#supplementary-material

Footnotes

1. ^http://39.101.130.191:8888/mhcc

References

1. Qi, W, Dai, J, Qiu, Z, Wu, Y, Wen, T, Xie, F, et al. Nomogram to predict liver surgery-specific complications for hepatocellular carcinoma: a multicenter study. Eur J Surg Oncol. (2023) 49:107119. doi: 10.1016/j.ejso.2023.107119

PubMed Abstract | Crossref Full Text | Google Scholar

2. Torimura, T, and Iwamoto, H. Treatment and the prognosis of hepatocellular carcinoma in Asia. Liver Int. (2022) 42:2042–54. doi: 10.1111/liv.15130

Crossref Full Text | Google Scholar

3. Wang, Y, Li, J, Xia, Y, Gong, R, Wang, K, Yan, Z, et al. Prognostic nomogram for intrahepatic cholangiocarcinoma after partial hepatectomy. J Clin Oncol Off J Am Soc Clin Oncol. (2013) 31:1188–95. doi: 10.1200/jco.2012.41.5984

Crossref Full Text | Google Scholar

4. Hyder, O, Marques, H, Pulitano, C, Marsh, JW, Alexandrescu, S, Bauer, TW, et al. A nomogram to predict long-term survival after resection for intrahepatic cholangiocarcinoma: an eastern and Western experience. JAMA Surg. (2014) 149:432–8. doi: 10.1001/jamasurg.2013.5168

PubMed Abstract | Crossref Full Text | Google Scholar

5. Ji, GW, Jiao, CY, Xu, ZG, Li, XC, Wang, K, and Wang, XH. Development and validation of a gradient boosting machine to predict prognosis after liver resection for intrahepatic cholangiocarcinoma. BMC Cancer. (2022) 22:258. doi: 10.1186/s12885-022-09352-3

PubMed Abstract | Crossref Full Text | Google Scholar

6. Ngiam, KY, and Khor, IW. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. (2019) 20:e262–73. doi: 10.1016/s1470-2045(19)30149-4

Crossref Full Text | Google Scholar

7. Addissouky, TA, Sayed, IETE, Ali, MMA, Wang, Y, Baz, AE, Khalil, AA, et al. Latest advances in hepatocellular carcinoma management and prevention through advanced technologies. Egypt Liver J. (2024) 14:2. doi: 10.1186/s43066-023-00306-3

Crossref Full Text | Google Scholar

8. Tran, KA, Kondrashova, O, Bradley, A, Williams, ED, Pearson, JV, and Waddell, N. Deep learning in cancer diagnosis, prognosis and treatment selection. Genome Med. (2021) 13:152. doi: 10.1186/s13073-021-00968-x

PubMed Abstract | Crossref Full Text | Google Scholar

9. Yamamoto, S, Midorikawa, Y, Nagae, G, Tatsuno, K, Ueda, H, Moriyama, M, et al. Spatial and temporal expansion of intrahepatic metastasis by molecularly-defined clonality in multiple liver cancers. Cancer Sci. (2020) 111:601–9. doi: 10.1111/cas.14282

PubMed Abstract | Crossref Full Text | Google Scholar

10. Li, C, Liu, JY, Peng, W, Wen, TF, Yan, LN, Yang, JY, et al. Liver resection versus transplantation for multiple hepatocellular carcinoma: a propensity score analysis. Oncotarget. (2017) 8:81492–500. doi: 10.18632/oncotarget.20623

PubMed Abstract | Crossref Full Text | Google Scholar

11. Bartolini, I, Nelli, T, Russolillo, N, Cucchetti, A, Pesi, B, Moraldi, L, et al. Multiple hepatocellular carcinoma: long-term outcomes following resection beyond actual guidelines. An Italian multicentric retrospective study. Am J Surg. (2021) 222:599–605. doi: 10.1016/j.amjsurg.2021.01.023

PubMed Abstract | Crossref Full Text | Google Scholar

12. Shindoh, J, Kobayashi, Y, Kawamura, Y, Akuta, N, Kobayashi, M, Suzuki, Y, et al. Microvascular invasion and a size cutoff value of 2 cm predict long-term oncological outcome in multiple hepatocellular carcinoma: reappraisal of the American joint committee on Cancer staging system and validation using the surveillance, epidemiology, and end-results database. Liver Cancer. (2020) 9:156–66. doi: 10.1159/000504193

PubMed Abstract | Crossref Full Text | Google Scholar

13. Ryu, T, Takami, Y, Wada, Y, Sasaki, S, Imamura, H, Ureshino, H, et al. Combined hepatectomy and microwave ablation for multifocal hepatocellular carcinoma: long-term outcomes and prognostic factors. Asian J Surg. (2021) 44:186–91. doi: 10.1016/j.asjsur.2020.05.008

PubMed Abstract | Crossref Full Text | Google Scholar

14. Nojiri, K, Tanaka, K, Takeda, K, Ueda, M, Matsuyama, R, Taniguchi, K, et al. The efficacy of liver resection for multinodular hepatocellular carcinoma. Anticancer Res. (2014) 34:2421–6.

PubMed Abstract | Google Scholar

15. Li, ZL, Yu, JJ, Guo, JW, Sui, CJ, Dai, BH, Zhang, WG, et al. Liver resection is justified for multinodular hepatocellular carcinoma in selected patients with cirrhosis: a multicenter analysis of 1,066 patients. Eur J Surg Oncol. (2019) 45:800–7. doi: 10.1016/j.ejso.2018.12.016

PubMed Abstract | Crossref Full Text | Google Scholar

16. Pölsterl SJJMLR. Scikit-survival: a library for time-to-event analysis built on top of scikit-learn. JMLR. (2020) 21:1–6.

Google Scholar

17. Dankers, F, Traverso, A, Wee, L, and van Kuijk, SMJ. Prediction modeling methodology In: P Kubben, M Dumontier, and A Dekker, editors. Fundamentals of clinical data science. Springer: Cham (2019). 101–20.

Google Scholar

18. Lundberg, SM, and Lee, S-I. A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems; California, USA: Curran Associates Inc; (2017). 4768–4777.

Google Scholar

19. Lin, J, Yin, M, Liu, L, Gao, J, Yu, C, Liu, X, et al. The development of a prediction model based on random survival Forest for the postoperative prognosis of pancreatic Cancer: a SEER-based study. Cancers. (2022) 14:667. doi: 10.3390/cancers14194667

PubMed Abstract | Crossref Full Text | Google Scholar

20. Yang, L, Zhou, Y, Wang, G, Liu, D, Chen, B, Pu, D, et al. Clinical features and prognostic factors of combined small cell lung cancer: development and validation of a nomogram based on the SEER database. Transl Lung Cancer Res. (2021) 10:4250–65. doi: 10.21037/tlcr-21-804

PubMed Abstract | Crossref Full Text | Google Scholar

21. Cassese, G, Han, HS, Lee, E, Lee, B, Lee, HW, Cho, JY, et al. Laparoscopic versus open liver resection for multiple hepatocellular carcinoma within and beyond the Milan criteria: an eastern-Western propensity score-matched analysis. J Hepatobiliary Pancreat Sci. (2024) 31:2–11. doi: 10.1002/jhbp.1384

PubMed Abstract | Crossref Full Text | Google Scholar

22. Colagrande, S, Inghilesi, AL, Aburas, S, Taliani, GG, Nardi, C, and Marra, F. Challenges of advanced hepatocellular carcinoma. World J Gastroenterol. (2016) 22:7645–59. doi: 10.3748/wjg.v22.i34.7645

PubMed Abstract | Crossref Full Text | Google Scholar

23. Omata, M, Cheng, AL, Kokudo, N, Kudo, M, Lee, JM, Jia, J, et al. Asia-Pacific clinical practice guidelines on the management of hepatocellular carcinoma: a 2017 update. Hepatol Int. (2017) 11:317–70. doi: 10.1007/s12072-017-9799-9

PubMed Abstract | Crossref Full Text | Google Scholar

24. Xie, DY, Ren, ZG, Zhou, J, Fan, J, and Gao, Q. Critical appraisal of Chinese 2017 guideline on the management of hepatocellular carcinoma. Hepatob Surg Nutr. (2017) 6:387–96. doi: 10.21037/hbsn.2017.11.01

PubMed Abstract | Crossref Full Text | Google Scholar

25. Wang, W, and Wei, C. Advances in the early diagnosis of hepatocellular carcinoma. Genes Dis. (2020) 7:308–19. doi: 10.1016/j.gendis.2020.01.014

PubMed Abstract | Crossref Full Text | Google Scholar

26. Yang, A, Xiao, W, Ju, W, Liao, Y, Chen, M, Zhu, X, et al. Prevalence and clinical significance of regional lymphadenectomy in patients with hepatocellular carcinoma. ANZ J Surg. (2019) 89:393–8. doi: 10.1111/ans.15096

PubMed Abstract | Crossref Full Text | Google Scholar

27. Ravaioli, M, Ercolani, G, Grazi, GL, Cescon, M, Dazzi, A, Zanfi, C, et al. Safety and prognostic role of regional lymphadenectomy for primary and metastatic liver tumors. Updat Surg. (2010) 62:27–34. doi: 10.1007/s13304-010-0008-9

Crossref Full Text | Google Scholar

28. Shen, WF, Wu, L, Dong, H, Sui, CJ, Dai, BH, Shen, RX, et al. Hepatic resection for multiple hepatocellular carcinoma less than 5 cm: a prospective comparative study. Hepato Gastroenterol. (2014) 61:173–80.

PubMed Abstract | Google Scholar

29. Donadon, M, Fontana, A, Procopio, F, Del Fabbro, D, Cimino, M, Viganò, L, et al. Dissecting the multinodular hepatocellular carcinoma subset: is there a survival benefit after hepatectomy? Updat Surg. (2019) 71:57–66. doi: 10.1007/s13304-019-00626-3

Crossref Full Text | Google Scholar

30. Tada, T, Kumada, T, Toyoda, H, Nakamura, S, Endo, Y, Kaneoka, Y, et al. A validation study of combined resection and ablation therapy for multiple hepatocellular carcinoma. Clin Radiol. (2022) 77:114–20. doi: 10.1016/j.crad.2021.10.012

Crossref Full Text | Google Scholar

31. Usta, S, and Kayaalp, C. Tumor diameter for hepatocellular carcinoma: why should size matter? J Gastrointest Cancer. (2020) 51:1114–7. doi: 10.1007/s12029-020-00483-z

PubMed Abstract | Crossref Full Text | Google Scholar

32. Heo, J, Noh, OK, Oh, YT, Chun, M, and Kim, L. Second primary cancer after liver transplantation in hepatocellular carcinoma: a nationwide population-based study. Hepatol Int. (2017) 11:523–8. doi: 10.1007/s12072-017-9824-z

PubMed Abstract | Crossref Full Text | Google Scholar

33. Wang, S, Hu, S, Huang, S, Su, L, Guo, Q, Wu, B, et al. Better survival and prognosis in SCLC survivors after combined second primary malignancies: a SEER database-based study. Medicine. (2023) 102:e32772. doi: 10.1097/md.0000000000032772

PubMed Abstract | Crossref Full Text | Google Scholar

34. Rao, A, Rich, NE, Marrero, JA, Yopp, AC, and Singal, AG. Diagnostic and therapeutic delays in patients with hepatocellular carcinoma. J Natl Comp Cancer Netw. (2021) 19:1063–71. doi: 10.6004/jnccn.2020.7689

PubMed Abstract | Crossref Full Text | Google Scholar

35. Tsai, WC, Kung, PT, Wang, YH, Kuo, WY, and Li, YH. Influence of the time interval from diagnosis to treatment on survival for early-stage liver cancer. PLoS One. (2018) 13:e0199532. doi: 10.1371/journal.pone.0199532

PubMed Abstract | Crossref Full Text | Google Scholar

36. Addissouky, T, Sayed, I, Ali, M, and Alubiady, M. Realizing the promise of artificial intelligence in hepatocellular carcinoma through opportunities and recommendations for responsible translation. J Online Inform. (2024) 9:70–9. doi: 10.15575/join.v9i1.1297

Crossref Full Text | Google Scholar

37. Chatzipanagiotou, OP, Loukas, C, Vailas, M, Machairas, N, Kykalos, S, Charalampopoulos, G, et al. Artificial intelligence in hepatocellular carcinoma diagnosis: a comprehensive review of current literature. J Gastroenterol Hepatol. (2024). doi: 10.1111/jgh.16663

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: advanced multiple hepatocellular carcinoma, prognosis, machine learning, deep learning, gradient boosted machine

Citation: Shen J, Zhou Y, Pei J, Yang D, Zhao K and Ding Y (2024) Development of prognostic models for advanced multiple hepatocellular carcinoma based on Cox regression, deep learning and machine learning algorithms. Front. Med. 11:1452188. doi: 10.3389/fmed.2024.1452188

Received: 20 June 2024; Accepted: 18 September 2024;
Published: 27 September 2024.

Edited by:

Marcello Dallio, University of Campania Luigi Vanvitelli, Italy

Reviewed by:

Mohammadsadegh Nikdad, Universitätsmedizin Greifswald, Germany
Tamer A. Addissouky, University of Menoufia, Egypt

Copyright © 2024 Shen, Zhou, Pei, Yang, Zhao and Ding. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Youming Ding, ZGluZ3ltQHdodS5lZHUuY24=; Kailiang Zhao, emhhb2tsMTk4M0B3aHUuZWR1LmNu

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.