Skip to main content

ORIGINAL RESEARCH article

Front. Immunol., 18 November 2022
Sec. Cancer Immunity and Immunotherapy
This article is part of the Research Topic Efficacy, Safety and Biomarkers of Novel Therapeutics and Regimens in the Peri-operative Setting of Bone and Soft Tissue Sarcoma View all 5 articles

A deep belief network-based clinical decision system for patients with osteosarcoma

Wenle Li,&#x;Wenle Li1,2†Youzheng Dong&#x;Youzheng Dong3†Wencai Liu&#x;Wencai Liu4†Zhiri TangZhiri Tang5Chenyu SunChenyu Sun6Scott LoweScott Lowe7Shuya ChenShuya Chen8Rachel BentleyRachel Bentley7Qin ZhouQin Zhou9Chan XuChan Xu10Wanying LiWanying Li10Bing WangBing Wang10Haosheng WangHaosheng Wang11Shengtao DongShengtao Dong12Zhaohui HuZhaohui Hu13Qiang LiuQiang Liu10Xintian CaiXintian Cai14Xiaowei Feng*Xiaowei Feng15*Wei Zhao*Wei Zhao1*Chengliang Yin*&#x;Chengliang Yin16*‡
  • 1Department of Orthopaedic Surgery, People’s Hospital of Xinjiang Uygur Autonomous Region, Urumqi, Xinjiang, China
  • 2Center for Molecular Imaging and Translational Medicine, Xiamen University, Xiamen, China
  • 3Key Laboratory of Molecular Medicine, The Second Affiliated Hospital of Nanchang University, Nanchang, China
  • 4Department of Orthopaedic Surgery, the First Affiliated Hospital of Nanchang University, Nanchang, China
  • 5School of Physics and Technology, Wuhan University, Wuhan, China
  • 6AMITA Health Saint Joseph Hospital Chicago, Chicago, IL, United States
  • 7College of Osteopathic Medicine, Kansas City University, Kansas, MO, United States
  • 8Foundation Program, Newham University Hospital, London, United Kingdom
  • 9Radiation Oncology, Mayo Clinic, Rochester, MN, United States
  • 10Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
  • 11Department of Orthopaedics, The Second Hospital of Jilin University, Changchun, China
  • 12Department of Spine Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
  • 13Department of Spine Surgery, Liuzhou People’s Hospital, Liuzhou, China
  • 14Graduate School, Xinjiang Medical University, Urumqi, Xinjiang, China
  • 15Department of Neuro Rehabilitation, Shaanxi Provincial Rehabilitation Hospital, Xi 'an, China
  • 16Faculty of Medicine, Macau University of Science and Technology, Macau, Macau SAR, China

Osteosarcoma was the most frequent type of malignant primary bone tumor with a poor survival rate mainly occurring in children and adolescents. For precision treatment, an accurate individualized prognosis for Osteosarcoma patients is highly desired. In recent years, many machine learning-based approaches have been used to predict distant metastasis and overall survival based on available individual information. In this study, we compared the performance of the deep belief networks (DBN) algorithm with six other machine learning algorithms, including Random Forest, XGBoost, Decision Tree, Gradient Boosting Machine, Logistic Regression, and Naive Bayes Classifier, to predict lung metastasis for Osteosarcoma patients. Therefore the DBN-based lung metastasis prediction model was integrated as a parameter into the Cox proportional hazards model to predict the overall survival of Osteosarcoma patients. The accuracy, precision, recall, and F1 score of the DBN algorithm were 0.917/0.888, 0.896/0.643, 0.956/0.900, and 0.925/0.750 in the training/validation sets, respectively, which were better than the other six machine-learning algorithms. For the performance of the DBN survival Cox model, the areas under the curve (AUCs) for the 1-, 3- and 5-year survival in the training set were 0.851, 0.806 and 0.793, respectively, indicating good discrimination, and the calibration curves showed good agreement between the prediction and actual observations. The DBN survival Cox model also demonstrated promising performance in the validation set. In addition, a nomogram integrating the DBN output was designed as a tool to aid clinical decision-making.

Introduction

Osteosarcoma is a malignant bone tumor with a low incidence but a high mortality rate, mainly occurring in children and adolescents (1, 2). Osteosarcoma frequently arises in the extremities, but it is also found in the axial skeleton and can be diagnosed at any age. At present, the combined treatment for Osteosarcoma includes tumor resection, radiotherapy, and chemotherapy. In addition, preoperative neoadjuvant chemotherapy and preoperative radiotherapy are increasingly important (35). Despite these advances, the prognosis of patients with Osteosarcoma is still poor due to the aggressive and metastatic behavior of the tumor (6, 7). Previous studies have shown that the major predictors of prognosis include age, gender, tumor dimension, response to chemotherapy, involvement of the proximal extremity or within the axial skeleton, and the presence of metastasis at diagnosis (8, 9). Osteosarcoma can metastasize to distant sites, mainly to the lungs, and occasionally to bone or lymph nodes (10, 11). Metastases can be detected in about 15%-20% of newly diagnosed Osteosarcoma patients, and the 5-year survival rate for these patients with metastases is only 20% (1214). Therefore, early identification of patients at high risk of metastasis and timely assessment of overall survival is particularly important for reducing mortality and improving patients’ quality of life.

In most situations, researchers conducted survival analyses often used Cox proportional hazards model (15). The model can analyze the influence of multiple factors on survival time simultaneously without estimating the distribution type of survival data, applying survival outcome and survival time as the dependent variable. The model assumes that the risk of a clinical outcome is a linear combination of the patient’s covariates. However, this method may be too crude for many intricate clinical outcomes such as tumor metastasis.

Machine learning (ML) is a significant subfield of Artificial intelligence (AI) to build decision-making models, it concentrates on making predictions by learning from available data. Deep learning (DL) is a sub-field of ML that concentrates on making predictions using multi-layered neural network algorithms. Compared to other ML methods such as logistic regression, the neural network architecture of DL enables the models to scale exponentially with the growing quantity and dimensionality of data (16). The deep belief network (DBN) model is a DL algorithm that stacks simpler models known as restricted Boltzmann machines (RBMs) (17). The unsupervised learning builds a multi-level structure layer-by-layer, automatically extracting more abstract representations from the layers. It makes DBN particularly useful for solving complex computational problems such as large-scale image classification, natural language processing and speech recognition and translation (17).

Recently, numerous ML-based studies have been carried out for cancer prediction, prognosis, or even assessing treatment response (1820). They predict survival time in years (particularly for the pre-operative prognosis of tumor patients) by using regression or categorizing it into long-term or short-term based on phenotypic features extracted from various types of pre-operative clinical characteristics or image data, i.e., blood tests, computed tomography (CT), magnetic resonance imaging (MRI) data before operation. The ML algorithms used in these studies include Decision Trees (DT), Gradient Boosting Machine (GBM), Random Forest (RF), Naive Bayes Classifier (NBC) and DL. However, few studies have compared the diagnostic performance of various ML algorithms with DL algorithms in assessing lung metastasis in OS patients.

In this study, the performance of DBN and six machine learning algorithms in predicting lung metastasis for Osteosarcoma patients was compared to find the optimal algorithm. The optimal DBN algorithm was subsequently used to construct a pulmonary metastasis predictive model for pulmonary metastasis in Osteosarcoma patients, the predictive model was integrated with other important variables into the Cox model to predict overall survival in Osteosarcoma patients. The study demonstrated that the DBN survival Cox model had good discrimination and calibration, which would provide great help for clinical decision-making.

Materials and methods

Applications of deep learning in medical field

DL has developed into an innovative field in the exploration part of ML and data mining (21, 22), which can break through the limitations of human eyes and reveal hidden information (2325). CNN is a representative deep neural network (DNN) model, which is considered one of the most commonly used DL methods (26), it eliminates the need for tedious steps such as manual feature extraction of medical images and significantly improves the ability to classify images and detect objects in images. CNNs have been gradually used by the medical community to assist in the early diagnosis of clinical diseases and to predict clinical outcomes of disease progression since the introduction of AlexNet in 2012 (2730). Research shows that deep CNNs can achieve state-of-the-art performance in tumor detection and diagnosis compared to other machine learning methods and human experts (3133). JY et al. developed and validated a deep learning signal (DLS) from diffusion tensor imaging (DTI) using a deep CNN model, identified key pathways for DLS in a radiogenomics cohort (n=78) from paired DTI and RNA-seq data, and could improve stratification of gliomas by identifying risk groups that affect survival outcomes for dysregulated biological pathways. Gun Woo Lee et al. created a CNN model to diagnose spinal cord cervical spondylosis (CSM) by receiving input from multiple channels of two-dimensional data and performing iterative transformations using convolution and pooling operations to identify features of lateral cervical spine radiographs of patients with or without spinal cord cervical spondylosis (CSM) with high diagnostic accuracy (34). This also provides new ideas and references for the treatment and diagnosis of clinically relevant diseases (3537).

Dataset and preprocessing

The training set for this study was derived from the Surveillance, Epidemiology and End Results (SEER) database using the SEER*stat software (version 8.6.3) from 2010 to 2016. Inclusion criteria included: (1) patients with histologically proven osteosarcoma (ICD-0-3 8936/3); (2) the primary tumor had to be localized in the limb bone; and (3) osteosarcoma was the first primary tumor. Patients with missing survival data or data on tumor size, metastasis, stage, or surgical modality were excluded. Collected variables included age, sex, race, grade, primary sites, laterality, bone metastases, T, N stage, lymph node surgery, surgery, radiation, chemotherapy survival status, and survival time. The validation set was collected from the inpatient Electronic Medical Record database at the Second Hospital of Jilin University, the Second Hospital of Dalian Medical University, Xianyang Central Hospital, and Liuzhou People’s Hospital in China. The variables to be collected were the same as before. Time-to-event or censoring was based on the date of diagnosis or the date of the last contact. The Osteosarcoma’s tumor node metastasis (TNM) stage was evaluated based on the 7th edition of the American Joint Committee on Cancer (AJCC) staging manual. “SEER Combined Mets at DX-lung (2010 +)” was used to identify the presence of lung metastasis in a newly diagnosed. Osteosarcoma patient. The Ethics Review Board of the Xianyang Central Hospital approved this study(Ethics Committee number: 20210022).

Variables screening for overall survival in OS patients

Cox proportional hazards with the least absolute shrinkage and selection operator (LASSO) penalty were employed to identify clinical variables that were linked with the overall survival of Osteosarcoma patients. LASSO penalized estimation methods shrank the estimates of the regression coefficients towards zero relatives to the maximum likelihood estimates. The shrinkage was to prevent overfitting due to either collinearity of the covariates or high dimensionality.

Deep learning algorithm versus machine learning algorithms

To compare the DBN algorithm with other ML algorithms for pulmonary metastasis in OS patients, several supervised classification methods were evaluated to determine better classification accuracy. The evaluated conventional classifiers include DT, GBM, logistic regression (LR), NBC, RF, and XGBoost (XGB). For parameters identification of six ML algorithms, univariate and multivariate logistics regression analyses were conducted. The preprocessed labelled dataset was used to train and test the model of different classifiers using 10-fold cross-validation as the experimental setting. The 10-fold cross-validation is a method to validate the studied/built model by iterating through the labelled data 10 times with different subsets of training and testing for each iteration.

DBN comprises multilayer random variables and binary la-tent variables and is a probabilistic deep learning algorithm. The model training performed is arranged in two main steps. Step 1: Train each layer of the RBM Network separately in an unsupervised manner and ensure that the maximum feature information is retained when the feature vector is mapped to a different feature space. Step 2: The BP Network is set as the final layer of the DBN, and the output feature vectors of the RBM are used as its input feature vectors to train the entity relationship classifier in a supervised manner.

In this study, the number of attribute features was obtained from the non-negative matrix decomposition as K = 14. After training the DBN, the last layer is the output features. The dimensionality of the feature vector is the number of nodes in the last layer. The number of nodes is determined by parameter sensitivity experiments based on the characteristics of our data. We finally chose 4 as the number of nodes.

In lung metastasis prediction, the attribute vector V of each case sample was used after the dimensionality reduction process as the input of DBN. In this training phase, the input vector V of the visible layer was passed to the hidden layer. Conversely, the input V of the visible layer was randomly selected to reconstruct the original input data. Finally, these new visible neuron activation units forwarded the reconstruction of the hidden layer activation units to obtain h1 and h2. Gibbs sampling was used to repeat the above process during the training. The correlation differences between the hidden layer activation units and the input visible layer were used as the basis for updating the weights W1 and W2.

The above seven algorithms were evaluated by the following indexes: accuracy, precision, recall, and F-measure (F1-Score). In addition, the feature importance of the optimal ML algorithm was calculated and compared with the DBN algorithm.

Model development and visualization

The previous screening variables selected by the LASSO Cox proportional hazards model were combined with output of the pulmonary metastasis predictive model to construct an integrated Cox model for predicting overall survival in Osteosarcoma patients. A web calculator and a clinical dynamic nomogram for those screening variables were planned to calculate and visualize the relationship between the variables and predicted probabilities of overall survival.

Model validation

The performance of the DBN survival Cox model was internally validated using the training set and externally authorized using the validation set. Discrimination for the model was evaluated by the area under the curve (AUC) of the receiver operating characteristic (ROC) curve. AUC values can range from 0 to 1, with values of 0.5-0.7 demonstrating poor discrimination, 0.7-0.8 acceptable, and >0.8 excellent discrimination (38). The calibration curve was used to assess the agreement between model-predicted and actual overall survival. The closer the calibration curve is to the 45-degree diagonal, the more perfect the model will be (39).

Assessment of clinical utility

Decision curve analysis (DCA) was used to explore the net benefit of the prognostic model over the entire range of probability thresholds (40, 41). DCA calculates the clinical net benefit acquired by applying the studied DBN survival Cox model to make clinical decisions. In the plot of DCA, the X-axis indicates thresholds probability for survival, and the Y-axis demonstrates net benefits depending on different thresholds probability. The pink horizontal line parallel to the X-axis represents no clinical action taken by any patient (“treat none”) and their net clinical benefit is 0. The smooth arc line represents that all patients took clinical action (“treat all”), and their net clinical benefit is a backslash with a negative slope. The curve of “treat by DBN survival Cox model” is then compared to “treat all” and to “treat none”.

Statistical analysis

The baseline categorical variables were represented as their counts and percentage, and continuous variables were expressed as means with standard deviation (SD) and medians [interquartile ranges (IQR)]. The COX proportional hazards with the LASSO penalty were used to identify predictor variables for survival. The lambda (λ) parameters in LASSO regression analysis were chosen for minimized expected model deviance. To compare deep learning and six machine learning algorithms, univariate and multivariate logistic regression analysis were conducted for parameter selection. After a univariate logistics regression analysis of all collected variables, those with a P-value < 0.05 were included in multivariable logistics regression. Variables with P-value < 0.05 in multivariable analysis were used to build the model. Accuracy, precision, recall, and F-measure were used to assess the performance of algorithms. The discrimination and calibration of the model in training and validation sets were evaluated by using the AUC and calibration curve. Clinical applicability was analyzed using DCA. P values < 0.05 indicated statistical significance. The statistical analyses were performed using RStudio software version 1.1.414 (Boston, MA, USA).

Results

Data characteristics

In total, 1,094 eligible Osteosarcoma patients from the SEER database were identified and designated as the training set. The validation set included 107 patients from the Second Hospital of Jilin University, the Second Hospital of Dalian Medical University, Xianyang Central Hospital and Liuzhou People’s Hospital. An overview of the total datasets combining the training set and validation set is shown in Figure 1A.

FIGURE 1
www.frontiersin.org

Figure 1 Overview of the total datasets combining the training set and validation set and LASSO model profile plots. (A), Heatmap of each clinical factor in the total datasets. (B), Coefficient profile plots showing how the size of the coefficients of clinical factors shrinks with increasing value of the penalty, with the factors and their regression coefficients selected for the model based on the optimal for the LASSO model. (C), Penalty plot for the LASSO model; color error bars indicate standard error. LASSO, least absolute shrinkage and selection operator.

Predictor variables of overall survival

Among 14 candidate survival-associated variables, 8 variables with statistically significant hazard ratios were selected based on LASSO Cox regression analysis (Figures 1B, C). These were sex, age, primary site, grade, T stage, surgery, bone metastases, and lung metastases.

Compare with other machine learning classifiers

Uni- and multivariate logistic regression analysis identified that sex, bone metastasis, surgery, N and T were significant factors for predicting pulmonary metastasis (Table 1). For training set, the results of DBN algorithm and six ML algorithms (DT, GBM, LR, NBC, RF and XGB) are shown in Figure 2. The accuracy, precision, recall, and F-Score of DBN were 0.917 ± 0.017, 0.896 ± 0.022, 0.956 ± 0.018 and 0.925 ± 0.018, respectively. Among the six ML algorithms, the accuracy, precision, recall and F- Score of the best algorithm (XGB) were 0.712 ± 0.014, 0.689 ± 0.026, 0.754 ± 0.044 and 0.724 ± 0.017, respectively. For the validation set, the comparison result is shown in Figure 3. The accuracy, precision, recall, and F-Score of DBN and XGB were 0.888/0.665, 0.642/0.326, 0.900/0.750 and 0.750/0.455, respectively. Figure 4 shows the ranking of the top 12 feature importance according to the DBN and the XGB algorithm. As shown, ‘N stage’ and ‘T stage’ was the most two important predictors of lung metastasis in the DBN algorithm. ‘Surgery’ was the most important predictor, and ‘T stage’ was the second most important predictor of lung metastasis in the XGB algorithm.

TABLE 1
www.frontiersin.org

Table 1 Univariate and multivariate Logistics regression for pulmonary metastasis of osteosarcoma.

FIGURE 2
www.frontiersin.org

Figure 2 Comparison of DBN (A) algorithm and other 6 machine learning algorithms including DT (B), GBM (C), LR (D), RF (E), NBC (F) and XGB (G) for accuracy, precision, recall, and F-measure (F1-Score) in training set. DBN, Deep Belief Networks; DT, Decision Tree; GBM, Gradient Boosting Machine; LR, Logistic Regression; RF, Random Forest; NBC, Naive Bayes Classifier; XGB, XGBoost.

FIGURE 3
www.frontiersin.org

Figure 3 Comparison of DBN algorithm and other 6 machine learning algorithms including DT, GBM, LR, RF, NBC and XGB for accuracy, precision, recall, and F-measure (F1-Score) in validation set. DBN, Deep Belief Networks; DT, Decision Tree; GBM, Gradient Boosting Machine; LR, Logistic Regression; RF, Random Forest; NBC, Naive Bayes Classifier; XGB, XGBoost.

FIGURE 4
www.frontiersin.org

Figure 4 Nomogram for predicting overall survival for 1-, 3-, and 5-years in OS patients. Each variable value for the individuals was determined according to the top Points scale, and then the points for each variable were added. Finally, a personalized survival probability was obtained according to the bottom Total Points scale.

Model development

The variables of sex, age, primary site, grade, T, surgery, and bone metastases were combined with the output of DBN-based lung metastasis predictive model to construct an integrated DBN survival Cox model for predicting overall survival in OS patients. To make the DBN survival Cox model more intuitional, a nomogram (Figure 5) was then constructed. An online web calculator embedding a dynamic nomogram with our DBN survival Cox model was also developed, which is available at https://drwenleli0910.shinyapps.io/ODSapp/. After filling in the online form as required in the webpage, the webpage will automatically generate a personalized nomogram, together with the probability of survival at 1-, 3-, and 5-years.

FIGURE 5
www.frontiersin.org

Figure 5 Feature importance. The top 12 feature importance results for the DBN (A) model and XGB (B) model trained on training set. DBN, Deep Belief Networks; XGB, XGBoost.

Performance of the prediction model

The AUC of the DBN survival Cox model at 1-, 3-, 5-years in the training set were 0.851, 0.806, and 0.793, respectively (Figures 6A–C). Calibration curves showed that the predicted and actual survival rates matched very well at 3 and 5 years, and were acceptable at 1-year (Figures 6D–F). In the validation set, the AUC of DBN survival Cox model at 1-, 3-, 5-years were 0.876, 0.827, and 0.814, respectively (Figures 7A–C). The predicted and actual survival rates matched very well at 1-year, and were accepted at 3- and 5-years (Figures 7D–F). The risk curve, survival status and time, and expression of features in the DBN survival Cox model were also represented according to the high- and low-risk groups in the training and validation set (Figure 8).

FIGURE 6
www.frontiersin.org

Figure 6 The AUC and calibration curves of DL survival Cox model at 1-, 3-, 5-years in the training set. (A–C), the AUC of the model at 1-, 3-, 5-years. (D–F), the calibration curves of model at 1-, 3-, 5-years. AUC, area under the curve; DL, deep learning.

FIGURE 7
www.frontiersin.org

Figure 7 The AUC and calibration curves of DL survival Cox model at 1-, 3-, 5-years in the validation set. (A–C), the AUC of model at 1-, 3-, 5-years. (D–F), the calibration curves of model at 1-, 3-, 5-years. AUC, area under the curve; DL, deep learning.

FIGURE 8
www.frontiersin.org

Figure 8 Risk score analysis of deep learning survival Cox model in training (A) and validation (B) set.

Assessment of clinical utility

The decision curves for 1-, 3- and 5-years in the training set and validation set (Figures 9A, B) demonstrated relatively good performance for the DBN survival Cox model in terms of clinical application. As shown, when the threshold probability in the clinical decision was more than 0.063, 0.151, and 0.193 for 1 year, 3 years, and 5 years, respectively, the DBN survival Cox model provided more of a net benefit than “ treat all “ or “ treat none”.

FIGURE 9
www.frontiersin.org

Figure 9 Decision curves for the DL survival Cox model for 1-, 3-, 5-years in the training set (A) and validation set (B). The X-axis indicates thresholds probability for lung metastasis risk, and the Y-axis indicates net benefits depending on different thresholds probability. The pink horizontal line parallel to the X-axis represents no clinical action taken by any patient (“treat none”) and their net clinical benefit is 0. The smooth arc line represents that all patients took clinical action (“treat all”), and their net clinical benefit is a backslash with a negative slope. The curve of “treat by DL survival Cox model” is then compared to “treat all” and to “treat none”.

Discussion

Although the effectiveness of the localized Osteosarcoma treatment has gradually improved, the 5-year overall survival of Osteosarcoma patients with lung metastasis is less than 30%, suggesting that these patients still fare poorly (1214). Several studies have explored potential risk factors for tumor metastasis to facilitate early management (8, 9). However, these articles mainly concentrated on the impact of a single factor on metastasis, a study on the combined impact of multiple factors on metastasis is still lacking.

Cox proportional hazards model is the traditional statistical approach for survival analysis and risk prediction. The data in this model are required to meet prior assumptions, including the proportional hazard assumptions and linear relationships between continuous covariates and the log hazard function.

However, in the complex biomedical field, there are many nonlinear relationships (42, 43). In this context, a single Cox model with proportional hazards is likely to produce less accurate estimations of survival outcomes. Therefore, new solutions containing these potentially nonlinear variables are highly needed to accurately predict the survival of individuals,

DBN, a relatively new computational algorithm that has become a popular research topic, has been rapidly developed and widely used in medical research. It is a DL-based generative model, when a network contains a large number of deep layers, which addresses the problem of vanishing gradients that suffer from traditional gradient-based learning algorithms. In cancer research, the advantages of DBN model for survival analysis are as follows: First, this model shows a modified fit for features with a nonlinear relationship, which applies to the nonlinear associations that are abundant in real-world practice. Second, as a DL model, the DBN automatically learns complex mapping by transforming the features through the multi-layer structure. At present, most ML algorithms have applied shallow-structured architectures.

These models are specifically effective in solving well–constrained problems. However, several studies have confirmed reasons for using deep structures (4446). Deep models may be more robust in the wide variety of functions that can be parameterized by composing non-linear transformations (47). They also allow more efficient representation of highly varying functions than shallow architectures. In addition, a prominent problem with traditional algorithms is the requirement for a certain level of domain expertise to design a feature extractor that converts raw data into a suitable feature vector (16). DBN allows a system to be fed with raw data and to automatically discover the required representations. This study shows that the DBN-based model can achieve better prediction performance than other machine learning algorithms.

Our predictive models currently use seven clinical variables, including common demographic and clinical characteristics and histopathological outcomes. The SEER database was used with relatively complete data during the model development, while some data were missing in the external validation set. Our model has tolerance for missing data, and we still achieved high performance on the external validation set even missing 30% data of the patients. In addition, a dynamic nomogram can provide dynamic assessments as missing data are supplemented during subsequent treatment.

There were aols some limitations despite the promising results. First, the training set was extracted from the SEER database with a relatively small sample size, although an independent validation group from various hospitals was used. Thus, a large sample of multicenter data was required to fully evaluate the generalization ability of the DBN survival COX model. Second, the SEER database only provided limited clinical variables, and many variables closely associated with tumor metastasis and survival, such as tumor markers and gene expression, were not available. Future studies could incorporate these potentially essential factors and construct a more comprehensive predictive model.

Conclusion

In our study, a DBN survival Cox model was established to predict the overall survival in Osteosarcoma patients. Compared to the other six ML algorithms, this DNB algorithm demonstrated better performance. Both internal validation and external validation revealed good generalizability. In addition, an individualized risk estimate of survival can be calculated through the nomogram and online web calculators developed by this study. This allows the model to be applied to clinical practice and helps with clinical decision-making.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Author contributions

CLY, WZ and XWF completed the entire research design. WLL, YZD and WCL participated in the research and analyzed data. WLL and YZD drafted manuscripts. CYS and SL provided expert consultation and advice. SYC, RB, QZ, CX, WYL and BW participated in its design and coordination, HSW, STD and ZHH helped polish the language. All authors reviewed the final version of the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA: A Cancer J Clin (2020) 70(1):7–30. doi: 10.3322/caac.21590

CrossRef Full Text | Google Scholar

2. Misaghi A, Goldin A, Awad M, Kulidjian AA. Osteosarcoma: a comprehensive review. SICOT-J (2018) 4(1):12. doi: 10.1051/sicotj/2017028

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Amanda, Dancsok AR, Asleh-Aburaya K, Nielsen TO. Advances in sarcoma diagnostics and treatment. Oncotarget (2017) 8(4):7068–93. doi: 10.18632/oncotarget.12548.

PubMed Abstract | CrossRef Full Text | Google Scholar

4. El Beaino M, Araujo DM, Lazar AJ, Lin PP. Synovial sarcoma: Advances in diagnosis and treatment identification of new biologic targets to improve multimodal therapy. Ann Surg Oncol (2017) 24(8):1–10. doi: 10.1245/s10434-017-5855-x

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Gronchi A, Maki RG, Jones RL. Treatment of soft tissue sarcoma: a focus on earlier stages. Future Oncol (2017) 13(1s):13–21. doi: 10.2217/fon-2016-0499

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Rainusso N, Wang LL, Yustein JT. The adolescent and young adult with cancer: State of the art - bone tumors. Curr Oncol Rep (2013) 15(4):296–307. doi: 10.1007/s11912-013-0321-9.

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Kager L, Tamamyan G, Bielack S. Novel insights and therapeutic interventions for pediatric osteosarcoma. Future Oncol (2017) 13(4):357–68. doi: 10.2217/fon-2016-0261.

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Jo VY, Fletcher CD. WHO classification of soft tissue tumours: an update based on the 2013. (4th) edition. Pathology (2014) 46(2):95–104. doi: 10.1097/PAT.0000000000000050.

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Bielack SS, Kempf-Bielack B, Delling G, Exner GU, Flege S, Helmke K, et al. Prognostic factors in high-grade osteosarcoma of the extremities or trunk: an analysis of 1,702 patients treated on neoadjuvant cooperative osteosarcoma study group protocols. J Clin Oncol (2002) 20(3):776–90. doi: 10.1200/JCO.2002.20.3.776.

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Kager L, Zoubek A, Pötschger U, Kastner U, Flege S, Kempf-Bielack B, et al. Primary metastatic osteosarcoma: presentation and outcome of patients treated on neoadjuvant cooperative osteosarcoma study group protocols. J Clin Oncol (2003) 21(10):2011–8. doi: 10.1200/JCO.2003.08.132

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Isakoff MS, Bielack SS, Meltzer P, Gorlick R. Osteosarcoma: Current treatment and a collaborative pathway to success. J Clin Oncol Off J Am Soc Clin Oncol (2015) 33(27):3029–35. doi: 10.1200/JCO.2014.59.4895.

CrossRef Full Text | Google Scholar

12. Harting MT, Blakely ML. Management of osteosarcoma pulmonary metastases. Semin Pediatr Surg (2006) 15(1):25–9. doi: 10.1053/j.sempedsurg.2005.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Bacci G, Rocca M, Salone M, Balladelli A, Ferrari S, Palmerini E. High grade osteosarcoma of the extremities with lung metastases at presentation: Treatment with neoadjuvant chemotherapy and simultaneous resection of primary and metastatic lesions. J Surg Oncol (2008). 98(6):415–20. doi: 10.1002/jso.21140

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Geller DS, Gorlick R. Osteosarcoma: a review of diagnosis, management, and treatment strategies. Clin Adv Hematol Oncol (2010) 8(10):705–18.

PubMed Abstract | Google Scholar

15. Prentice RL, Zhao S. Regression Models and Multivariate Life Tables. J Am Stat Assoc (2021) 116(535):1330–45. doi: 10.1080/01621459.2020.1713792.

PubMed Abstract | CrossRef Full Text | Google Scholar

16. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature (2015) 521(7553):436–44. doi: 10.1038/nature14539.

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Comput (2006) 18(7):1527–54. doi: 10.1162/neco.2006.18.7.1527

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Brendlin AS, Peisen F, Almansour H, Afat S, Eigentler T, Amaral T, et al. A machine learning model trained on dual-energy CT radiomics significantly improves immunotherapy response prediction for patients with stage IV melanoma. J Immunother Cancer (2021) 9(11) :e003261. doi: 10.1136/jitc-2021-003261.

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Kawakami E, Tabata J, Yanaihara N, Ishikawa T, Koseki K, Iida Y, et al. Application of artificial intelligence for preoperative diagnostic and prognostic prediction in epithelial ovarian cancer based on blood biomarkers. Clin Cancer Res Off J Am Assoc Cancer Res (2019) 25(10):3006–15. doi: 10.1158/1078-0432.CCR-18-3378.

CrossRef Full Text | Google Scholar

20. Tahmassebi A, Wengert GJ, Helbich TH, Bago-Horvath Z, Alaei S, Bartsch R, et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Invest Radiol (2019) 54(2):110–7. doi: 10.1097/RLI.0000000000000518.

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Cerentini A, Welfer D, Cordeiro d'Ornellas M, Pereira Haygert CJ, Dotto GN. Automatic identification of glaucoma using deep learning methods. Stud Health Technol Inf (2017) 245:318–21.

Google Scholar

22. LeCun Y, Bengio Y, Hinton G. Deep learning Nature. (2015) 521(7553):436–44. doi: 10.1038/nature14539.

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Lei T, Qian H, Lei P, Hu Y.Ferroptosis-related gene signature associates with immunity and predicts prognosis accurately in patients with osteosarcoma. Cancer Sci (2021) 112(11):4785–98. doi: 10.1111/cas.15131.

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Kocher M, Ruge MI, Galldiks N, Lohmann P Applications of radiomics and machine learning for radiotherapy of malignant brain tumors. Strahlenther Onkol (2020) 196(10):856–67. doi: 10.1007/s00066-020-01626-8

CrossRef Full Text | Google Scholar

25. Kaur A, Sachdeva R, Singh A. Latest trends in deep learning for automatic speech recognition system, in: International Conference on Artificial Intelligence and Speech Technology, (Cham: Springer) (2021) 62–72. doi: 10.1007/978-3-030-95711-7_6

CrossRef Full Text | Google Scholar

26. Fayaz Begum S, Prasanthi B. Investigation of level set segmentation procedures in brain MR images. ICCCE (2020) (Singapore: Springer) (2021) 431–8. doi: 10.1007/978-981-15-7961-5_43.

CrossRef Full Text | Google Scholar

27. Corbin D, Lesage F. Assessment of the predictive potential of cognitive scores from retinal images and retinal fundus metadata via deep learning using the CLSA database. Sci Rep (2022) 12(1):5767. doi: 10.1038/s41598-022-09719-3

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Pacheco A, Krohling R. An attention-based mechanism to combine images and metadata in deep learning models applied to skin cancer classification. IEEE J Biomed Health Inf (2021) 25(9):3554–63. doi: 10.1109/JBHI.2021.3062002.

CrossRef Full Text | Google Scholar

29. Hassan R, Islam M F, Uddin M Z, Ghoshal G, Hassan M M, Huda S. Prostate cancer classification from ultrasound and MRI images using deep learning based explainable artificial intelligence. Future Generation Computer Systems (2022) 127:462–72. doi: 10.1016/j.future.2021.09.030.

CrossRef Full Text | Google Scholar

30. Pan L, Ji B, Wang H, Wang L, Liu M, Chongcheawchamnan M, et al. MFDNN: multi-channel feature deep neural network algorithm to identify COVID19 chest X-ray images. Health Inf Sci Syst (2022) 10(1):4. doi:10.1007/s13755-022-00174-y.

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Cik I, Rasamoelina A D, Mach M, Sinčák P. Explaining deep neural network using layer-wise relevance propagation and integrated gradients. In: 2021 IEEE 19th world symposium on applied machine intelligence and informatics (SAMI) (2021). doi: 10.1109/SAMI50585.2021.9378686

CrossRef Full Text | Google Scholar

32. Xu Y, He X, Li Y, Pang P, Shu Z, Gong X. The nomogram of MRI-based radiomics with complementary visual features by machine learning improves stratification of glioblastoma patients: A multicenter study. J Magnetic Resonance Imaging (2021) 54(2):571–83. doi: 10.1002/jmri.27536.

CrossRef Full Text | Google Scholar

33. Mao B, Ma J, Duan S, Xia Y, Tao Y, Zhang L. Preoperative classification of primary and metastatic liver cancer via machine learning-based ultrasound radiomics. Eur Radiol (2021) 31(7):4576–86. doi: 10.1007/s00330-020-07562-6

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Lee GW, Shin H, Chang MC. Deep learning algorithm to evaluate cervical spondylotic myelopathy using lateral cervical spine radiograph. BMC Neurol (2022) 22(1):147. doi: 10.1186/s12883-022-02670-w.

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Ma R, Vanstrum EB, Lee R, Chen J, Hung AJ. Machine learning in the optimization of robotics in the operative field. Curr Opin Urol (2020) 30(6):808–16. doi: 10.1097/MOU.0000000000000816

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Augsburger M, Galatzer-Levy IR. Utilization of machine learning to test the impact of cognitive processing and emotion recognition on the development of PTSD following trauma exposure. BMC Psychiatry (2020) 20(1):325. doi: 10.1186/s12888-020-02728-4.

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Fessler J, Gouy-Pailler C, Fischler M, Le Guen M. Machine learning in lung transplantation. J Heart Lung Transplant (2020) 39(4):S385. doi: 10.1016/j.healun.2020.01.497

CrossRef Full Text | Google Scholar

38. Alba AC, Agoritsas T, Walsh M, Hanna S, Iorio A, Devereaux PJ. Discrimination and calibration of clinical prediction models: Users' guides to the medical literature. JAMA J Am Med Assoc (2017) 318(14):1377. doi: 10.1001/jama.2017.12126

CrossRef Full Text | Google Scholar

39. Koller M, Witteman J, Steyerberg E. Prognostic models with competing risks: methods and application to coronary risk prediction. Epidemiology (2009) 20(4):555–61. doi: 10.1097/EDE.0b013e3181a39056

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: Seven steps for development and an ABCD for validation. Eur Heart J (2014) 35(29):1925–31. doi: 10.1093/eurheartj/ehu207.

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Vickers AJ, Elkin EB. Decision curve analysis: A novel method for evaluating prediction models. Med Decision Making (2006) 26(6):565–74. doi:10.1177/0272989X06295361.

CrossRef Full Text | Google Scholar

42. Chi SQ, Tian Y, Li J, Tong DY, Kong XX, Poston G. Time‐dependent and nonlinear effects of prognostic factors in nonmetastatic colorectal cancer. Cancer Med (2017) 6(8):1882–92. doi:10.1002/cam4.1116

PubMed Abstract | Google Scholar

43. Joensuu H, Vehtari A, Riihimäki J, Nishida T, Steigen SE, Brabec P. Risk of recurrence of gastrointestinal stromal tumour after surgery: an analysis of pooled population-based cohorts. Lancet Oncol (2012) 13(3):265–74. doi: 10.1016/S1470-2045(11)70299-6.

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Bengio Y, Lecun Y. Scaling learning algorithms towards AI. In: Large-Scale kernel machines (2007) 34(5):1–41.

Google Scholar

45. Bengio Y. Learning deep architectures for AI. Now Publishers Inc (2009) 2(1):1–129. doi: 10.1561/2200000006

CrossRef Full Text | Google Scholar

46. Wu EQ, Qiu X Y, Tang Z, Zhang W M, Zhu L M, Ren H, et al. Detecting fatigue status of pilots based on deep learning network using EEG signals. IEEE Trans Cogn Dev Syst (2020) 13(3):575–85. doi: 10.1109/TCDS.2019.2963476

CrossRef Full Text | Google Scholar

47. Tang Z, Sun ZH, Wu EQ, Wei CF, Ming D, Chen S. MRCG: A MRI retrieval system with convolutional and graph neural networks for secure and private IoMT. IEEE J BioMed Health Inform (2021). doi: 10.1109/JBHI.2021.3130028

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Osteosarcoma, lung metastasis, deep belief networks, prognosis, clinical decision

Citation: Li W, Dong Y, Liu W, Tang Z, Sun C, Lowe S, Chen S, Bentley R, Zhou Q, Xu C, Li W, Wang B, Wang H, Dong S, Hu Z, Liu Q, Cai X, Feng X, Zhao W and Yin C (2022) A deep belief network-based clinical decision system for patients with osteosarcoma. Front. Immunol. 13:1003347. doi: 10.3389/fimmu.2022.1003347

Received: 26 July 2022; Accepted: 19 October 2022;
Published: 18 November 2022.

Edited by:

Jinming Han, Capital Medical University, China

Reviewed by:

Qun Zhao, Fourth Hospital of Hebei Medical University, China
Yuan Xiong, Huazhong University of Science and Technology, China

Copyright © 2022 Li, Dong, Liu, Tang, Sun, Lowe, Chen, Bentley, Zhou, Xu, Li, Wang, Wang, Dong, Hu, Liu, Cai, Feng, Zhao and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chengliang Yin, chengliangyin@163.com; Wei Zhao, 34302603@qq.com; Xiaowei Feng, fxw600@qq.com

These authors have contributed equally to this work

ORCID ID: Chengliang Yin, orcid.org/0000-0001-8262-5749

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.