- 1Department of Orthopaedic Surgery, People’s Hospital of Xinjiang Uygur Autonomous Region, Urumqi, Xinjiang, China
- 2Center for Molecular Imaging and Translational Medicine, Xiamen University, Xiamen, China
- 3Key Laboratory of Molecular Medicine, The Second Affiliated Hospital of Nanchang University, Nanchang, China
- 4Department of Orthopaedic Surgery, the First Affiliated Hospital of Nanchang University, Nanchang, China
- 5School of Physics and Technology, Wuhan University, Wuhan, China
- 6AMITA Health Saint Joseph Hospital Chicago, Chicago, IL, United States
- 7College of Osteopathic Medicine, Kansas City University, Kansas, MO, United States
- 8Foundation Program, Newham University Hospital, London, United Kingdom
- 9Radiation Oncology, Mayo Clinic, Rochester, MN, United States
- 10Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
- 11Department of Orthopaedics, The Second Hospital of Jilin University, Changchun, China
- 12Department of Spine Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
- 13Department of Spine Surgery, Liuzhou People’s Hospital, Liuzhou, China
- 14Graduate School, Xinjiang Medical University, Urumqi, Xinjiang, China
- 15Department of Neuro Rehabilitation, Shaanxi Provincial Rehabilitation Hospital, Xi 'an, China
- 16Faculty of Medicine, Macau University of Science and Technology, Macau, Macau SAR, China
Osteosarcoma was the most frequent type of malignant primary bone tumor with a poor survival rate mainly occurring in children and adolescents. For precision treatment, an accurate individualized prognosis for Osteosarcoma patients is highly desired. In recent years, many machine learning-based approaches have been used to predict distant metastasis and overall survival based on available individual information. In this study, we compared the performance of the deep belief networks (DBN) algorithm with six other machine learning algorithms, including Random Forest, XGBoost, Decision Tree, Gradient Boosting Machine, Logistic Regression, and Naive Bayes Classifier, to predict lung metastasis for Osteosarcoma patients. Therefore the DBN-based lung metastasis prediction model was integrated as a parameter into the Cox proportional hazards model to predict the overall survival of Osteosarcoma patients. The accuracy, precision, recall, and F1 score of the DBN algorithm were 0.917/0.888, 0.896/0.643, 0.956/0.900, and 0.925/0.750 in the training/validation sets, respectively, which were better than the other six machine-learning algorithms. For the performance of the DBN survival Cox model, the areas under the curve (AUCs) for the 1-, 3- and 5-year survival in the training set were 0.851, 0.806 and 0.793, respectively, indicating good discrimination, and the calibration curves showed good agreement between the prediction and actual observations. The DBN survival Cox model also demonstrated promising performance in the validation set. In addition, a nomogram integrating the DBN output was designed as a tool to aid clinical decision-making.
Introduction
Osteosarcoma is a malignant bone tumor with a low incidence but a high mortality rate, mainly occurring in children and adolescents (1, 2). Osteosarcoma frequently arises in the extremities, but it is also found in the axial skeleton and can be diagnosed at any age. At present, the combined treatment for Osteosarcoma includes tumor resection, radiotherapy, and chemotherapy. In addition, preoperative neoadjuvant chemotherapy and preoperative radiotherapy are increasingly important (3–5). Despite these advances, the prognosis of patients with Osteosarcoma is still poor due to the aggressive and metastatic behavior of the tumor (6, 7). Previous studies have shown that the major predictors of prognosis include age, gender, tumor dimension, response to chemotherapy, involvement of the proximal extremity or within the axial skeleton, and the presence of metastasis at diagnosis (8, 9). Osteosarcoma can metastasize to distant sites, mainly to the lungs, and occasionally to bone or lymph nodes (10, 11). Metastases can be detected in about 15%-20% of newly diagnosed Osteosarcoma patients, and the 5-year survival rate for these patients with metastases is only 20% (12–14). Therefore, early identification of patients at high risk of metastasis and timely assessment of overall survival is particularly important for reducing mortality and improving patients’ quality of life.
In most situations, researchers conducted survival analyses often used Cox proportional hazards model (15). The model can analyze the influence of multiple factors on survival time simultaneously without estimating the distribution type of survival data, applying survival outcome and survival time as the dependent variable. The model assumes that the risk of a clinical outcome is a linear combination of the patient’s covariates. However, this method may be too crude for many intricate clinical outcomes such as tumor metastasis.
Machine learning (ML) is a significant subfield of Artificial intelligence (AI) to build decision-making models, it concentrates on making predictions by learning from available data. Deep learning (DL) is a sub-field of ML that concentrates on making predictions using multi-layered neural network algorithms. Compared to other ML methods such as logistic regression, the neural network architecture of DL enables the models to scale exponentially with the growing quantity and dimensionality of data (16). The deep belief network (DBN) model is a DL algorithm that stacks simpler models known as restricted Boltzmann machines (RBMs) (17). The unsupervised learning builds a multi-level structure layer-by-layer, automatically extracting more abstract representations from the layers. It makes DBN particularly useful for solving complex computational problems such as large-scale image classification, natural language processing and speech recognition and translation (17).
Recently, numerous ML-based studies have been carried out for cancer prediction, prognosis, or even assessing treatment response (18–20). They predict survival time in years (particularly for the pre-operative prognosis of tumor patients) by using regression or categorizing it into long-term or short-term based on phenotypic features extracted from various types of pre-operative clinical characteristics or image data, i.e., blood tests, computed tomography (CT), magnetic resonance imaging (MRI) data before operation. The ML algorithms used in these studies include Decision Trees (DT), Gradient Boosting Machine (GBM), Random Forest (RF), Naive Bayes Classifier (NBC) and DL. However, few studies have compared the diagnostic performance of various ML algorithms with DL algorithms in assessing lung metastasis in OS patients.
In this study, the performance of DBN and six machine learning algorithms in predicting lung metastasis for Osteosarcoma patients was compared to find the optimal algorithm. The optimal DBN algorithm was subsequently used to construct a pulmonary metastasis predictive model for pulmonary metastasis in Osteosarcoma patients, the predictive model was integrated with other important variables into the Cox model to predict overall survival in Osteosarcoma patients. The study demonstrated that the DBN survival Cox model had good discrimination and calibration, which would provide great help for clinical decision-making.
Materials and methods
Applications of deep learning in medical field
DL has developed into an innovative field in the exploration part of ML and data mining (21, 22), which can break through the limitations of human eyes and reveal hidden information (23–25). CNN is a representative deep neural network (DNN) model, which is considered one of the most commonly used DL methods (26), it eliminates the need for tedious steps such as manual feature extraction of medical images and significantly improves the ability to classify images and detect objects in images. CNNs have been gradually used by the medical community to assist in the early diagnosis of clinical diseases and to predict clinical outcomes of disease progression since the introduction of AlexNet in 2012 (27–30). Research shows that deep CNNs can achieve state-of-the-art performance in tumor detection and diagnosis compared to other machine learning methods and human experts (31–33). JY et al. developed and validated a deep learning signal (DLS) from diffusion tensor imaging (DTI) using a deep CNN model, identified key pathways for DLS in a radiogenomics cohort (n=78) from paired DTI and RNA-seq data, and could improve stratification of gliomas by identifying risk groups that affect survival outcomes for dysregulated biological pathways. Gun Woo Lee et al. created a CNN model to diagnose spinal cord cervical spondylosis (CSM) by receiving input from multiple channels of two-dimensional data and performing iterative transformations using convolution and pooling operations to identify features of lateral cervical spine radiographs of patients with or without spinal cord cervical spondylosis (CSM) with high diagnostic accuracy (34). This also provides new ideas and references for the treatment and diagnosis of clinically relevant diseases (35–37).
Dataset and preprocessing
The training set for this study was derived from the Surveillance, Epidemiology and End Results (SEER) database using the SEER*stat software (version 8.6.3) from 2010 to 2016. Inclusion criteria included: (1) patients with histologically proven osteosarcoma (ICD-0-3 8936/3); (2) the primary tumor had to be localized in the limb bone; and (3) osteosarcoma was the first primary tumor. Patients with missing survival data or data on tumor size, metastasis, stage, or surgical modality were excluded. Collected variables included age, sex, race, grade, primary sites, laterality, bone metastases, T, N stage, lymph node surgery, surgery, radiation, chemotherapy survival status, and survival time. The validation set was collected from the inpatient Electronic Medical Record database at the Second Hospital of Jilin University, the Second Hospital of Dalian Medical University, Xianyang Central Hospital, and Liuzhou People’s Hospital in China. The variables to be collected were the same as before. Time-to-event or censoring was based on the date of diagnosis or the date of the last contact. The Osteosarcoma’s tumor node metastasis (TNM) stage was evaluated based on the 7th edition of the American Joint Committee on Cancer (AJCC) staging manual. “SEER Combined Mets at DX-lung (2010 +)” was used to identify the presence of lung metastasis in a newly diagnosed. Osteosarcoma patient. The Ethics Review Board of the Xianyang Central Hospital approved this study(Ethics Committee number: 20210022).
Variables screening for overall survival in OS patients
Cox proportional hazards with the least absolute shrinkage and selection operator (LASSO) penalty were employed to identify clinical variables that were linked with the overall survival of Osteosarcoma patients. LASSO penalized estimation methods shrank the estimates of the regression coefficients towards zero relatives to the maximum likelihood estimates. The shrinkage was to prevent overfitting due to either collinearity of the covariates or high dimensionality.
Deep learning algorithm versus machine learning algorithms
To compare the DBN algorithm with other ML algorithms for pulmonary metastasis in OS patients, several supervised classification methods were evaluated to determine better classification accuracy. The evaluated conventional classifiers include DT, GBM, logistic regression (LR), NBC, RF, and XGBoost (XGB). For parameters identification of six ML algorithms, univariate and multivariate logistics regression analyses were conducted. The preprocessed labelled dataset was used to train and test the model of different classifiers using 10-fold cross-validation as the experimental setting. The 10-fold cross-validation is a method to validate the studied/built model by iterating through the labelled data 10 times with different subsets of training and testing for each iteration.
DBN comprises multilayer random variables and binary la-tent variables and is a probabilistic deep learning algorithm. The model training performed is arranged in two main steps. Step 1: Train each layer of the RBM Network separately in an unsupervised manner and ensure that the maximum feature information is retained when the feature vector is mapped to a different feature space. Step 2: The BP Network is set as the final layer of the DBN, and the output feature vectors of the RBM are used as its input feature vectors to train the entity relationship classifier in a supervised manner.
In this study, the number of attribute features was obtained from the non-negative matrix decomposition as K = 14. After training the DBN, the last layer is the output features. The dimensionality of the feature vector is the number of nodes in the last layer. The number of nodes is determined by parameter sensitivity experiments based on the characteristics of our data. We finally chose 4 as the number of nodes.
In lung metastasis prediction, the attribute vector V of each case sample was used after the dimensionality reduction process as the input of DBN. In this training phase, the input vector V of the visible layer was passed to the hidden layer. Conversely, the input V of the visible layer was randomly selected to reconstruct the original input data. Finally, these new visible neuron activation units forwarded the reconstruction of the hidden layer activation units to obtain h1 and h2. Gibbs sampling was used to repeat the above process during the training. The correlation differences between the hidden layer activation units and the input visible layer were used as the basis for updating the weights W1 and W2.
The above seven algorithms were evaluated by the following indexes: accuracy, precision, recall, and F-measure (F1-Score). In addition, the feature importance of the optimal ML algorithm was calculated and compared with the DBN algorithm.
Model development and visualization
The previous screening variables selected by the LASSO Cox proportional hazards model were combined with output of the pulmonary metastasis predictive model to construct an integrated Cox model for predicting overall survival in Osteosarcoma patients. A web calculator and a clinical dynamic nomogram for those screening variables were planned to calculate and visualize the relationship between the variables and predicted probabilities of overall survival.
Model validation
The performance of the DBN survival Cox model was internally validated using the training set and externally authorized using the validation set. Discrimination for the model was evaluated by the area under the curve (AUC) of the receiver operating characteristic (ROC) curve. AUC values can range from 0 to 1, with values of 0.5-0.7 demonstrating poor discrimination, 0.7-0.8 acceptable, and >0.8 excellent discrimination (38). The calibration curve was used to assess the agreement between model-predicted and actual overall survival. The closer the calibration curve is to the 45-degree diagonal, the more perfect the model will be (39).
Assessment of clinical utility
Decision curve analysis (DCA) was used to explore the net benefit of the prognostic model over the entire range of probability thresholds (40, 41). DCA calculates the clinical net benefit acquired by applying the studied DBN survival Cox model to make clinical decisions. In the plot of DCA, the X-axis indicates thresholds probability for survival, and the Y-axis demonstrates net benefits depending on different thresholds probability. The pink horizontal line parallel to the X-axis represents no clinical action taken by any patient (“treat none”) and their net clinical benefit is 0. The smooth arc line represents that all patients took clinical action (“treat all”), and their net clinical benefit is a backslash with a negative slope. The curve of “treat by DBN survival Cox model” is then compared to “treat all” and to “treat none”.
Statistical analysis
The baseline categorical variables were represented as their counts and percentage, and continuous variables were expressed as means with standard deviation (SD) and medians [interquartile ranges (IQR)]. The COX proportional hazards with the LASSO penalty were used to identify predictor variables for survival. The lambda (λ) parameters in LASSO regression analysis were chosen for minimized expected model deviance. To compare deep learning and six machine learning algorithms, univariate and multivariate logistic regression analysis were conducted for parameter selection. After a univariate logistics regression analysis of all collected variables, those with a P-value < 0.05 were included in multivariable logistics regression. Variables with P-value < 0.05 in multivariable analysis were used to build the model. Accuracy, precision, recall, and F-measure were used to assess the performance of algorithms. The discrimination and calibration of the model in training and validation sets were evaluated by using the AUC and calibration curve. Clinical applicability was analyzed using DCA. P values < 0.05 indicated statistical significance. The statistical analyses were performed using RStudio software version 1.1.414 (Boston, MA, USA).
Results
Data characteristics
In total, 1,094 eligible Osteosarcoma patients from the SEER database were identified and designated as the training set. The validation set included 107 patients from the Second Hospital of Jilin University, the Second Hospital of Dalian Medical University, Xianyang Central Hospital and Liuzhou People’s Hospital. An overview of the total datasets combining the training set and validation set is shown in Figure 1A.
Figure 1 Overview of the total datasets combining the training set and validation set and LASSO model profile plots. (A), Heatmap of each clinical factor in the total datasets. (B), Coefficient profile plots showing how the size of the coefficients of clinical factors shrinks with increasing value of the penalty, with the factors and their regression coefficients selected for the model based on the optimal for the LASSO model. (C), Penalty plot for the LASSO model; color error bars indicate standard error. LASSO, least absolute shrinkage and selection operator.
Predictor variables of overall survival
Among 14 candidate survival-associated variables, 8 variables with statistically significant hazard ratios were selected based on LASSO Cox regression analysis (Figures 1B, C). These were sex, age, primary site, grade, T stage, surgery, bone metastases, and lung metastases.
Compare with other machine learning classifiers
Uni- and multivariate logistic regression analysis identified that sex, bone metastasis, surgery, N and T were significant factors for predicting pulmonary metastasis (Table 1). For training set, the results of DBN algorithm and six ML algorithms (DT, GBM, LR, NBC, RF and XGB) are shown in Figure 2. The accuracy, precision, recall, and F-Score of DBN were 0.917 ± 0.017, 0.896 ± 0.022, 0.956 ± 0.018 and 0.925 ± 0.018, respectively. Among the six ML algorithms, the accuracy, precision, recall and F- Score of the best algorithm (XGB) were 0.712 ± 0.014, 0.689 ± 0.026, 0.754 ± 0.044 and 0.724 ± 0.017, respectively. For the validation set, the comparison result is shown in Figure 3. The accuracy, precision, recall, and F-Score of DBN and XGB were 0.888/0.665, 0.642/0.326, 0.900/0.750 and 0.750/0.455, respectively. Figure 4 shows the ranking of the top 12 feature importance according to the DBN and the XGB algorithm. As shown, ‘N stage’ and ‘T stage’ was the most two important predictors of lung metastasis in the DBN algorithm. ‘Surgery’ was the most important predictor, and ‘T stage’ was the second most important predictor of lung metastasis in the XGB algorithm.
Figure 2 Comparison of DBN (A) algorithm and other 6 machine learning algorithms including DT (B), GBM (C), LR (D), RF (E), NBC (F) and XGB (G) for accuracy, precision, recall, and F-measure (F1-Score) in training set. DBN, Deep Belief Networks; DT, Decision Tree; GBM, Gradient Boosting Machine; LR, Logistic Regression; RF, Random Forest; NBC, Naive Bayes Classifier; XGB, XGBoost.
Figure 3 Comparison of DBN algorithm and other 6 machine learning algorithms including DT, GBM, LR, RF, NBC and XGB for accuracy, precision, recall, and F-measure (F1-Score) in validation set. DBN, Deep Belief Networks; DT, Decision Tree; GBM, Gradient Boosting Machine; LR, Logistic Regression; RF, Random Forest; NBC, Naive Bayes Classifier; XGB, XGBoost.
Figure 4 Nomogram for predicting overall survival for 1-, 3-, and 5-years in OS patients. Each variable value for the individuals was determined according to the top Points scale, and then the points for each variable were added. Finally, a personalized survival probability was obtained according to the bottom Total Points scale.
Model development
The variables of sex, age, primary site, grade, T, surgery, and bone metastases were combined with the output of DBN-based lung metastasis predictive model to construct an integrated DBN survival Cox model for predicting overall survival in OS patients. To make the DBN survival Cox model more intuitional, a nomogram (Figure 5) was then constructed. An online web calculator embedding a dynamic nomogram with our DBN survival Cox model was also developed, which is available at https://drwenleli0910.shinyapps.io/ODSapp/. After filling in the online form as required in the webpage, the webpage will automatically generate a personalized nomogram, together with the probability of survival at 1-, 3-, and 5-years.
Figure 5 Feature importance. The top 12 feature importance results for the DBN (A) model and XGB (B) model trained on training set. DBN, Deep Belief Networks; XGB, XGBoost.
Performance of the prediction model
The AUC of the DBN survival Cox model at 1-, 3-, 5-years in the training set were 0.851, 0.806, and 0.793, respectively (Figures 6A–C). Calibration curves showed that the predicted and actual survival rates matched very well at 3 and 5 years, and were acceptable at 1-year (Figures 6D–F). In the validation set, the AUC of DBN survival Cox model at 1-, 3-, 5-years were 0.876, 0.827, and 0.814, respectively (Figures 7A–C). The predicted and actual survival rates matched very well at 1-year, and were accepted at 3- and 5-years (Figures 7D–F). The risk curve, survival status and time, and expression of features in the DBN survival Cox model were also represented according to the high- and low-risk groups in the training and validation set (Figure 8).
Figure 6 The AUC and calibration curves of DL survival Cox model at 1-, 3-, 5-years in the training set. (A–C), the AUC of the model at 1-, 3-, 5-years. (D–F), the calibration curves of model at 1-, 3-, 5-years. AUC, area under the curve; DL, deep learning.
Figure 7 The AUC and calibration curves of DL survival Cox model at 1-, 3-, 5-years in the validation set. (A–C), the AUC of model at 1-, 3-, 5-years. (D–F), the calibration curves of model at 1-, 3-, 5-years. AUC, area under the curve; DL, deep learning.
Figure 8 Risk score analysis of deep learning survival Cox model in training (A) and validation (B) set.
Assessment of clinical utility
The decision curves for 1-, 3- and 5-years in the training set and validation set (Figures 9A, B) demonstrated relatively good performance for the DBN survival Cox model in terms of clinical application. As shown, when the threshold probability in the clinical decision was more than 0.063, 0.151, and 0.193 for 1 year, 3 years, and 5 years, respectively, the DBN survival Cox model provided more of a net benefit than “ treat all “ or “ treat none”.
Figure 9 Decision curves for the DL survival Cox model for 1-, 3-, 5-years in the training set (A) and validation set (B). The X-axis indicates thresholds probability for lung metastasis risk, and the Y-axis indicates net benefits depending on different thresholds probability. The pink horizontal line parallel to the X-axis represents no clinical action taken by any patient (“treat none”) and their net clinical benefit is 0. The smooth arc line represents that all patients took clinical action (“treat all”), and their net clinical benefit is a backslash with a negative slope. The curve of “treat by DL survival Cox model” is then compared to “treat all” and to “treat none”.
Discussion
Although the effectiveness of the localized Osteosarcoma treatment has gradually improved, the 5-year overall survival of Osteosarcoma patients with lung metastasis is less than 30%, suggesting that these patients still fare poorly (12–14). Several studies have explored potential risk factors for tumor metastasis to facilitate early management (8, 9). However, these articles mainly concentrated on the impact of a single factor on metastasis, a study on the combined impact of multiple factors on metastasis is still lacking.
Cox proportional hazards model is the traditional statistical approach for survival analysis and risk prediction. The data in this model are required to meet prior assumptions, including the proportional hazard assumptions and linear relationships between continuous covariates and the log hazard function.
However, in the complex biomedical field, there are many nonlinear relationships (42, 43). In this context, a single Cox model with proportional hazards is likely to produce less accurate estimations of survival outcomes. Therefore, new solutions containing these potentially nonlinear variables are highly needed to accurately predict the survival of individuals,
DBN, a relatively new computational algorithm that has become a popular research topic, has been rapidly developed and widely used in medical research. It is a DL-based generative model, when a network contains a large number of deep layers, which addresses the problem of vanishing gradients that suffer from traditional gradient-based learning algorithms. In cancer research, the advantages of DBN model for survival analysis are as follows: First, this model shows a modified fit for features with a nonlinear relationship, which applies to the nonlinear associations that are abundant in real-world practice. Second, as a DL model, the DBN automatically learns complex mapping by transforming the features through the multi-layer structure. At present, most ML algorithms have applied shallow-structured architectures.
These models are specifically effective in solving well–constrained problems. However, several studies have confirmed reasons for using deep structures (44–46). Deep models may be more robust in the wide variety of functions that can be parameterized by composing non-linear transformations (47). They also allow more efficient representation of highly varying functions than shallow architectures. In addition, a prominent problem with traditional algorithms is the requirement for a certain level of domain expertise to design a feature extractor that converts raw data into a suitable feature vector (16). DBN allows a system to be fed with raw data and to automatically discover the required representations. This study shows that the DBN-based model can achieve better prediction performance than other machine learning algorithms.
Our predictive models currently use seven clinical variables, including common demographic and clinical characteristics and histopathological outcomes. The SEER database was used with relatively complete data during the model development, while some data were missing in the external validation set. Our model has tolerance for missing data, and we still achieved high performance on the external validation set even missing 30% data of the patients. In addition, a dynamic nomogram can provide dynamic assessments as missing data are supplemented during subsequent treatment.
There were aols some limitations despite the promising results. First, the training set was extracted from the SEER database with a relatively small sample size, although an independent validation group from various hospitals was used. Thus, a large sample of multicenter data was required to fully evaluate the generalization ability of the DBN survival COX model. Second, the SEER database only provided limited clinical variables, and many variables closely associated with tumor metastasis and survival, such as tumor markers and gene expression, were not available. Future studies could incorporate these potentially essential factors and construct a more comprehensive predictive model.
Conclusion
In our study, a DBN survival Cox model was established to predict the overall survival in Osteosarcoma patients. Compared to the other six ML algorithms, this DNB algorithm demonstrated better performance. Both internal validation and external validation revealed good generalizability. In addition, an individualized risk estimate of survival can be calculated through the nomogram and online web calculators developed by this study. This allows the model to be applied to clinical practice and helps with clinical decision-making.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.
Author contributions
CLY, WZ and XWF completed the entire research design. WLL, YZD and WCL participated in the research and analyzed data. WLL and YZD drafted manuscripts. CYS and SL provided expert consultation and advice. SYC, RB, QZ, CX, WYL and BW participated in its design and coordination, HSW, STD and ZHH helped polish the language. All authors reviewed the final version of the manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA: A Cancer J Clin (2020) 70(1):7–30. doi: 10.3322/caac.21590
2. Misaghi A, Goldin A, Awad M, Kulidjian AA. Osteosarcoma: a comprehensive review. SICOT-J (2018) 4(1):12. doi: 10.1051/sicotj/2017028
3. Amanda, Dancsok AR, Asleh-Aburaya K, Nielsen TO. Advances in sarcoma diagnostics and treatment. Oncotarget (2017) 8(4):7068–93. doi: 10.18632/oncotarget.12548.
4. El Beaino M, Araujo DM, Lazar AJ, Lin PP. Synovial sarcoma: Advances in diagnosis and treatment identification of new biologic targets to improve multimodal therapy. Ann Surg Oncol (2017) 24(8):1–10. doi: 10.1245/s10434-017-5855-x
5. Gronchi A, Maki RG, Jones RL. Treatment of soft tissue sarcoma: a focus on earlier stages. Future Oncol (2017) 13(1s):13–21. doi: 10.2217/fon-2016-0499
6. Rainusso N, Wang LL, Yustein JT. The adolescent and young adult with cancer: State of the art - bone tumors. Curr Oncol Rep (2013) 15(4):296–307. doi: 10.1007/s11912-013-0321-9.
7. Kager L, Tamamyan G, Bielack S. Novel insights and therapeutic interventions for pediatric osteosarcoma. Future Oncol (2017) 13(4):357–68. doi: 10.2217/fon-2016-0261.
8. Jo VY, Fletcher CD. WHO classification of soft tissue tumours: an update based on the 2013. (4th) edition. Pathology (2014) 46(2):95–104. doi: 10.1097/PAT.0000000000000050.
9. Bielack SS, Kempf-Bielack B, Delling G, Exner GU, Flege S, Helmke K, et al. Prognostic factors in high-grade osteosarcoma of the extremities or trunk: an analysis of 1,702 patients treated on neoadjuvant cooperative osteosarcoma study group protocols. J Clin Oncol (2002) 20(3):776–90. doi: 10.1200/JCO.2002.20.3.776.
10. Kager L, Zoubek A, Pötschger U, Kastner U, Flege S, Kempf-Bielack B, et al. Primary metastatic osteosarcoma: presentation and outcome of patients treated on neoadjuvant cooperative osteosarcoma study group protocols. J Clin Oncol (2003) 21(10):2011–8. doi: 10.1200/JCO.2003.08.132
11. Isakoff MS, Bielack SS, Meltzer P, Gorlick R. Osteosarcoma: Current treatment and a collaborative pathway to success. J Clin Oncol Off J Am Soc Clin Oncol (2015) 33(27):3029–35. doi: 10.1200/JCO.2014.59.4895.
12. Harting MT, Blakely ML. Management of osteosarcoma pulmonary metastases. Semin Pediatr Surg (2006) 15(1):25–9. doi: 10.1053/j.sempedsurg.2005.11.005
13. Bacci G, Rocca M, Salone M, Balladelli A, Ferrari S, Palmerini E. High grade osteosarcoma of the extremities with lung metastases at presentation: Treatment with neoadjuvant chemotherapy and simultaneous resection of primary and metastatic lesions. J Surg Oncol (2008). 98(6):415–20. doi: 10.1002/jso.21140
14. Geller DS, Gorlick R. Osteosarcoma: a review of diagnosis, management, and treatment strategies. Clin Adv Hematol Oncol (2010) 8(10):705–18.
15. Prentice RL, Zhao S. Regression Models and Multivariate Life Tables. J Am Stat Assoc (2021) 116(535):1330–45. doi: 10.1080/01621459.2020.1713792.
16. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature (2015) 521(7553):436–44. doi: 10.1038/nature14539.
17. Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Comput (2006) 18(7):1527–54. doi: 10.1162/neco.2006.18.7.1527
18. Brendlin AS, Peisen F, Almansour H, Afat S, Eigentler T, Amaral T, et al. A machine learning model trained on dual-energy CT radiomics significantly improves immunotherapy response prediction for patients with stage IV melanoma. J Immunother Cancer (2021) 9(11) :e003261. doi: 10.1136/jitc-2021-003261.
19. Kawakami E, Tabata J, Yanaihara N, Ishikawa T, Koseki K, Iida Y, et al. Application of artificial intelligence for preoperative diagnostic and prognostic prediction in epithelial ovarian cancer based on blood biomarkers. Clin Cancer Res Off J Am Assoc Cancer Res (2019) 25(10):3006–15. doi: 10.1158/1078-0432.CCR-18-3378.
20. Tahmassebi A, Wengert GJ, Helbich TH, Bago-Horvath Z, Alaei S, Bartsch R, et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Invest Radiol (2019) 54(2):110–7. doi: 10.1097/RLI.0000000000000518.
21. Cerentini A, Welfer D, Cordeiro d'Ornellas M, Pereira Haygert CJ, Dotto GN. Automatic identification of glaucoma using deep learning methods. Stud Health Technol Inf (2017) 245:318–21.
22. LeCun Y, Bengio Y, Hinton G. Deep learning Nature. (2015) 521(7553):436–44. doi: 10.1038/nature14539.
23. Lei T, Qian H, Lei P, Hu Y.Ferroptosis-related gene signature associates with immunity and predicts prognosis accurately in patients with osteosarcoma. Cancer Sci (2021) 112(11):4785–98. doi: 10.1111/cas.15131.
24. Kocher M, Ruge MI, Galldiks N, Lohmann P Applications of radiomics and machine learning for radiotherapy of malignant brain tumors. Strahlenther Onkol (2020) 196(10):856–67. doi: 10.1007/s00066-020-01626-8
25. Kaur A, Sachdeva R, Singh A. Latest trends in deep learning for automatic speech recognition system, in: International Conference on Artificial Intelligence and Speech Technology, (Cham: Springer) (2021) 62–72. doi: 10.1007/978-3-030-95711-7_6
26. Fayaz Begum S, Prasanthi B. Investigation of level set segmentation procedures in brain MR images. ICCCE (2020) (Singapore: Springer) (2021) 431–8. doi: 10.1007/978-981-15-7961-5_43.
27. Corbin D, Lesage F. Assessment of the predictive potential of cognitive scores from retinal images and retinal fundus metadata via deep learning using the CLSA database. Sci Rep (2022) 12(1):5767. doi: 10.1038/s41598-022-09719-3
28. Pacheco A, Krohling R. An attention-based mechanism to combine images and metadata in deep learning models applied to skin cancer classification. IEEE J Biomed Health Inf (2021) 25(9):3554–63. doi: 10.1109/JBHI.2021.3062002.
29. Hassan R, Islam M F, Uddin M Z, Ghoshal G, Hassan M M, Huda S. Prostate cancer classification from ultrasound and MRI images using deep learning based explainable artificial intelligence. Future Generation Computer Systems (2022) 127:462–72. doi: 10.1016/j.future.2021.09.030.
30. Pan L, Ji B, Wang H, Wang L, Liu M, Chongcheawchamnan M, et al. MFDNN: multi-channel feature deep neural network algorithm to identify COVID19 chest X-ray images. Health Inf Sci Syst (2022) 10(1):4. doi:10.1007/s13755-022-00174-y.
31. Cik I, Rasamoelina A D, Mach M, Sinčák P. Explaining deep neural network using layer-wise relevance propagation and integrated gradients. In: 2021 IEEE 19th world symposium on applied machine intelligence and informatics (SAMI) (2021). doi: 10.1109/SAMI50585.2021.9378686
32. Xu Y, He X, Li Y, Pang P, Shu Z, Gong X. The nomogram of MRI-based radiomics with complementary visual features by machine learning improves stratification of glioblastoma patients: A multicenter study. J Magnetic Resonance Imaging (2021) 54(2):571–83. doi: 10.1002/jmri.27536.
33. Mao B, Ma J, Duan S, Xia Y, Tao Y, Zhang L. Preoperative classification of primary and metastatic liver cancer via machine learning-based ultrasound radiomics. Eur Radiol (2021) 31(7):4576–86. doi: 10.1007/s00330-020-07562-6
34. Lee GW, Shin H, Chang MC. Deep learning algorithm to evaluate cervical spondylotic myelopathy using lateral cervical spine radiograph. BMC Neurol (2022) 22(1):147. doi: 10.1186/s12883-022-02670-w.
35. Ma R, Vanstrum EB, Lee R, Chen J, Hung AJ. Machine learning in the optimization of robotics in the operative field. Curr Opin Urol (2020) 30(6):808–16. doi: 10.1097/MOU.0000000000000816
36. Augsburger M, Galatzer-Levy IR. Utilization of machine learning to test the impact of cognitive processing and emotion recognition on the development of PTSD following trauma exposure. BMC Psychiatry (2020) 20(1):325. doi: 10.1186/s12888-020-02728-4.
37. Fessler J, Gouy-Pailler C, Fischler M, Le Guen M. Machine learning in lung transplantation. J Heart Lung Transplant (2020) 39(4):S385. doi: 10.1016/j.healun.2020.01.497
38. Alba AC, Agoritsas T, Walsh M, Hanna S, Iorio A, Devereaux PJ. Discrimination and calibration of clinical prediction models: Users' guides to the medical literature. JAMA J Am Med Assoc (2017) 318(14):1377. doi: 10.1001/jama.2017.12126
39. Koller M, Witteman J, Steyerberg E. Prognostic models with competing risks: methods and application to coronary risk prediction. Epidemiology (2009) 20(4):555–61. doi: 10.1097/EDE.0b013e3181a39056
40. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: Seven steps for development and an ABCD for validation. Eur Heart J (2014) 35(29):1925–31. doi: 10.1093/eurheartj/ehu207.
41. Vickers AJ, Elkin EB. Decision curve analysis: A novel method for evaluating prediction models. Med Decision Making (2006) 26(6):565–74. doi:10.1177/0272989X06295361.
42. Chi SQ, Tian Y, Li J, Tong DY, Kong XX, Poston G. Time‐dependent and nonlinear effects of prognostic factors in nonmetastatic colorectal cancer. Cancer Med (2017) 6(8):1882–92. doi:10.1002/cam4.1116
43. Joensuu H, Vehtari A, Riihimäki J, Nishida T, Steigen SE, Brabec P. Risk of recurrence of gastrointestinal stromal tumour after surgery: an analysis of pooled population-based cohorts. Lancet Oncol (2012) 13(3):265–74. doi: 10.1016/S1470-2045(11)70299-6.
44. Bengio Y, Lecun Y. Scaling learning algorithms towards AI. In: Large-Scale kernel machines (2007) 34(5):1–41.
45. Bengio Y. Learning deep architectures for AI. Now Publishers Inc (2009) 2(1):1–129. doi: 10.1561/2200000006
46. Wu EQ, Qiu X Y, Tang Z, Zhang W M, Zhu L M, Ren H, et al. Detecting fatigue status of pilots based on deep learning network using EEG signals. IEEE Trans Cogn Dev Syst (2020) 13(3):575–85. doi: 10.1109/TCDS.2019.2963476
Keywords: Osteosarcoma, lung metastasis, deep belief networks, prognosis, clinical decision
Citation: Li W, Dong Y, Liu W, Tang Z, Sun C, Lowe S, Chen S, Bentley R, Zhou Q, Xu C, Li W, Wang B, Wang H, Dong S, Hu Z, Liu Q, Cai X, Feng X, Zhao W and Yin C (2022) A deep belief network-based clinical decision system for patients with osteosarcoma. Front. Immunol. 13:1003347. doi: 10.3389/fimmu.2022.1003347
Received: 26 July 2022; Accepted: 19 October 2022;
Published: 18 November 2022.
Edited by:
Jinming Han, Capital Medical University, ChinaReviewed by:
Qun Zhao, Fourth Hospital of Hebei Medical University, ChinaYuan Xiong, Huazhong University of Science and Technology, China
Copyright © 2022 Li, Dong, Liu, Tang, Sun, Lowe, Chen, Bentley, Zhou, Xu, Li, Wang, Wang, Dong, Hu, Liu, Cai, Feng, Zhao and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chengliang Yin, Y2hlbmdsaWFuZ3lpbkAxNjMuY29t; Wei Zhao, MzQzMDI2MDNAcXEuY29t; Xiaowei Feng, Znh3NjAwQHFxLmNvbQ==
†These authors have contributed equally to this work
‡ORCID ID: Chengliang Yin, orcid.org/0000-0001-8262-5749