Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study

Qu, Wenjie; Liu, Qingqing; Jiao, Xinlin; Zhang, Teng; Wang, Bingyu; Li, Ningfeng; Dong, Taotao; Cui, Baoxia

doi:10.3389/fonc.2020.623818

ORIGINAL RESEARCH article

Front. Oncol. , 18 February 2021

Sec. Women's Cancer

Volume 10 - 2020 | https://doi.org/10.3389/fonc.2020.623818

Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study

Wenjie Qu¹

Qingqing Liu¹

Xinlin Jiao²

Teng Zhang²

Bingyu Wang¹

Ningfeng Li¹

Taotao Dong^2*

Baoxia Cui^2*

¹Cheeloo College of Medicine, Shandong University, Jinan, China
²Department of Obstetrics and Gynecology, Qilu Hospital of Shandong University, Jinan, China

Background: The aim was to develop a personalized survival prediction deep learning model for adenosarcoma patients using the surveillance, epidemiology and end results (SEER) database.

Methods: A total of 797 uterine adenosarcoma patients were enrolled in this study. Duplicated and useless variables were excluded, and 15 variables were selected for further analyses, including age, grade, positive lymph nodes or not, marital status, race, tumor extension, stage, and surgery or not. We created our deep survival learning (DSL) model to manipulate the data, which was randomly split into a training set (n = 519, 65%), validation set (n = 143, 18%) and testing set (n = 143, 18%). The Cox proportional hazard (CPH) model was also included comparatively. Finally, personalized survival curves were plotted for randomly selected patients.

Results: The c-index for the CPH model was 0.726, and the Brier score was 0.17. For our deep survival learning model, we achieved a c-index of 0.774 and a Brier score of 0.14 in the external testing set. In addition, the limitations of the traditional staging system were revealed, and a personalized survival prediction system based on our risk scoring grouping was developed.

Conclusions: Our study developed a deep neural network model for adenosarcoma. The performance of this model was superior to that of the traditional Cox proportional hazard model. In addition, a personalized survival prediction system was developed based on our deep survival learning model, which provided more accurate prognostic information for adenosarcoma patients.

Introduction

Adenosarcoma is a rare tumor of the female genital tract, accounting for approximately 5% of uterine sarcoma. The National Comprehensive Cancer Network (NCCN) and International Federation of Gynecology and Obstetrics (FIGO) have published clinical practice guidelines and staging systems for adenosarcoma. Standard treatment is total hysterectomy with bilateral salpingo-oophorectomy, and neither radiotherapy nor chemotherapy has been proven beneficial (1). Patients with stage I disease often have a good prognosis with a 5-year survival of 60–80% (1). However, tumors demonstrate sarcomatous overgrowth (>25% of the total tumor volume consists of pure sarcoma) course with a higher rate of recurrence and death (2). Due to the low incidence and histological diversity of uterine adenosarcoma, only a few case reports and series provide data on prognostic factors and survival prediction (3). Researchers have developed prognostication studies with different methods, including univariate analysis, multivariate analysis, multivariable Cox regression and the Kaplan-Meier method (4, 5), among which the most commonly used is multivariable Cox regression analyses. However, whether these traditional methods accurately work remains debatable. Therefore, an accurate prognostication system is crying needed for treatment decisions and survival prediction.

As mentioned above, most researchers were restricted to the low incidence and rare cases of adenosarcoma. The surveillance, epidemiology and end results (SEER) database is a population-based data source covering approximately 34.65% of the U.S. population. Clinical data have been collected since 1973, including the stage of cancer, histopathological subtypes, treatment modality, and survival data (6). The database has been used in various studies to perform survival analyses of all malignant tumors, including adenosarcoma (7). However, most of these studies used the Cox proportional hazard (CPH) model, which cannot handle nonlinear correlations in survival analyses.

With the rapid development of artificial intelligence, a new choice is provided for adenosarcoma researchers. The deep learning method allows a machine to be fed with raw data and to automatically discover the representations needed for detection with the use of multiple neural layers in the network (8). It has the ability to analyze the nonlinear correlations that are more common in the real world. It has been proven to be greatly effective for various clinical tasks, including image identification (9, 10), pathological diagnoses (11–13), genomic analysis (14, 15), metabolomics (16), and immunology (17) studies. Combining the SEER database with the deep learning method is a good choice that has been proven to be valuable in many cancer prognosis studies, such as breast cancer and lung cancer (18, 19). However, studies taking advantage of the abundant cases in the SEER database and the high efficiency of the deep learning method are absent in the prognosis of adenosarcoma.

In this study, we aimed to develop a survival prediction deep learning model for adenosarcoma patients collected from the SEER database. With this model, better prediction was achieved. We also attempted to develop a new personalized survival prediction system based on the model we established.

Materials and Methods

Data Collection

The SEER Program is a comprehensive source of population-based information in the United States that includes the cancer stage at the time of diagnosis and patient survival data. It updates annually and is provided as a public service for researchers.

The SEER database had 133 usable variables. In this study, we used “CS Schema v2040+”, which was collected under the specifications of a particular schema based on site and histology, to select corpus adenosarcoma patients from 1973 to 2014. Only cases with one primary tumor were included in our study. We also excluded cases with incomplete follow-up data. Cases with a follow-up time equal to 0 which might indicate death in the hospital or other recording error were excluded too because of their great uncertainty. We kept corpus adenosarcoma patients of all stages, and the final sample size was 797.

Since the SEER dataset utilized publicly available desensitized data, this study did not need approval from the institutional review board (IRB) or informed consent from patients.

Data Preparation

Among 133 original variables, we excluded those duplicated variables using correlation matrix analyses. The selected variables for further analyses were age, year when patients were diagnosed, diameter of tumor, grade, Hispanic status, number of excised lymph nodes, positive lymph nodes or not, number of positive lymph nodes, metastasis, marital status, race, extension of tumor, stage, surgery or not, and surgery type.

Stages were defined from the farthest extension of the tumor and lymph nodes involved as category variables. The SEER catalog is named the Extent of Disease (EOD), which is used for cases diagnosed before 2004, and Collaborative Stage (CS), which is used for data after 2004. We also redefined marital status as single, married, divorced and windowed. In the SEER database, several methods were introduced to define race. In this study, we classified race into white, black, and Asian, among which Hispanic was singled out as two classified variables. In addition, two classified variables also included positive lymph nodes or not and surgery or not. Grade was defined as a category variable indicating undifferentiation and low, moderate or high differentiation. Moreover, surgery type was defined as a category variable indicating local excision, hysterectomy and bilateral adnexectomy plus or not plus lymphadenectomy. The number of excised lymph nodes, number of positive lymph nodes, diameter of tumor, and age were defined as continuous variables. Extension of the tumor, which indicated localization, parametrium or distance, was defined as a category variable.

Deep Survival Neural Network

The original multitask logistic regression model was first developed by Chun-Nam Yu in “Deep neural networks for survival analysis based on multitask framework” (20). Then, S. Fotso updated this model to a kind of neural multitask logistic regression model (N-MTLR) in “Learning patient-specific cancer survival distributions as a sequence of dependent regressors” (21). In this work, we used this kind of N-MTLR to develop our deep survival neural network for survival analyses.

In summary, this model first transformed patient follow-up time into a series of time vectors annotated with 0. If a patient had the event (=1), then the corresponding time point changed to 1. For a censored patient, all of the censored time vectors were annotated as 1.

Statistical Analyses and Evaluation of Models

Overall survival was defined as the final outcome of patients, which was measured by interval time between diagnoses and death or loss of follow-up. Both the CPH model and deep survival learning model were evaluated in this study. For the deep learning model, we used the independent testing set to evaluate the performance to prevent potential overfitting. We used the concordance index (c-index) and the integrated Brier scores (IBS) to evaluate the performances of different models. Differences between predicted and actual data were also recorded.

Kaplan-Meier curves and Cox regression analyses for patients staged with the traditional staging system were performed. The difference was considered significant if the P value was less than 0.05. Our model assigned precise weights to each variable after data training and multiple iterations. According to the final weights, a risk score was calculated by the DSL model. we developed new staging groups according to the risk score. Finally, personalized survival curves were also plotted for randomly selected patients from the testing set.

The deep learning model was developed on the PyTorch framework. Scikit-learn and pandas packages were also used for the treatment of data. We also used STATA software (version 13) for other statistical analyses.

Results

Patient Demographics and Characteristics

A total of 797 corpus adenosarcoma patients registered from 1973 to 2014 in the SEER database were enrolled in this study. A correlation matrix was plotted, and 15 variables correlated with survival were selected (Figure 1). The selected data were randomly and automatically split into a training set (n = 519, 65%), validation set (n = 143, 18%) and testing set (n = 143, 18%).

FIGURE 1

Figure 1 Correlation matrix of 15 selected features. Values in this figure indicated the correlation coefficient of two corresponding variables. Color indicated strength of correlation, in which dark blue indicated strong positive, and dark red indicated strong negative relationships. Diagnosis: year when patients were diagnosed. Diam: diameter of tumor. Lymex: number of excised lymph nodes. Lympo: number of positive lymph nodes. Lymph: positive lymph nodes or not. Spread: extension of tumor. Surg: surgery or not. Surgery: surgery type.

The patient demographic characteristics are shown in Table 1. A total of 594 cases were white (75.8%), 115 were black (14.7%), and 75 were Asian (9.5%). A total of 172 cases were single (22.9%), 388 were married (51.7%), 85 were divorced (11.3%), and 106 were widowed (14.1%). Eighty-eight cases were undifferentiated (24.7%), 52 were poorly differentiated (14.6%), 133 were moderately differentiated (37.2%), and 84 were highly differentiated (23.5%). A total of 588 patients had localized tumors (77.7%), 127 patients extended to the parametrium (16.8%), and 42 patients extended to a distance (5.5%). A total of 239 cases were stage I (88.1%), 11 were stage II (4.1%), 7 were stage III (2.6%), and 14 were stage IV (5.2%). Twelve patients underwent local excision surgery (11.2%), 56 underwent a hysterectomy and bilateral adnexectomy (52.3%), and 39 underwent a hysterectomy and bilateral adnexectomy plus lymphadenectomy (36.5%).

TABLE 1

Table 1 Patients demographic and clinicopathological characteristics.

Cox Proportional Hazard Model

The cox proportional hazard (CPH) model was first developed for multivariable analysis, dealing with both category and continuous variables. The concordance index (c-index) has been widely used in the survival analysis of several cancers (22, 23). Generally, when the c-index is close to 1, the model has almost perfect predicted ability, but when it is close to 0.5, the model has no power to discriminate a risk factor. In this study, the CPH model achieved a c-index of 0.726 in survival prediction.

The Brier score measures the accuracy of probabilistic predictions. Because it is a cost function, a lower score indicates more accurate predictions, while a higher score indicates less accurate predictions. In this study, a Integrated Brier Score (IBS) of 0.17 was achieved using the CPH model (Figure 2A).

FIGURE 2

Figure 2 Performance of cox proportional hazard (CPH) model. (A) CPH model has 0.17 of IBS (below 0.25 obviously). (B) CPH model make a median absolute error of 1.615 patients and mean absolute error of 2.223 patients during 12000 days of follow-up time in testing set. (C) Predicted and actual survival curves plotted by CPH model. It made a median absolute error of 13.726 and mean absolute error of 14.626. As we can see from the figure, some spots were plotted outside the confidence intervals.

In addition, median absolute error and mean absolute error measure the variability between prediction and reality. A median absolute error of 1.615 and a mean absolute error of 2.223 were achieved by the CPH model in our study (Figure 2B). However, in regard to survival curves, many areas of the predicted survival curves were plotted outside the confidence intervals of actual survival curves, and greater absolute errors were shown (Figure 2C). The ability to perform the CPH model may be limited by missing data for many patients.

Deep Survival Learning (DSL) Model Building

The CPH model performed well in linear relationships, while the survival problem contained nonlinear relationships in the real world. Therefore, a neural network was introduced for nonlinear relationships, while the classic deep learning method failed in handling time-to-event data. Here, we used a “multitask logistic regression model” to handle specific survival data and undertake censored data.

The structure of the final model is four layers, each of which has 50 neurons. The grid search method was used for hyperparameter selection. The selected optimal hyperparameters were as follows: initial method was glorot_uniform, the dropout rate was 0.3, l2 regularization was 1e-2, l2 smooth was 1e-2, the optimizer was Adam, and learning_rate was 1e-4. After 2,000 iterations, the loss value decreased from 1,300 to 762 (Figure 3). Finally, a c-index of 0.831 was achieved in the validation set.

FIGURE 3

Figure 3 Values of loss function for DSL model decrease from 1,300 to 762 after 2,000 time of iterations.

DSL Model in the Testing Set

Overfitting is a common prediction error in machine learning. It means a model doing much better on the training set than on the test set. In another word, a model has low generalization. To prevent potential overfitting of our model, we evaluated the performance of the DSL model using an independent testing set instead of a training set and validation set. Finally, our model reached a c-index of 0.774 and a IBS of 0.14 in the external testing set (Figure 4A). Moreover, 1.989 of the median absolute error and 2.621 of the mean absolute error (Figure 4B) were achieved in each time interval. This result suggested that our model could perform well in survival prediction.

FIGURE 4

Figure 4 Performance of Deep survival learning (DSL) model. (A) In the independent testing set, we achieved 0.14 of brier score using our DSL model. (B) DSL model made a median absolute error of 1.989 patients and mean absolute error of 2.621 patients during 12,000 day of follow-up time in testing set. (C) DSL model made a median absolute error of 3.851 and mean absolute error of 5.632 in survival curve prediction. Nearly all spots of predicted curve lied within confidence intervals of actual curve and the predicted curve was drew similarly to the actual one.

In addition, calibration survival curves were also drawn using the testing set. Calibration curves showed that nearly all areas of the predicted survival curves were plotted within confidence intervals of actual survival curves (Figure 4C), suggesting that the predicted survival result was amply credible and obviously better than the CPH model in this study.

Personalized Survival Prediction Using the DSL Model

Kaplan-Meier curves were plotted for patients from the conventional staging system (Figure 5A). The difference in survival between stage I patients from the other three stages was significant (P < 0.001); however, the difference between stages II, III and IV was inapparent. Mortality for stage II, III and IV patients increased 3.9-, 4.7-, and 5.5-fold, respectively, relative to the stage I patients in Cox regression analyses.

FIGURE 5

Figure 5 Survival curves for conventional staging system and personalized survival prediction established by DSL model. (A) K-M curve of conventional staging system showed significant difference between stage I from other three stages and inapparent difference between stage II, III and IV. (B) We divided adenosarcoma patients into three stages according to risk factors calculated by our DSL model. Patients with a score of 0-4 were classified in stage I and marked in red color, patients with a score of 4-5.5 in stage II and green color, patients with 5.5-8 score in stage III and blue color.

Risk factors for patients were computed by our DSL model. According to our model, the risk score ranged from 0 to 8. Patients were divided into three staging groups based on the number of patients in different risk levels (Figure 5B). Then, one patient was randomly selected from each group of our new risk-related staging system, and survival curves were painted for the three patients. Six times Repeated selections and validations were carried out (Figure 6). Notable differences were observed, indicating that survival results between patients of our three stages were more significantly different than those of the traditional four stages. In other words, our model may have potential implications in personalized treatment and prediction of adenosarcoma.

FIGURE 6

Figure 6 Personalized survival curve for three randomly selected patients showed apparently diverse results. After six times Repeated selections and validations (A–F), Patient with low risk always has the best survival result, contrasting with patient with high risk resulting in short survival time.

In addition, dividing patients into four or three stages only provided a general impression of survival prediction, since patients differed within stages. However, our model calculated the risk score of one certain patient and describe her personalized survival curve, suggesting that our model may have a latent capacity in the personalized treatment of adenosarcoma.

Discussion

Adenosarcoma is a rare malignancy that is often associated with irregular uterine bleeding and physical complaints. Many studies have been performed for its pathologic characteristics, treatment, and prognosis factors during past decades. Researchers aimed to determine clinical risk factors associated with decreased survival, which may guide the optimal management of this rare tumor (24). Based on previous studies, the most important prognostic factors of adenosarcoma are age, sarcomatous overgrowth, myometrial and lymphovascular invasion, and lymph node involvement; moreover, heterologous stromal components and extrauterine manifestations are also associated with poor prognosis (25). Currently, an increasing number of researchers focus on the survival prediction of adenosarcoma. Among these studies, multivariable logistic regression has been widely used to identify risk factors, and the Cox proportional hazard model has been the most widely applied to predict survival outcomes. However, these traditional methods have been proven to perform worse than the new artificial intelligence method. In this study, we established a deep survival learning model for adenosarcoma patients. To our knowledge, this is the first adenosarcoma prognostication study applying a deep learning method.

Due to the rarity of adenosarcoma, only a small case series of prognosis studies have been launched before (26, 27). Many studies have taken advantage of the SEER database due to its large population-based data (28, 29). In this study, a large number of patients provided by the SEER database were analyzed, which offered statistical strength to our conclusion.

Previous investigations have explored the ability of the CPH model to predict adenosarcoma survival outcomes (7). Since adenosarcoma is associated with multidimensional factors, the conventional model, the CPH model, for example, could not recognize complex nonlinear relationships between the variables. However, the potential for deep learning models in cancer prognostication research has also been revealed by several studies. Ole-Johan Skrede et al. (30) developed a clinically useful prognostic marker for colorectal cancer using a deep learning method and evaluated it in a large, independent patient population. Charlie Saillard et al. (31) developed two deep learning algorithms for hepatocellular carcinoma, and both models had a higher discriminatory power than a score combining all baseline variables associated with survival, confirming that the deep learning method can help refine the prediction of hepatocellular carcinoma prognosis. Since adenosarcoma is associated with multidimensional factors, the conventional model, the CPH model, for example, could not recognize complex nonlinear relationships between the variables. As we can see from our data, the CPH model performed poorly in the survival curve description, and many spots of the predicted curve were plotted outside the confidence intervals. In addition, missing data have a great impact on CPH model performance. In this study, the deep learning model performed the data imputation job and showed better performance. We contributed a better c-index of 0.774 and a Brier score of 0.14, taking advantage of the DSL model.

In addition, past works have never concentrated on the adenosarcoma staging system and prognosis-related subgroups. In our work, we found that the conventional staging system made limited contributions to predicting survival results. Several studies have shown that the majority of patients (73.4–82%) are diagnosed with stage I (32–34). However, these patients have different survival outcomes. In this study, we provided a personalized survival prediction curve for randomly selected patients, which showed a more significant difference than that of the traditional staging system. Profiting from the deep learning method, more possible risk factors were considered in the survival analysis. Brandon-Luke et al. (3) found that primary site, lymph node status, surgical procedure, chemotherapy use, race, insurance status and income quartiles were not significantly associated with overall survival of adenosarcoma using the CPH model. However, in this study, the aforementioned factors were included, and correlations within those variables were considered, which accounted for a better result.

The limitations of this study include the absence of detailed pathological information, such as sarcomatous overgrowth which has been proven to have a significant impact on survival time, as well as some molecular markers such as Ki-67 and p53 or bcl-2. Secondly, peritoneal/ascitic fluid cytology is another adverse risk factor that should be considered. But limited by SEER database, this information was unavailable. Third, only overall survival but not progression-free survival was included in our study. Progression-free survival time is also an important constitution for prognosis, indicating the period from the beginning of treatment to disease progression. However, we made a breakthrough in combining continuous overall time variables with classified death or not variables as outcome indicators, which provided more specific survival predictions.

Above all, only clinical data, including demographics and therapeutic information, were included in this study. Actually, our model could incorporate different types of data, including clinical, hematological, pathological, imaging and genetic information. Comprehensive and massive data could further improve the accuracy of survival prediction of our model. Beyond that, the therapeutic value of radiotherapy and chemotherapy is still to be proven, as well as hormone therapy. Our model could also provide evidence for the feasibility of various follow-up treatments and explore the effect of these options on prognosis with detailed treatment data. Therefore, further studies including a large series with comprehensive information, detailed survival data and multiple patient sources will be needed.

Our model gives survival time predictions that are much more accurate than the traditional survival analysis model. The personalized survival prediction system based on our DSL model showed good performance as well. The extension of our new system to an online program that can update with new measures can be expected.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Ethics Statement

Written informed consent was obtained from the individual(s), and minor(s)’ legal guardian/next of kin, for the publication of any potentially identifiable images or data included in this article.

Author Contributions

WQ: Conceptualization, methodology, and writing (original draft). QL: Conceptualization and methodology. XJ: Visualization and writing (review and editing). TZ: Formal analysis and data curation. BW: Formal analysis and visualization. NL: Formal analysis and data curation. TD: Conceptualization, software, and supervision. BC: Project administration and funding acquisition. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by Clinical Research Center of Shandong University (No.2020SDUCRCA007) and National Key Research & Development Program of China (2016YFC1302900).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors would like to thank TD for excellent technical support and BC for valuable guidance.

References

1. Nathenson MJ, Ravi V, Fleming ND, Wang WL, Conley AP. Uterine Adenosarcoma: A Review. Curr Oncol Rep (2016) 18(11):68. doi: 10.1007/s11912-016-0552-7

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Nathenson MJ, Conley AP. Prognostic factors for uterine adenosarcoma: A review. Expert Rev Anticancer Ther (2018) 18(11):1093–100. doi: 10.1080/14737140.2018.1518136

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Seagle BLL, Kanis MJ, Strohl AE, Shahabi S. Survival of women with Mullerian adenosarcoma: A National Cancer Data Base study. Gynecol Oncol (2016) 143(3):636–41. doi: 10.1016/j.ygyno.2016.10.013

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Albert A, Lee A, Allbright RM, Vijayakumar S. Primary sarcoma of the cervix: an analysis of patient and tumor characteristics, treatment patterns, and outcomes. J Gynecol Oncol (2020) 31(3):e25. doi: 10.3802/jgo.2020.31.e25

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Li D, Yin N, Du G, Wang S, Xiao Z, Chen J, et al. A Real-World Study on Diagnosis and Treatment of Uterine Sarcoma in Western China. Int J Biol Sci (2020) 16(3):388–95. doi: 10.7150/ijbs.39773

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Ryu SM, Seo SW, Lee S. Novel prognostication of patients with spinal and pelvic chondrosarcoma using deep survival neural networks. BMC Med Inf Decis Making (2020) 20(1):3. doi: 10.1186/s12911-019-1008-4

CrossRef Full Text | Google Scholar

7. Hosh M, Antar S, Nazzal A, Warda M, Gibreel A, Refky B. Uterine Sarcoma: Analysis of 13,089 Cases Based on Surveillance, Epidemiology, and End Results Database. Int J Gynecol Cancer (2016) 26(6):1098–104. doi: 10.1097/IGC.0000000000000720

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Lecun Y, Bengio Y, Hinton GE. Deep learning. Nature (2015) 521(7553):436–44. doi: 10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Bi WL, Hosny A, Schabath MB, Giger ML, Birkbak NJ, Mehrtash A, et al. Artificial intelligence in cancer imaging: Clinical challenges and applications. CA: A Cancer J Clin (2019) 69(2):127–57. doi: 10.3322/caac.21552

CrossRef Full Text | Google Scholar

10. Yin S, Peng Q, Li H, Zhang Z, You X, Fischer K, et al. Multi-instance Deep Learning of Ultrasound Imaging Data for Pattern Classification of Congenital Abnormalities of the Kidney and Urinary Tract in Children. Urology (2020) 142:183–9. doi: 10.1016/j.urology.2020.05.019

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Campanella G, Hanna MG, Geneslaw L, Miraflor AP, Silva VWK, Busam KJ, et al. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images. Nat Med (2019) 25(8):1301–9. doi: 10.1038/s41591-019-0508-1

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Coudray N, Ocampo PS, Sakellaropoulos T, Narula N, Snuderl M, Fenyo D, et al. Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med (2018) 24(10):1559–67. doi: 10.1038/s41591-018-0177-5

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Gao S, Qiu JX, Alawad M, Hinkle J, Schaefferkoetter N, Yoon H, et al. Classifying cancer pathology reports with hierarchical self-attention networks. Artif Intell Med (2019) 101:101726. doi: 10.1016/j.artmed.2019.101726

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Xiao Y, Wu J, Lin Z, Zhao X. A deep learning-based multi-model ensemble method for cancer prediction. Comput Methods Prog Biomed (2018) 153:1–9. doi: 10.1016/j.cmpb.2017.09.005

CrossRef Full Text | Google Scholar

15. Chaudhary K, Poirion O, Lu L, Garmire LX. Deep Learning–Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer. Clin Cancer Res (2017) 24(6):1248–59. doi: 10.1158/1078-0432.CCR-17-0853

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Alakwaa FM, Chaudhary K, Garmire LX. Deep Learning Accurately Predicts Estrogen Receptor Status in Breast Cancer Metabolomics Data. J Proteome Res (2018) 17(1):337–47. doi: 10.1021/acs.jproteome.7b00595

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Pan C, Schoppe O, Parradamas A, Cai R, Todorov MI, Gondi G, et al. Deep Learning Reveals Cancer Metastasis and Therapeutic Antibody Targeting in the Entire Body. Cell (2019) 179(7):1661. doi: 10.1101/541862

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Lynch CM, Abdollahi B, Fuqua JD, De Carlo AR, Bartholomai JA, Balgemann R, et al. Prediction of lung cancer patient survival via supervised machine learning classification techniques. Int J Med Inf (2017) 108:1–8. doi: 10.1016/j.ijmedinf.2017.09.013

CrossRef Full Text | Google Scholar

19. Shukla N, Hagenbuchner M, Win KT, Yang J. Breast cancer data analysis for survivability studies and prediction. Comput Methods Prog Biomed (2018) 155:199–208. doi: 10.1016/j.cmpb.2017.12.011

CrossRef Full Text | Google Scholar

20. Fotso S. Deep Neural Networks for Survival Analysis Based on a Multi-Task Framework. arXiv: Mach Learn (2018) arXiv:1801.05512.

Google Scholar

21. Lin H, Baracos VE, Greiner R, Yu C eds. Learning Patient-Specific Cancer Survival Distributions as a Sequence of Dependent Regressors. In: Neural information processing systems. Curran Associates Inc. (2011). p. 1845–53.

Google Scholar

22. Cao S, Li J, Yang K, Zhang J, Xu J, Feng C, et al. Development and validation of a novel prognostic model for long-term overall survival in liposarcoma patients: a population-based study. J Int Med Res (2020) 48(12):300060520975882. doi: 10.1177/0300060520975882 0300060520975882.

CrossRef Full Text | Google Scholar

23. Chen S, Huang H, Liu Y, Lai C, Peng S, Zhou L, et al. A multi-parametric prognostic model based on clinical features and serological markers predicts overall survival in non-small cell lung cancer patients with chronic hepatitis B viral infection. Cancer Cell Int (2020) 20(1):555. doi: 10.1186/s12935-020-01635-8

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Machida H, Nathenson MJ, Takiuchi T, Adams CL, Garcia-Sayre J, Matsuo K. Significance of lymph node metastasis on survival of women with uterine adenosarcoma. Gynecol Oncol (2017) 144(3):524–30. doi: 10.1016/j.ygyno.2017.01.012

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Ulrich UA, Denschlag D. Uterine Adenosarcoma. Oncol Res Treat (2018) 41(1):693–6. doi: 10.1159/000494067

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Zhang Y, Li Y, Huang H, Yang J, Wu M, Jin Y, et al. Low-Grade Endometrial Stromal Sarcoma and Uterine Adenosarcoma: A Comparison of Clinical Manifestations and Outcomes. J Cancer (2019) 10(15):3352–60. doi: 10.7150/jca.30691

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ebner F, Wiedenmann S, Bekes I, Wolfgang J, De Gregorio N, De Gregorio A. Results of an internal audit on the survival of patients with uterine sarcoma. J Turk Ger Gynecol Assoc (2018) 20(1):15–22. doi: 10.4274/jtgga.galenos.2018.2018.0083

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Zhang Y, Hong Y, Zhuang D, He X, Lin M. Bladder cancer survival nomogram: Development and validation of a prediction tool, using the SEER and TCGA databases. Medicine (Baltimore) (2019) 98(44):e17725. doi: 10.1097/MD.0000000000017725

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yu C, Zhang Y. Development and validation of prognostic nomogram for young patients with gastric cancer. Ann Trans Med (2019) 7(22):641–. doi: 10.21037/atm.2019.10.77

CrossRef Full Text | Google Scholar

30. Skrede O, De Raedt S, Kleppe A, Hveem TS, Liestol K, Maddison J, et al. Deep learning for prediction of colorectal cancer outcome: a discovery and validation study. Lancet (2020) 395(10221):350–60. doi: 10.1016/S0140-6736(19)32998-8

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Saillard C, Schmauch B, Laifa O, Moarii M, Toldo S, Zaslavskiy M, et al. Predicting survival after hepatocellular carcinoma resection using deep-learning on histological slides. Hepatology (2020) 72(6):2000–13. doi: 10.1016/S0168-8278(20)31254-X

CrossRef Full Text | Google Scholar

32. Brooks SE, Zhan M, Cote TR, Baquet CR. Surveillance, Epidemiology, and End Results analysis of 2677 cases of uterine sarcoma 1989-1999. Gynecol Oncol (2004) 93(1):204–8. doi: 10.1016/j.ygyno.2003.12.029

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Carroll AR, Ramirez PT, Westin SN, Soliman PT, Munsell MF, Nick AM, et al. Uterine adenosarcoma: an analysis on management, outcomes, and risk factors for recurrence. Gynecol Oncol (2014) 135(3):455–61. doi: 10.1016/j.ygyno.2014.10.022

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Bernard B, Clarke BA, Malowany JI, Mcalpine JN, Lee C, Atenafu EG, et al. Uterine adenosarcomas: A dual-institution update on staging, prognosis and survival. Gynecol Oncol (2013) 131(3):634–9. doi: 10.1016/j.ygyno.2013.09.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: adenosarcoma, survival prediction, deep learning, artificial intelligence, database, personalized model

Citation: Qu W, Liu Q, Jiao X, Zhang T, Wang B, Li N, Dong T and Cui B (2021) Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study. Front. Oncol. 10:623818. doi: 10.3389/fonc.2020.623818

Received: 30 October 2020; Accepted: 30 December 2020;
Published: 18 February 2021.

Edited by:

Giuseppe Vizzielli, Catholic University of the Sacred Heart, Italy

Reviewed by:

Elena Teodorico, Agostino Gemelli University Polyclinic, Italy
Rita Trozzi, Agostino Gemelli University Polyclinic, Italy

Copyright © 2021 Qu, Liu, Jiao, Zhang, Wang, Li, Dong and Cui. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Baoxia Cui, Y3VpYmFveGlhQDE2My5jb20=; Taotao Dong, c3RldmVuZHR0QDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study

Introduction

Materials and Methods

Data Collection

Data Preparation

Deep Survival Neural Network

Statistical Analyses and Evaluation of Models

Results

Patient Demographics and Characteristics

Cox Proportional Hazard Model

Deep Survival Learning (DSL) Model Building

DSL Model in the Testing Set

Personalized Survival Prediction Using the DSL Model

Discussion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Acknowledgments

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good