ORIGINAL RESEARCH article

Front. Oncol., 29 June 2023
Sec. Cancer Epidemiology and Prevention

An online tool for survival prediction of extrapulmonary small cell carcinoma with random forest

  • ¹Cancer Center, West China Hospital, Sichuan University, Chengdu, China
  • ²State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Collaborative Innovation Center for Biotherapy, Sichuan University, Chengdu, China

Purpose: Extrapulmonary small cell carcinoma (EPSCC) is rare, and knowledge of it is mainly extrapolated from small cell lung carcinoma. Reliable survival prediction tools are lacking.

Methods: A total of 3,921 cases of EPSCC were collected from the Surveillance, Epidemiology, and End Results (SEER) database to form the training and internal validation cohorts of the survival prediction model. The endpoint was overall survival at 0.5–5 years. The internal validation performance of several machine learning algorithms was compared, and the best model was selected. External validation (n = 68) was performed to evaluate the generalization ability of the selected model.

Results: Among the machine learning algorithms, the random forest model performed best on internal validation, with an area under the curve (AUC) of 0.736–0.800. In decision curve analysis, its net benefit was higher than that of the TNM classification. The AUC of this model on the external validation cohort was 0.739–0.811. The model was then deployed online as a free, publicly available prediction tool for EPSCC (http://42.192.80.13:4399/).

Conclusion: This study provides an online survival prediction tool for EPSCC, built with machine learning on large-scale data, that showed good discrimination on internal and external validation. Age, TNM stages, and surgery (including potential performance status information) are the most critical factors for the prediction model.

Introduction

Small cell carcinoma (SCC) is a poorly differentiated neuroendocrine tumor. SCC mainly involves the lungs, and many studies have drawn reliable conclusions about small cell lung carcinoma (SCLC). Extrapulmonary small cell carcinoma (EPSCC) is much rarer, accounting for 2%–4% of all SCC and 0.1%–0.4% of all cancers (1–3). Although EPSCC was first described by Duguid and Kennedy in 1930 (4), data on it remain limited, and most of the understanding of the disease is still extrapolated from SCLC (5). However, according to a population-based study of SCC, the incidence of EPSCC continues to increase (annual percent change = 1.58; p < 0.05) (1). EPSCC therefore deserves more dedicated study.

EPSCC is aggressive, and the median overall survival (OS) is reported to be only 1.2 years (6). However, since EPSCC can arise throughout the body, survival may differ significantly between organs or systems. For example, survival in SCC of the breast is much better than in SCC of the gastrointestinal system. Ochoa et al. reviewed 39 cases of SCC of the breast and reported that OS was 72% at 4 years. Even with stage III disease, 75% of patients were alive after a median follow-up of 17 months (7). With relatively few studies available, it is difficult for oncologists to predict the prognoses of EPSCC patients. To date, there are only a few survival analyses of EPSCC. A prediction model that is built on large-scale data, applies to EPSCC at any site, and performs well would address this gap.

Thus, the aim of this study is to provide a survival prediction model for EPSCC using multiple-site, large-scale data and machine learning algorithms. To increase the clinical value of this study, the selected prediction model was deployed online to help physicians estimate patients’ survival.

Method

Data source

This study is reported in accordance with the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement (8). The checklist can be seen in Table S1 in the Supplementary Files.

Data for the training and internal validation cohorts were collected from the Surveillance, Epidemiology, and End Results (SEER) database (http://seer.cancer.gov/), which contains clinical oncological data from the United States spanning more than 40 years. Data from 17 registries of the United States population during 1975–2019 were downloaded and processed as follows. First, the International Classification of Diseases for Oncology, Third Edition (ICD-O-3) histology codes 8041–8045 were used to select all small cell carcinoma cases. Then the primary site codes C33–C34 (trachea, bronchus, and lung) were used to exclude SCLC. Other exclusion criteria were as follows: (1) incomplete survival information or follow-up periods ≤ 2 months; (2) multiple primary tumors; (3) a primary site outside the “head and neck”, “urinary system”, “digestive system”, “prostate”, “female genital system”, or “breast” (other sites had too few cases). The data were then restricted to cases diagnosed during 2004–2015 to keep the edition of the AJCC stage classification and the accepted treatment regimen (extrapolated from SCLC) consistent. Finally, data from 3,921 cases were included in this study as the training and internal validation cohort (Figure 1).
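
The selection steps above can be sketched as a single filter over a case table. This is only an illustration: the column names (`histology`, `primary_site`, `survival_months`, `n_primaries`, `site_group`, `year_dx`) and the toy rows are assumptions, not the field names of a real SEER export.

```python
import pandas as pd

# Hypothetical column names; a real SEER*Stat export labels these fields differently.
ALLOWED_SITES = {
    "head and neck", "urinary system", "digestive system",
    "prostate", "female genital system", "breast",
}

def select_epscc(cases: pd.DataFrame) -> pd.DataFrame:
    """Apply the inclusion and exclusion steps to a table of cases."""
    is_scc = cases["histology"].between(8041, 8045)            # ICD-O-3 8041-8045
    is_lung = cases["primary_site"].str[:3].isin(["C33", "C34"])
    has_follow_up = cases["survival_months"] > 2               # drops missing and <= 2 months
    single_primary = cases["n_primaries"] == 1
    in_site_group = cases["site_group"].isin(ALLOWED_SITES)
    in_period = cases["year_dx"].between(2004, 2015)
    keep = (is_scc & ~is_lung & has_follow_up
            & single_primary & in_site_group & in_period)
    return cases.loc[keep].reset_index(drop=True)

# Five made-up rows; only the second one passes every step.
demo = pd.DataFrame({
    "histology": [8041, 8041, 8070, 8045, 8042],
    "primary_site": ["C341", "C679", "C679", "C509", "C169"],
    "survival_months": [12, 30, 20, 2, 8],
    "n_primaries": [1, 1, 1, 1, 1],
    "site_group": ["lung", "urinary system", "urinary system",
                   "breast", "digestive system"],
    "year_dx": [2010, 2008, 2009, 2012, 2003],
})
cohort = select_epscc(demo)
print(cohort)
```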

Figure 1 The flowchart for study identification, screening, and inclusion in the training and validation cohort.

External validation data in this study were individual-level data collected from published case reports/series indexed in PubMed or J-Stage. In total, 68 cases from 51 reports published during 2004–2022 were collected and used as the external validation cohort; other reports lacked survival information (Figure 1). Cases from the two sources were analyzed both combined and separately. PubMed-sourced data are based on multiple races, while J-Stage-sourced data are based on an East Asian population different from the SEER cohort, which helps test the generalizability of the prediction model. Details of the external validation cohort are listed in Table S2 in the Supplementary Files (9–39).

Endpoint and variables

The primary endpoint was overall survival, and the related information is from the “Survival months” and “Vital status recode” variables of SEER.
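
As an illustration of how a survival endpoint at a fixed horizon can be derived from these two fields, the sketch below labels each case as alive or dead at a given number of months. The handling of censoring is an assumption (cases censored before the horizon are left unlabeled); the article does not describe how such cases were treated, and the function name is hypothetical.

```python
import numpy as np

def label_at_horizon(months, dead, horizon_months):
    """Return 1.0 (alive at horizon), 0.0 (died before it), or NaN (unknown)."""
    months = np.asarray(months, dtype=float)
    dead = np.asarray(dead, dtype=bool)
    label = np.full(months.shape, np.nan)
    label[months >= horizon_months] = 1.0            # followed up to the horizon
    label[(months < horizon_months) & dead] = 0.0    # death recorded before the horizon
    return label                                     # NaN: censored before the horizon

# Four toy cases evaluated at 12 months.
labels = label_at_horizon(months=[3, 14, 8, 30], dead=[1, 1, 0, 0], horizon_months=12)
print(labels)
```

The third case was last seen alive at 8 months, so its 12-month status is unknown and it stays unlabeled under this assumption.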

The variables collected for each case included age, sex, marital status, race, staging, distant metastasis, treatment, and area information. Marital status was divided into three subgroups: married, single, and others (mainly divorced and widowed). “Race and origin recode (NHW, NHB, NHAIAN, NHAPI, Hispanic)” was collected as race information. Staging data include the tumor node metastasis (TNM) classification (overall stage and separate T, N, and M stages) and the summary stage (in situ, localized, regional, and distant). Metastasis data at bone, brain, liver, and lung (“SEER Combined Mets at DX”) were also included. Treatment information includes whether surgery, radiotherapy, or chemotherapy was received. Data about lymph node resection were also collected. Area information refers to “Median household income inflation adj to 2019” and “Rural-Urban Continuum Code”; the latter was regrouped into “Metropolitan”, “Nonmetropolitan”, and “unknown”. Tumor size and extension were not included because of a large amount of missing data. Site-specific factors were also not included since this study involved multiple sites. External validation data were collected after the model was established. Thus, the variables removed from the final model (race, marital status, and area information) were not collected for that cohort.

Establishment of the prediction model

All data preprocessing was performed in Python (version 3.7.9). Because the machine learning algorithms require numeric input, nominal categorical variables were converted to dummy variables. A random forest regressor was used to impute missing values. The performance of multiple machine learning algorithms was then tested. When data standardization was needed, data were scaled to the range 0–1.
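
A minimal sketch of these three preprocessing steps with pandas and scikit-learn follows. The toy table, its column names, and the forest settings are illustrative assumptions; the article does not give the imputation details.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.preprocessing import MinMaxScaler

# Toy records; column names and values are illustrative, not SEER fields.
raw = pd.DataFrame({
    "age": [45, 62, 70, 58, 66, 74],
    "site": ["breast", "digestive", "urinary", "digestive", "prostate", "urinary"],
    "t_stage": [1.0, 3.0, np.nan, 2.0, 4.0, 3.0],   # one missing value to impute
})

# 1) Nominal categories become dummy (one-hot) columns.
encoded = pd.get_dummies(raw, columns=["site"], dtype=float)

# 2) Fit a random forest regressor on rows where t_stage is known,
#    then predict it for the rows where it is missing.
known = encoded["t_stage"].notna()
predictors = encoded.columns.drop("t_stage")
imputer = RandomForestRegressor(n_estimators=50, random_state=0)
imputer.fit(encoded.loc[known, predictors], encoded.loc[known, "t_stage"])
encoded.loc[~known, "t_stage"] = imputer.predict(encoded.loc[~known, predictors])

# 3) Rescale every column to the 0-1 range for algorithms that need it.
scaled = pd.DataFrame(MinMaxScaler().fit_transform(encoded), columns=encoded.columns)
print(scaled.round(2))
```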

Five machine learning algorithms were explored: random forest, logistic regression, support vector machine (SVM), naïve Bayes, and XGBoost. The machine learning models were established with “scikit-learn” (version 0.24.1) in Python. The algorithm with the best performance was selected for further calibration, evaluation, and deployment.
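
A comparison of this kind can be sketched as a loop over candidate classifiers scored by validation AUC. The data below are synthetic, the settings are defaults rather than the study's, and scikit-learn's `GradientBoostingClassifier` is used as a stand-in for XGBoost so that the example needs only scikit-learn.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC

# Synthetic binary outcome standing in for survival at one horizon.
X, y = make_classification(n_samples=400, n_features=10, random_state=0)
X_tr, X_va, y_tr, y_va = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

candidates = {
    "random forest": RandomForestClassifier(n_estimators=100, random_state=0),
    # Scale-sensitive models get 0-1 scaling inside a pipeline.
    "logistic regression": make_pipeline(MinMaxScaler(), LogisticRegression(max_iter=1000)),
    "SVM": make_pipeline(MinMaxScaler(), SVC(probability=True, random_state=0)),
    "naive Bayes": GaussianNB(),
    # Stand-in for XGBoost (assumption made to avoid an extra dependency).
    "gradient boosting": GradientBoostingClassifier(random_state=0),
}

auc = {}
for name, model in candidates.items():
    model.fit(X_tr, y_tr)
    auc[name] = roc_auc_score(y_va, model.predict_proba(X_va)[:, 1])

best = max(auc, key=auc.get)
print(auc, best)
```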

Before hyper-parameter tuning, the data were randomly split, with stratification, into the training and internal validation cohorts at a ratio of 7:3. Only training cohort data were used to choose the hyper-parameters. The mean results of 10-fold cross-validation were used to find the best hyper-parameter values.
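
The split and tuning procedure can be illustrated with scikit-learn as below. The data are synthetic, and the parameter grid, forest size, and AUC scoring are assumptions chosen to keep the example small; the article does not list the grid that was searched.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, StratifiedKFold, train_test_split

# Synthetic stand-in for a binary survival label; not SEER data.
X, y = make_classification(n_samples=300, n_features=8, random_state=0)

# Stratified 7:3 split into training and internal validation cohorts.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Hyper-parameters are chosen on the training cohort only,
# using the mean score over 10 cross-validation folds.
search = GridSearchCV(
    RandomForestClassifier(n_estimators=30, random_state=0),
    param_grid={"max_depth": [3, 6], "min_samples_leaf": [1, 5]},  # illustrative grid
    scoring="roc_auc",
    cv=StratifiedKFold(n_splits=10, shuffle=True, random_state=0),
)
search.fit(X_train, y_train)
best_model = search.best_estimator_
print(search.best_params_, round(search.best_score_, 3))
```

Keeping the validation cohort out of `search.fit` is what allows it to serve later as an unbiased internal check.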

The hyper-parameters of the random forest algorithm can limit overfitting. Nevertheless, feature selection was performed to make the model easier to use. Area information, race, and marital status were excluded because they contributed little to the prediction model (low Gini coefficients, i.e., low feature importance). Summary stage, distant metastasis at the liver, and distant metastasis at the brain were excluded because their Spearman correlation coefficients with other variables were >0.75. Thirteen variables remained in the final prediction model.
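
The two selection criteria can be sketched as follows: compute impurity-based (Gini) importances from a fitted forest, then flag feature pairs whose absolute Spearman correlation exceeds 0.75. The synthetic features are assumptions, and so is the rule of dropping the less important member of each correlated pair; the article does not state how the member to drop was chosen.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 400
stage = rng.integers(1, 5, n)
features = pd.DataFrame({
    "age": rng.integers(30, 90, n),
    "overall_stage": stage,
    "summary_stage": stage + rng.integers(0, 2, n),  # nearly duplicates overall_stage
    "noise": rng.normal(size=n),
})
# Synthetic outcome driven by age and stage.
died = (0.03 * features["age"] + 0.8 * stage + rng.normal(size=n) > 4.5).astype(int)

forest = RandomForestClassifier(n_estimators=100, random_state=0).fit(features, died)
importance = pd.Series(forest.feature_importances_, index=features.columns)

# For each pair with |Spearman rho| > 0.75, drop the less important feature.
rho = features.corr(method="spearman").abs()
cols = list(features.columns)
redundant = set()
for i, a in enumerate(cols):
    for b in cols[i + 1:]:
        if rho.loc[a, b] > 0.75:
            redundant.add(a if importance[a] < importance[b] else b)

selected = [c for c in cols if c not in redundant]
print(importance.round(3).to_dict(), selected)
```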

Evaluation of the prediction model

Three types of curves were used to evaluate and present model performance: the receiver operating characteristic (ROC) curve, the calibration curve, and the decision curve. The ROC curve evaluates the discrimination between alive and dead cases. The area under the ROC curve (AUC), which equals the C-index in a binary classification problem, allows quantitative comparison. The calibration curve evaluates the agreement between predicted and actual survival. Finally, decision curve analysis (DCA) shows whether the models in this study provide a higher net benefit than the TNM staging system. Curves were drawn with Matplotlib in Python.

The training and internal validation cohorts were evaluated at 0.5–5 years. Because the external validation cohort is small, it contains insufficient data with long follow-up periods, so only 0.5- and 1-year overall survival were tested in that cohort.
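
For readers unfamiliar with decision curves, the quantity plotted on the vertical axis is the net benefit at a threshold probability p_t, commonly defined as TP/n − (FP/n) · p_t/(1 − p_t). The sketch below computes it for one threshold on made-up values; the function name and the toy data are assumptions, and it is not the code used in the study.

```python
import numpy as np

def net_benefit(y_true, y_prob, threshold):
    """Net benefit of acting on cases whose predicted risk is >= threshold."""
    y_true = np.asarray(y_true)
    flagged = np.asarray(y_prob) >= threshold
    n = len(y_true)
    tp = np.sum(flagged & (y_true == 1))   # flagged cases that had the event
    fp = np.sum(flagged & (y_true == 0))   # flagged cases that did not
    return tp / n - fp / n * threshold / (1 - threshold)

# 1 = event (death within the horizon); probabilities are made up.
y_true = [1, 1, 0, 0, 1, 0, 0, 1]
y_prob = [0.9, 0.7, 0.4, 0.2, 0.6, 0.1, 0.55, 0.3]

nb_model = net_benefit(y_true, y_prob, 0.5)          # 3 TP, 1 FP -> 0.25
nb_treat_all = net_benefit(y_true, np.ones(8), 0.5)  # 4 TP, 4 FP -> 0.0
print(nb_model, nb_treat_all)
```

A decision curve repeats this calculation over a range of thresholds and compares the model with reference strategies such as flagging every case or none.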

Interpretability

To gain insight into why the prediction model outperforms the AJCC staging system, a risk-group classification was explored in this study. Risk groups were assigned according to the percentile of each case’s predicted survival probability within the entire external validation cohort. To facilitate comparison with the AJCC staging system, which has stages I–IV, four risk groups were defined. Since the median survival of EPSCC is 1.2 years, as mentioned before, the survival probability used for the risk groups was mainly the predicted 1-year OS. The Kaplan–Meier (K-M) curve of the risk groups was compared with that of the AJCC stages. Meanwhile, the random forest model’s feature importance (Gini coefficients) was exported and displayed in a heatmap.
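
A percentile-based grouping of this kind can be sketched with `pandas.qcut`. The predicted probabilities below are made up, and the use of equal-sized quartiles and these group labels is an assumption; the article states only that four groups were defined by percentile.

```python
import pandas as pd

# Hypothetical predicted 1-year OS probabilities for eight cases.
pred_1y_os = pd.Series([0.82, 0.15, 0.47, 0.63, 0.08, 0.71, 0.35, 0.55])

# Lowest predicted survival falls in the first bin, i.e. the highest-risk group.
risk_group = pd.qcut(
    pred_1y_os, q=4, labels=["high", "medium-high", "medium-low", "low"]
)
print(pd.DataFrame({"pred_1y_os": pred_1y_os, "risk_group": risk_group}))
```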

Deployment

Since the machine learning model performed better than the AJCC staging system, an interactive website was built around the selected model to give this study greater clinical value. After clinical information is entered, overall survival at 0.5–5 years is calculated automatically and displayed graphically.

The website is built on Django 2.2.28 (a Python web framework) and hosted on an elastic computing service provided by Tencent Cloud (Shenzhen, China).

Statistical analysis

SPSS Statistics 26 was used for data description, Kaplan–Meier survival analysis, and Cox regression analysis. Categorical variables were expressed as frequency (N) and percentage (%). One-year OS and median survival months were expressed as mean ± standard error or median ± quartile. A 95% confidence interval (CI) was provided for the HR of each variable. Student’s t-test and the Mann–Whitney U test were used for statistical comparisons. A p-value <0.05 was considered statistically significant.
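
The study ran these analyses in SPSS. For readers who want to see what a Kaplan–Meier estimate computes, the product-limit estimator can be written in a few lines of Python; the function name and the six toy follow-up records are assumptions for illustration only.

```python
import numpy as np

def kaplan_meier(months, dead):
    """Return (event_times, survival) from the product-limit estimator."""
    months = np.asarray(months, dtype=float)
    dead = np.asarray(dead, dtype=bool)
    event_times = np.unique(months[dead])
    survival, s = [], 1.0
    for t in event_times:
        at_risk = np.sum(months >= t)             # still under observation at t
        deaths = np.sum((months == t) & dead)     # deaths occurring exactly at t
        s *= 1.0 - deaths / at_risk
        survival.append(s)
    return event_times, np.array(survival)

# Follow-up in months; dead = 0 marks a censored case.
times, surv = kaplan_meier(months=[2, 3, 3, 5, 8, 12], dead=[1, 1, 0, 1, 0, 1])
print(dict(zip(times, surv.round(3))))
```

Censored cases lower the number at risk at later times without producing a step in the curve, which is why the estimate differs from a simple proportion of survivors.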

Results

Patient characteristics and survival analysis

The characteristics of patients in the SEER data, the primary data source of this study, are shown in Table 1. The SEER cohort includes 3,921 patients. The overall 1-year OS is 43.9% ± 0.8%. The median age group is 65–69 years, and 58.0% of patients are male. The most commonly involved sites of SCC in the SEER data are the urinary system (32.2%), the digestive system (30.9%), and the female genital system (15.4%). At diagnosis, 43.3% of patients (excluding unstaged cases) had distant metastasis, and 47.4% had TNM stage IV disease. In line with the treatment of SCLC, most patients received chemotherapy. However, a large number of patients did not receive surgery or radiotherapy. The training and internal validation cohorts were divided by stratified random sampling; the comparison between them is shown in Table S3, together with the characteristics of the external validation cohort (n = 68).

Table 1 Patient characteristics of SEER data and HR results of Cox regression analyses.

HRs of survival were calculated and are listed in Table 1. Compared with elderly patients, middle-aged patients have better survival (HR = 0.73, 95% CI 0.67–0.80), and patients younger than 45 years have the best survival (HR = 0.39–0.49). Male (HR = 1.15, 95% CI 1.08–1.23) or “divorced or widowed” (HR = 1.27, 95% CI 1.18–1.38) patients have significantly shorter survival than female or “married or single” patients. Hispanic patients (HR = 0.86, 95% CI 0.76–0.97) have significantly better survival than patients of other races. Among sites, SCC of the digestive system has the worst prognosis: its 1-year OS is 33.0% ± 1.4%, significantly worse than that of all other sites. SCC of the breast has the best prognosis, with a 1-year OS of 64.5% ± 4.8%. Surgery (HR = 0.44–0.66), chemotherapy (HR = 0.52, 95% CI 0.48–0.56), and radiotherapy (HR = 0.61, 95% CI 0.56–0.65) are all associated with significantly longer survival. With efficient lymph node resection, the HR of surgery (HR = 0.44, 95% CI 0.40–0.48) can be even lower than that of chemotherapy.

Model evaluation

Machine learning models based on different algorithms were established and tested with the internal validation data. The algorithms and their performance are listed in Table S4. Algorithms based on decision trees (random forest and XGBoost) have better AUC than other algorithms, and the AUC of random forest is the best.

After feature selection and calibration, the random forest model was evaluated with ROC curves, calibration curves, and DCA on all three cohorts (Figure 2). On the internal validation cohort, the AUCs at 0.5–5 years range from 0.736 to 0.800, showing good discrimination ability (Figure 2B). The calibration curves are all near the diagonal (Figure 2E), and the mean predicted probabilities are also reliable. Because there are not sufficient external validation data with long follow-up periods, only 0.5- and 1-year overall survival were tested in the external validation cohort, with AUCs of 0.758 and 0.790, respectively. Since the data of the external validation cohort were collected from both PubMed and J-Stage, the AUCs of the two sources were also calculated separately. The J-Stage patients are all East Asian, unlike the population of the SEER database. Thus, the good AUCs in J-Stage patients also support the generalization ability of this prediction model (Figure 2C). Figures 2G–L show that the random forest model has a better net benefit than the AJCC staging system in all 0.5- to 5-year survival predictions.

Figure 2 Evaluation of the random forest model. Three methods were used, namely, ROC curves (A–C), calibration curves (D–F), and decision curves (G–L). The training cohort (A, D), internal validation cohort (B, E, G–L), and external validation cohort (C, F) were all used to evaluate the performance of the model. AUCs show good discrimination ability of the model. Decision curves of the internal validation cohort show that the model is better than the AJCC TNM staging system at all horizons from 0.5 to 5 years.

Interpretability

To understand the model more intuitively, patients were divided into four risk groups according to the predicted probability, and the resulting K-M curve is shown in Figure 3A. Compared with the K-M curve of the TNM overall stage (Figure 3B), the model divides the cohort better, because the survival of TNM stages II and III is almost the same.

Figure 3 Direct display of the model. (A) After the SEER cohort is divided into four risk groups by predicted probability, survival clearly differs among the four groups. (B) Under the TNM staging system, stage II and stage III have similar survival. (C) The heatmap of the feature importance (Gini coefficient) in the prediction model.

Figure 3C shows the feature importance (Gini coefficients). Features with higher values and deeper colors are more important to the model. Age, sites, overall stage, T stage, M stage, and surgery (including node resection information) considerably influence the predictions at different time horizons. Among them, age, TNM stages, and surgery have the highest importance. Chemotherapy has a large influence on 0.5-year survival. However, surgery and node resection (which may also carry information about better performance status and stage) may influence predictions over longer survival periods.

Random forest prediction website

An interactive online website was deployed on the server. Physicians can access the website via http://42.192.80.13:4399/. After entering the required information, prediction result plots will be automatically displayed. Examples can be seen in Figure 4.

Figure 4 Interface and an example of the prediction website. (A, B) The interface for entering information; (C) predicted results at 0.5–5 years displayed as a K-M curve (the entered example data: 56-year-old female patient with T4N2M0, overall stage III esophageal EPSCC; chemotherapy and radiotherapy were received but surgery was not performed).

Discussion

Despite the increasing incidence of EPSCC (1), data and research on EPSCC are still limited due to its rarity. Existing studies are mainly case reports/series or single-center retrospective analyses. The treatment of EPSCC has been extrapolated from that of SCLC or other neuroendocrine tumors since they have similar pathological characteristics. However, different organs have different tumor features, which result in different prognoses. In Canadian data reported by Haider et al., the median OS of gastrointestinal SCC is 4.4 months, while that of breast and gynecological SCC ranges from 40.9 to 54.4 months (40). It therefore remains difficult to estimate how long a specific EPSCC patient can survive. There is still no survival model able to predict EPSCC survival throughout the body. Only seven nomograms have been constructed, each on single-organ, small-scale data (41–47), and none of them has external validation. Thus, this study explores predicting the survival of EPSCC throughout the body with a machine-learning ensemble algorithm and large-scale data. Furthermore, external validation was also performed.

“Machine learning” is the computational process of imitating the human ability to recognize patterns in data and can be used in multiple fields of medicine, including disease diagnosis, prognosis prediction, and screening of potential molecular targets (48). For instance, it has been used to predict the diagnosis and prognosis of high-grade B-cell lymphoma from clinical data (49). It has also been combined with mRNA expression data to find new biomarkers of adrenocortical tumors (50). Machine learning models can perform well, and their performance generally improves as more data are provided. Considering that EPSCC is a rare tumor, only extensive databases can yield a sufficient number of cases. Thus, the SEER database was chosen as the primary data source to compensate for the limited data. Since many tumor databases do not contain data on rare tumors, the SEER database and machine learning are a well-suited combination for research on rare tumors like EPSCC.

Table 1 displays the HRs of the included factors. Age, sex, race (Hispanic or not), marital status (divorced/widowed or not), primary site, stage, surgery, chemotherapy, and radiotherapy are all independent factors. Younger age, female sex, Hispanic origin, not being divorced or widowed, early stage, and receiving surgery, chemotherapy, or radiotherapy correlate with better prognosis. This is similar to the result provided by Mandish et al., in which, however, sex was not significant (6). SCCs of the breast and head and neck have better survival, while SCCs of the digestive system and prostate have poor survival in the present study. Similarly, according to Mandish et al., SCCs of the head/neck and breast have a better prognosis, and gastrointestinal SCCs have the worst prognosis (6). It is worth noting that there might be unrevealed prognostic factors because the SEER database lacks certain variables. For example, the ECOG performance status (PS) score has been found to be a good prognostic factor in multiple cancers, especially in elderly patients. However, PS scores are not included in SEER and could not be assessed as prognostic factors in this study. Nevertheless, in this retrospective analysis, age and whether surgery was received also partly reflect performance status, considering that surgeons do not perform surgery on patients with poor PS scores. Thus, the effect of the missing PS scores on prediction performance is likely limited. Additionally, although Hispanic origin was found to be correlated with better prognosis, the SEER database has a predominantly Caucasian population, whereas other races such as African Americans or Asians are underrepresented. Thus, the established prediction model was evaluated with external validation, but further validation in more races is still needed.

In the evaluation of the established prediction model, survival prediction for multiple-site EPSCC throughout the body also performed well (Figure 2) compared with the nomograms of the seven single-site EPSCC studies, whose AUCs range from 0.656 to 0.75 (41–47). This may be because of the shared pathological nature, the large sample size, and the merits of the decision tree-based algorithm. Thus, EPSCC of multiple sites was included in this study to increase the applicable range of the established model. After the performance of several machine learning algorithms was compared, the random forest model was found to perform best on internal validation data. This is consistent with a study that evaluated 179 classifiers on 121 real-world datasets and found that random forest was the most likely to be the best (51).

Compared with nomograms, a drawback of random forest models is that they are less intuitive. Thus, the Gini coefficients of the included factors are given in Figure 3C. Age, TNM overall stages, and surgery (including potential performance status information) are the most critical factors for the prediction model, while sex makes only a minor contribution. Chemotherapy is important for predicting whether patients survive the first 6 months but appears to contribute less after 1 year. In the K-M curves of the risk groups given by the prediction model (Figure 3A), the patients are well divided into subgroups with different prognoses. The model performs better than the traditional TNM classification (Figure 3B), indicating heterogeneity within stages II and III and the limitations of the TNM classification.

Many studies of machine learning prediction models only provide evaluation results and show that the method is feasible and reliable. Nevertheless, such models are difficult to use in clinical work. Thus, to further expand the clinical value of this study, the model was deployed as a website (http://42.192.80.13:4399/). An example can be seen in Figure 4.

Despite its merits, this study has limitations. First, it is a retrospective study, which means selection bias is inevitable. Second, EPSCC is a rare type of tumor, so the sample size of the external validation cohort is relatively small. Third, some important data are lacking in the SEER database, and since EPSCC is rare, it is also not included in most large cancer databases such as The Cancer Genome Atlas (TCGA).

Two kinds of variables are lacking in the SEER database. The first is immunohistochemistry (IHC) staining or molecular test results. For example, using their original data, Ferro et al. found that Ki-67 <55% indicates poor prognosis, but only in metastatic EPSCC (52). In collecting the external validation cohort, it was found that chromogranin A, EMA, and TTF1 were commonly stained, with positive rates in the range 0.60–0.66, which suggests potential predictive value and might be a reference for further investigation. However, there are no IHC data in the SEER database, which limits further analyses. The second is tobacco use and performance status, which the SEER database also does not include. Performance status in the present study can only be considered indirectly through other variables such as age and surgery records, which might lessen the accuracy of the analyses. In addition, the races in the SEER database should be more balanced. If the currently missing variables can be collected from a future SEER database or other population-based tumor databases, this machine learning prediction model would be expected to perform better.

Conclusions

With large-scale data and machine learning, a well-performing prediction model for EPSCC was constructed and deployed online for clinical use. Age, TNM stages, and surgery (including potential performance status information) are the most critical factors for the prediction model.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics statement

This is an observational study based on the SEER public database. No ethical approval, consent to participate, or consent to publish is required.

Author contributions

The author confirms being the sole contributor of this work and has approved it for publication.

Acknowledgments

Thanks to my colleague Guihong Liu who inspired me to focus on small cell carcinoma.

Conflict of interest

The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1166424/full#supplementary-material

Abbreviations

AJCC, American Joint Committee on Cancer; AUC, area under curve; CI, confidence interval; DCA, decision curve analysis; EPSCC, extrapulmonary small cell carcinoma; IHC, immunohistochemistry; K-M, Kaplan–Meier (curve); OS, overall survival; ROC, receiver operating characteristic (curve); SCC, small cell carcinoma; SCLC, small cell lung carcinoma; SEER, the Surveillance Epidemiology and End Results (database); SVM, support vector machine (algorithm); TNM, tumor node metastasis (stage classification).

References

1. Dores GM, Qubaiah O, Mody A, Ghabach B, Devesa SS. A population-based study of incidence and patient survival of small cell carcinoma in the United States, 1992-2010. BMC Cancer (2015) 15:185. doi: 10.1186/s12885-015-1188-y

2. Levenson RM, Ihde DC, Matthews MJ, Cohen MH, Gazdar AF, Bunn PA Jr, et al. Small cell carcinoma presenting as an extrapulmonary neoplasm: sites of origin and response to chemotherapy. J Natl Cancer Inst (1981) 67(3):607–12.

3. Remick SC, Ruckdeschel JC. Extrapulmonary and pulmonary small-cell carcinoma: tumor biology, therapy, and outcome. Med Pediatr Oncol (1992) 20(2):89–99. doi: 10.1002/mpo.2950200202

4. Duguid JB, Kennedy AM. Oat-cell tumours of mediastinal glands. J Pathol Bacteriology (1930) 33(1):93–9. doi: 10.1002/path.1700330109

5. Kouba EJ, Cheng L. Understanding the genetic landscape of small cell carcinoma of the urinary bladder and implications for diagnosis, prognosis, and treatment: a review. JAMA Oncol (2017) 3(11):1570–8. doi: 10.1001/jamaoncol.2016.7013

6. Mandish SF, Gaskins JT, Yusuf MB, Little BP, Dunlap NE. Extrapulmonary small cell carcinoma: prognostic factors, patterns of care, and overall survival. Eur J Surg Oncol (2020) 46(9):1596–604. doi: 10.1016/j.ejso.2020.04.017

7. Ochoa R, Sudhindra A, Garcia-Buitrago M, Romilly AP, Cortes J, Gomez H, et al. Small-cell cancer of the breast: what is the optimal treatment? a report and review of outcomes. Clin Breast Cancer (2012) 12(4):287–92. doi: 10.1016/j.clbc.2012.03.007

8. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Bmj (2015) 350:g7594. doi: 10.1161/CIRCULATIONAHA.114.014508

9. Abusnina W, Auyoung EY, Megri M, Pacioles T. Small cell carcinoma of prostate: a case report of a patient with concomitant transitional cell cancer of the bladder. J Investig Med High Impact Case Rep (2018) 6:2324709618760644. doi: 10.1177/2324709618760644

10. Acosta AM, Hamedani FS, Meeks JJ, Wu S. Primary ureteral thyroid transcription factor 1-positive small cell neuroendocrine carcinoma: case report and review of the literature. Int J Surg Pathol (2015) 23(6):472–7. doi: 10.1177/1066896915594882

11. Ahmad I, Singh P, Bhatt CP, Bashir I. Case of metastatic small cell carcinoma of the oropharynx successfully treated with image-guided volumetric modulated arc therapy (IG-VMAT): pushing the limits of technology to match treatment intent. BMJ Case Rep (2018) 11(1). doi: 10.1136/bcr-2018-226522

12. Al Masalmeh N, Kukreja G, Zaiem F, Raza SN, Kim H, Nagasaka M, et al. p16 positive oropharyngeal small cell cancer: a case report. Oral Oncol (2021) 121:105391. doi: 10.1016/j.oraloncology.2021.105391

13. Brown JR, Wieczorek TJ, Shaffer K, Salgia R. Small-cell cancers, and an unusual reaction to chemotherapy. case 1. extrapulmonary small-cell carcinoma arising in the prostate. J Clin Oncol (2003) 21(12):2437–8. doi: 10.1200/JCO.2003.081.03

14. Cha RR, Cho JK, Kim WS, Kim JJ, Lee JM, Lee SS, et al. Primary gastric small cell carcinoma (Presenting as linitis plastica) diagnosed using endoscopic ultrasound-guided biopsy: a case report. Clin Endosc (2019) 52(3):278–82. doi: 10.5946/ce.2018.114

15. Chen Y, Wang C, Wang HM, Li J, Zhu HY. Primary small cell carcinoma with neuroendocrine properties of the mandible: a case report and literature review. J Dent Sci (2017) 12(4):413–6. doi: 10.1016/j.jds.2012.03.020

16. Chuang J, Wang C, Parekh V, Fakih M. Locally advanced anal small cell carcinoma with durable complete response to chemoradiation followed by consolidation chemotherapy: case report and literature review. J Gastrointest Oncol (2021) 12(6):3148–54. doi: 10.21037/jgo-21-434

17. Eberhardt JM, Brown K, Lo S, Nagda S, Yong S. Extrapulmonary small cell carcinoma of the anal canal: a case report and review of the literature. Case Rep Med (2012) 2012. doi: 10.1155/2012/341432

18. González-Lois C, Madero S, Redondo P, Alonso I, Salas A, Angeles Montalbán M, et al. Small cell carcinoma of the kidney: a case report and review of the literature. Arch Pathol Lab Med (2001) 125(6):796–8. doi: 10.5858/2001-125-0796-SCCOTK

19. Hussein AM, Feun LG, Sridhar KS, Otrakji CL, Garcia-Moore M, Benedetto P, et al. Small cell carcinoma of the large intestine presenting as central nervous systems signs and symptoms. Two case reports with literature review. J Neurooncol (1990) 8(3):269–74. doi: 10.1007/BF00177360

20. Ikegami T, Kayashima H, Sadanaga N, Morizono S, Nakashima A, Matsuura H, et al. Composite small cell and mucinous carcinoma originating from the intrahepatic bile duct: report of a case. Surg Today (2013) 43(2):194–8. doi: 10.1007/s00595-012-0331-5

21. Isaka T, Maruno M, Sato M, Kinoshita M, Nishida T, Kiyohara H, et al. Brain metastasis from small-cell neuroendocrine carcinoma of the urinary bladder: a case report. Brain Tumor Pathol (2002) 19(2):117–22. doi: 10.1007/BF02478938

22. Jang H, Yuk SM, Kim JO, Han DS. A rare case of primary malignant small cell carcinoma combined with urothelial cell carcinoma in the ureter. World J Surg Oncol (2013) 11(1):181. doi: 10.1186/1477-7819-11-181

23. Kamiya M, Sato Y, Matsumoto H, Sasaki H, Takahashi S, Yamada S, et al. A case of uterine cervical adenocarcinoma associated with small cell carcinoma. J Obstet Gynaecol Res (2008) 34(6):1054–60. doi: 10.1111/j.1447-0756.2008.00828.x

24. Kim WS, Lee DG. Primary mixed adenocarcinoma and small-cell carcinoma of appendix: a case report (CARE-compliant). Med (Baltimore) (2019) 98(19):e15614. doi: 10.1097/MD.0000000000015614

25. Lee HY, Wu WJ, Tsai KB, Shen JT, Jang MY, Wang HS, et al. Primary small cell carcinoma of kidney after renal transplantation: a case report and literature review. Chin J Cancer Res (2013) 25(5):608–11. doi: 10.3978/j.issn.1000-9604.2013.10.07

26. Madroszyk A, Egreteau J, Martin L, Queneau PE, Bosset JF, Merrouche Y, et al. Small-cell carcinoma of the esophagus: report of three cases and review of the literature with emphasis on therapy. Ann Oncol (2001) 12(9):1321–5. doi: 10.1023/A:1012221708372

27. Meinhardt AL, Uppuluri A, Shkolnik E, Chang VT. Extrapulmonary small cell carcinoma presenting as an orbital mass: a case report. Cureus (2022) 14(6):e26012. doi: 10.7759/cureus.26012

28. Miyake M, Fujimoto K, Tanaka M, Matsushita C, Tanaka N, Hirao Y, et al. A case of small cell carcinoma of the kidney. Hinyokika Kiyo (2007) 53(4):235–40.

29. Morikawa H, Nakayama Y, Maeda T, Nadatani Y, Kobayashi S, Iwai S, et al. A case of primary small cell carcinoma of the liver that was treated with chemotherapy. Hepatol Int (2008) 2(4):500–4. doi: 10.1007/s12072-008-9090-1

30. Nakahara S, Takemoto N, Inohara H. Small cell carcinoma of the larynx treated by concurrent chemoradiotherapy: a case report. Case Rep Otolaryngol (2012) 2012:316165. doi: 10.1155/2012/316165

31. Patil S, Kaza RC, Kakkar AK, Chamberlain RS. Small cell carcinoma of the renal pelvis: a case report and review of the literature. ISRN Urol (2011) 2011:786505. doi: 10.5402/2011/786505

32. Reckova M, Mego M, Rejlekova K, Sycova-Mila Z, Obertova Z, Mardiak J. Small-cell carcinoma of the ovary with breast metastases: a case report. Klin Onkol (2010) 23(1):43–5.

33. Sahai P, Baghmar S, Nath D, Arora S, Bhasker S, Gogia A, et al. Extrapulmonary small cell carcinoma - a case series of oropharyngeal and esophageal primary sites treated with chemo-radiotherapy. Asian Pac J Cancer Prev (2015) 16(16):7025–9. doi: 10.7314/APJCP.2015.16.16.7025

34. Segawa Y, Nakashima T, Shiratsuchi H, Tanaka R, Mitsugi K, Komune S. Small cell carcinoma of the tonsil treated with irinotecan and cisplatin: a case report and literature review. Case Rep Oncol (2011) 4(3):587–91. doi: 10.1159/000335218

35. Spiliopoulou P, Panwar U, Davidson N. Rectal small cell carcinoma: a case report and review of the literature. Case Rep Oncol (2011) 4(3):475–80. doi: 10.1159/000332760

36. Sugawara G, Yamaguchi A, Isogai M, Watanabe Y, Kaneoka Y, Suzuki M. Small cell neuroendocrine carcinoma of the ampulla of Vater with foci of squamous differentiation: a case report. J Hepatobiliary Pancreat Surg (2004) 11(1):56–60. doi: 10.1007/s00534-002-0840-5

37. Tombet CA, Aynaou M, Mhanna T, El Houmaidi A, Achraf M, Barki A, et al. Low back pain revealing a primary small cell neuroendocrine carcinoma of the upper urinary tract: a case report and review of the literature. Urol Case Rep (2020) 33:101338. doi: 10.1016/j.eucr.2020.101338

38. Wang J, Liu D, Meng Y, Guo Y, Zhao M. Extrapulmonary small cell carcinoma of the external auditory canal: a case report and review of the literature. J Int Med Res (2020) 48(8):300060520946517. doi: 10.1177/0300060520946517

39. Yeung HM, Jeurkar C, Pompa T, Styler M. Small cell cancer of the genitourinary tract: a case report and review of the literature. Case Rep Oncol (2017) 10(2):489–92. doi: 10.1159/000477435

40. Haider K, Shahid RK, Finch D, Sami A, Ahmad I, Yadav S, et al. Extrapulmonary small cell cancer: a Canadian province’s experience. Cancer (2006) 107(9):2262–9. doi: 10.1002/cncr.22235

41. Huang R, Gan Q, Cheng J. Prognostic factors and local treatment modalities of small-cell carcinoma of the cervix: an analysis according to the International Federation of Gynecology and Obstetrics stage. Cancer Manag Res (2020) 12:3445–56. doi: 10.2147/CMAR.S247081

42. Chen H, Liu CT, Hong CQ, Chu LY, Huang XY, Wei LF, et al. Nomogram based on nutritional and inflammatory indicators for survival prediction of small cell carcinoma of the esophagus. Nutrition (2021) 84:111086. doi: 10.1016/j.nut.2020.111086

43. Qie S, Wang XF, Ran YG, Liu ML, Cui GM, Shi HYB, et al. Nomogram for predicting the survival of patients with small cell carcinoma of the esophagus: a population study based on the Surveillance, Epidemiology, and End Results database. Med (Baltimore) (2021) 100(15):e25427. doi: 10.1097/MD.0000000000025427

44. Dong F, Shen Y, Gao F, Shi X, Xu T, Wang X, et al. Nomograms to predict individual prognosis of patients with primary small cell carcinoma of the bladder. J Cancer (2018) 9(7):1152–64. doi: 10.7150/jca.23344

45. Li J, Yu H, Peng L, Li L, Wang X, Hao J, et al. Novel nomogram predicting cancer-specific survival and overall survival in patients with primary esophageal small-cell carcinoma: a Surveillance, Epidemiology, and End Results-based study. J Cancer Res Ther (2021) 17(3):630–7. doi: 10.4103/jcrt.JCRT_1612_20

46. Yu CX, Yibulayin F, Feng L, Wang M, Lu MM, Luo Y, et al. Clinicopathological characteristics, treatment and prognosis of head & neck small cell carcinoma: a SEER population-based study. BMC Cancer (2020) 20(1):1208. doi: 10.1186/s12885-020-07522-9

47. Zhang DY, Huang GR, Ku JW, Zhao XK, Song X, Xu RH, et al. Development and validation of a prognostic nomogram model for Chinese patients with primary small cell carcinoma of the esophagus. World J Clin Cases (2021) 9(30):9011–22. doi: 10.12998/wjcc.v9.i30.9011

48. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol (2022) 23(1):40–55. doi: 10.1038/s41580-021-00407-0

49. Kong H, Zhu H, Zheng X, Jiang M, Chen L, Lan L, et al. Machine learning models for the diagnosis and prognosis prediction of high-grade B-cell lymphoma. Front Immunol (2022) 13:919012. doi: 10.3389/fimmu.2022.919012

50. Marquardt A, Landwehr LS, Ronchi CL, Dalmazi GD, Riester A, Kollmannsberger P, et al. Identifying new potential biomarkers in adrenocortical tumors based on mRNA expression data using machine learning. Cancers (Basel) (2021) 13(18):4671. doi: 10.3390/cancers13184671

51. Fernández-Delgado M, Cernadas E, Barro S, Amorim D. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res (2014) 15(1):3133–81.

52. Brazg Ferro L, Wolf I, Peleg Hasson S, Golomb I, Osher E, Berlin A, et al. Extrapulmonary small cell cancer: a new insight into a rare disease. Oncology (2021) 99(6):373–9. doi: 10.1159/000514520

Keywords: machine learning, SEER database, extrapulmonary small cell carcinoma, survival, online tool

Citation: Zhang X (2023) An online tool for survival prediction of extrapulmonary small cell carcinoma with random forest. Front. Oncol. 13:1166424. doi: 10.3389/fonc.2023.1166424

Received: 15 February 2023; Accepted: 12 June 2023;
Published: 29 June 2023.

Edited by:

Eleonora Volpato, Fondazione Don Carlo Gnocchi Onlus (IRCCS), Italy

Reviewed by:

Jingwen Yan, Peking Union Medical College Hospital (CAMS), China
Changlin Gong, Jacobi Medical Center, United States
Helmut H. Popper, Medical University of Graz, Austria
Ryota Kurimoto, Tokyo Medical and Dental University, Japan

Copyright © 2023 Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xin Zhang, zhang-xin13@foxmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.