Corrigendum: Clinical radiomics-based machine learning versus three-dimension convolutional neural network analysis for differentiation of thymic epithelial tumors from other prevascular mediastinal tumors on chest computed tomography scan
- 1Division of Thoracic Surgery, Department of Surgery, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- 2Division of Thoracic Surgery, Department of Surgery, Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan
- 3School of Medicine for International Students, College of Medicine, I-Shou University, Kaohsiung, Taiwan
- 4Division of Chest Medicine, Department of Internal Medicine, E-Da Cancer Hospital, Kaohsiung, Taiwan
- 5Department of Medical Imaging, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- 6Department of Radiology, Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan
- 7Faculty of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- 8Institute of Education, National Sun Yat-sen University, Kaohsiung, Taiwan
- 9School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- 10Institute of Clinical Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- 11Division of Trauma and Acute Care Surgery, Department of Surgery, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- 12Department of Statistics and Institute of Data Science, National Cheng Kung University, Tainan, Taiwan
Purpose: To compare the diagnostic performance of radiomic analysis with machine learning (ML) model with a convolutional neural network (CNN) in differentiating thymic epithelial tumors (TETs) from other prevascular mediastinal tumors (PMTs).
Methods: A retrospective study was performed in patients with PMTs and undergoing surgical resection or biopsy in National Cheng Kung University Hospital, Tainan, Taiwan, E-Da Hospital, Kaohsiung, Taiwan, and Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan between January 2010 and December 2019. Clinical data including age, sex, myasthenia gravis (MG) symptoms and pathologic diagnosis were collected. The datasets were divided into UECT (unenhanced computed tomography) and CECT (enhanced computed tomography) for analysis and modelling. Radiomics model and 3D CNN model were used to differentiate TETs from non-TET PMTs (including cyst, malignant germ cell tumor, lymphoma and teratoma). The macro F1-score and receiver operating characteristic (ROC) analysis were performed to evaluate the prediction models.
Result: In the UECT dataset, there were 297 patients with TETs and 79 patients with other PMTs. The performance of radiomic analysis with machine learning model using LightGBM with Extra Tree (macro F1-Score = 83.95%, ROC-AUC = 0.9117) had better performance than the 3D CNN model (macro F1-score = 75.54%, ROC-AUC = 0.9015). In the CECT dataset, there were 296 patients with TETs and 77 patients with other PMTs. The performance of radiomic analysis with machine learning model using LightGBM with Extra Tree (macro F1-Score = 85.65%, ROC-AUC = 0.9464) had better performance than the 3D CNN model (macro F1-score = 81.01%, ROC-AUC = 0.9275).
Conclusion: Our study revealed that the individualized prediction model integrating clinical information and radiomic features using machine learning demonstrated better predictive performance in the differentiation of TETs from other PMTs at chest CT scan than 3D CNN model.
Introduction
Prevascular mediastinal tumor (PMT) is relatively uncommon, making up less than 1% of all solid tumors (1). PMT consists of a wide variety of entities, including thymic epithelial tumor (TET), lymphoma, germ cell tumor, ectopic thyroid, and cyst, among which TET is the most frequently encountered (2). The NCCN guidelines suggested that patients with clinically resectable TETs undergo upfront surgical resection instead of preoperative transpleural biopsy to avoid converting stage I thymoma to stage IV thymoma by spreading tumor within the pleural space (3). Chest computed tomography (CT) scan is the standard assessment for PMT. It was reported that the accuracy rate of PMT interpretation via traditional radiographic features on CT scan reaches as high as 61% in experienced radiologists, leaving much room for improvements in the era of advanced technology (4).
The applications of radiomics in diagnostic medicine and outcome analysis have been increasingly proposed lately (5, 6). By combining image-filtering and feature-extraction methods, it is possible to extract a large number of high-order radiomic features from CT images (5). Studies have shown significant radiomic parameters such as skewness, kurtosis, and entropy, correlated with thymic tumor histology (7, 8). Redundancy is often the scenario in highly dimensional data, and classification model could be developed only through proper feature selection and proper machine learning (9).
Convolutional neural networks (CNNs) are a class of deep learning (DL) models that combine imaging filters with artificial neural networks through a series of successive linear and nonlinear layers (10, 11). CNN is far more data hungry because of its millions of learnable parameters to estimate, and therefore is more computationally expensive, resulting in the requirement of graphical processing units (GPUs) for model training. The major drawback in the application of 3D deep learning on medical images is its dependency on data availability and high computational cost (12). With powerful GPUs becoming increasingly available, we have seen exponential growth in the applications of 3D deep learning in different medical image modalities (11).
Nonetheless, with the low incidence of tumor occurrence and the resultant limited radiographic data and information, it is yet to be clarified if 3D CNN out-performs radiomics with ML in differentiating various kinds of mediastinal tumor. Our study aimed to compare the model using radiomics with traditional machine learning with 3D CNN model in differentiating TETs from PMTs, thus providing a prediction tool and the opportunity of improvement on the decisions for invasive diagnostic or treatment modalities.
Materials and methods
Study population
A retrospective study was performed in patients with PMTs and undergoing surgical resection or biopsy in National Cheng-Kung University Hospital, Tainan, Taiwan, E-Da Hospital, Kaohsiung, Taiwan, and Kaohsiung Veterans General Hospital, Kaohsiung, Taiwan between January 2010 and December 2019. Informed consent was waived because the study was retrospective, and it was respectively approved by the Institutional Review Board of National Cheng Kung University Hospital (A-ER-111-211), E-Da Hospital (EMRP-110-145), and Kaohsiung Veteran General Hospital (VGHKS19-CT6-08). Exclusion criteria were patients younger than 20-year-old, missing imaging data, or metastatic prevascular mediastinal tumors. The patients younger than 20-year-old were excluded because the most common types of tumors in this age group are lymphomas and malignant germ cell tumors, and the rare occurrence of thymic epithelial tumors (TET) does not have much impact on histological classification of tumors (13). Clinical data including age, sex, myasthenia gravis (MG) symptoms and pathologic diagnosis were collected. We divided our dataset into UECT (unenhanced computed tomography) and CECT (enhanced computed tomography) for analysis and modelling. Radiomic model and 3D CNN model were adopted respectively to differentiate TETs from non-TET PMTs (including cyst, malignant germ cell tumor, lymphoma and teratoma). The flowchart of patient inclusion was shown in Figure 1.
Figure 1 Flowchart of patients inclusion. NCKUH, National Cheng Kung University Hospital; KVGH, Kaohsiung Veteran General Hospital.
Image acquisition
All CT images were obtained using Siemens SOMATOM Definition Flash, Siemens SOMATOM Definition AS, Siemens SOMATOM Sensation 16, GE Optima CT660, GE Revolution CT, GE Bright Speed Elite, GE light speed VCT, and TOSHIBA CT64-TSX-01A64. The CT protocols were as follows: 120 kVp; tube current, 150–200 mAs with automatic tube current modulation. The section thickness ranged between 0.7 mm and 1.5 mm, and the image size was 512 × 512 pixels. The detailed protocol and contrast materials are summarized in Table E1. Three patients received solely non-enhanced CT scan due to renal function impairment, while the other patients received both non-enhanced and enhanced CT scan. Contrast enhanced images were obtained after intravenous administration of contrast medium (injection dose 60-120 mL at a rate of 1.5-3 ml/s) followed by a 20 ml saline flush. Contrast enhanced images were obtained 90s after contrast agent administration. All images were reconstructed into 5-mm sections.
Tumor segmentation and image preprocessing
CT images were imported into the open-source software 3D Slicer 4.10.2 and the tumors were then contoured manually by one of two observers (C.Y.L. a radiologist with 9 years of experience and C.C.C, a thoracic surgeon with 9 years of experience) blinded to patient diagnosis using the built-in paint tool (14). The delineation of tumor at UECT and CECT was performed in the mediastinal setting (window level, 50 HU; window width, 350 HU) on the axial CT plane. Consensus was reached by discussion if the interobserver variability was apparent. For normalization, all CT voxels were resampled to 1 mm3 using a cubic interpolation.
Radiomics feature extraction, selection and model building
The global framework showing the radiomic analysis process is shown in Figure 2. The whole PMTs in each CT examination served as VOIs, from which 3D radiomic features were extracted using the open-source platform PyRadiomics (15). A total of 851 radiomic features, including 14 shape features, 18 intensity histogram features, 74 texture features, and 745 wavelet features were extracted for further analysis.
Figure 2 Flowchart of the proposed scheme. PMT, prevascular tumor; LASSO, least absolute shrinkage and selection operator; MG, myasthenia gravis; RF, random forest; XGBoost, extreme gradient boosting; LightGBM, light gradient boosting machine.
A multivariate logistic regression model was developed using the least absolute shrinkage and selection operator (LASSO) with L1 penalty to filter the features to reduce the redundancy of the features. The features with non-zero coefficients at optimized hyperparameter lambda were selected and used in ML. Relevant clinical information including age, sex and MG symptoms were also input as feature vectors in ML.
In combination with feature-selection method, eight ML classifiers were used to differentiate thymoma from other PMTs: KNeighbors, random forest (RF), extreme gradient boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), CatBoost, Extra Tree, and Weight Ensemble Model (WeightEnsemble_L2). A Bayesian optimization algorithm (BOA) was applied to optimize the hyperparameters of these models. The flowchart of the proposed scheme was illustrated in Figure 2.
3D convolutional neural networks model
A fully convolutional neural network (CNN), as described in a recently published previous work was used (16). ROI patches were extracted from each CT image by defining a bounding box that enclosed each previously defined ROI. All patches were resized to 128 × 128 × 64. Owing to the limited training data, we applied random rotation (-30°~30°) for training data augmentation. As radiomics-based machine learning model, the clinical information including age, sex and MG symptoms were also input as feature vectors in this neural network model.
All the models are implemented in Python 3.8.9 based on tensorflow 2.8.0 and trained on 1 Tesla V100-DGXS-32GB. The loss function during training was the binary cross-entropy metric and was optimized using an Adam optimizer with a batch size of 2, and an initial learning rate of 10-5. Early stopping was employed to prevent overfitting, and training was stopped once model performance stops improving on a validation dataset after 350 constitutive training epochs (maximum epoch value = 1000). The best model observed during training would be the output model. The flowchart of the proposed scheme was illustrated in Figure 2.
Statistical analyses
Continuous variables were compared using the Student t-test, and categorical variables were compared using the chi-square test. P values of < 0.05 were considered statistically significant.
Clinical information (age, sex and presence of MG symptom) was added into radiomic model and 3D CNN model. Each dataset was randomly split into training, validation and testing sets in the ratio of 60:15:25. The models were evaluated through repeated random sub-sampling validation.
The accuracy, macro precision, macro recall, and macro F1-score for each dataset were calculated and determined to verify the performance of the different models. The macro F1-score reflected the effectiveness on small classes and is an effective evaluation metric for an imbalanced dataset. Receiver operating characteristic (ROC) analysis was performed, and the area under the curve (AUC) was used to evaluate the prediction models. The analysis was performed using python 3.8.9 with scikit-learn 1.0.1, autogluon 0.2.0, and statsmodels 0.13.1.
Result
Basic clinical characteristics
The clinicopathological characteristics of patients in our study are shown in Table 1. In UECT dataset, there were 297 patients with TETs and 79 patients with other PMTs. In CECT dataset, there were 296 patients with TETs and 77 patients with other PMTs. There was significant difference in age and MG symptoms in UECT dataset [TETs vs. other PMTs: 61.70 ± 12.99 vs. 51.19 ± 17.74, p < 0.001; 87 (29.3%) vs. 4 (5.1%), p < 0.001]. In TETs, there were 19.2% thymic carcinoma and 80.8% thymomas. In other PMTs, there were mostly cyst (54.4%). In CECT dataset, there was significant difference in age and MG symptoms [TETs vs. other PMTs: 61.46 ± 13.06 vs. 49.26 ± 17.76, p < 0.001; 87 (29.4%) vs 4 (5.2%), p < 0.001]. In TETs, there were 19.3% thymic carcinoma and 80.7% thymomas. In other PMTs, there were mostly cyst (53.2%).
Radiomics feature selection and optimal signature construction
After performing tumor segmentation in the included patients, 851 radiomics features were extracted (Supplementary Tables 1, 2). The top five feature selection using LASSO logistic regression with different values of lambda was shown in Tables 2, 3, and the top 20 feature selection was showed in Supplementary Tables 3, 4.
Table 2 Top five variable feature selection performed by LASSO Logistic Regression at various lambda value using UECT.
Table 3 Top five variable feature selection performed by Lasso Logistic Regression at various lambda value using CECT.
The result of best feature selection using various machine learning methods was demonstrated in Table 4. In UECT dataset, LightGBM with Extra Tree using features in selection_5 had best performance (macro F1-Score = 83.95%, accuracy = 89.99%). The ROC curve was shown in Figure 3 with AUC = 0.9117. In CECT dataset, LightGBM with Extra Tree using features in selection_4 had best performance (Macro F1-Score = 85.65%、accuracy = 91.15%). The ROC curve was shown in Figure 3 with AUC = 0.9464. The results of Bayesian optimization of different models and various feature selection in UECT and CECT were showed in Supplementary Tables 5, 6.
Table 4 The best result of radiomics models and 3D CNN model to differentiate thymoma from other prevascular mediastinal tumors.
Figure 3 Receiver operating characteristic (ROC) curves showed the performance of LightGBM with Extra Tree and 3D CNN on UECT and CECT. AUC, area under curve.
3D CNN analysis
The result of 3D CNN classification was shown in Table 4. In UECT dataset, macro F1-score was 75.54%, accuracy 84.16%. The ROC curve was shown in Figure 3 with AUC = 0.9015. In CECT dataset, macro F1-score was 81.01%, accuracy 86.73%. The ROC curve was shown in Figure 3 with AUC = 0.9275. Because we used repeated random sub-sampling validation method for 10 times, the total training time was 14.5 hours in UECT dataset and 14.1 hours in CECT dataset.
In comparison, the performance of radiomic analysis with machine learning model using LightGBM with Extra Tree had better performance than the 3D CNN model in both UECT and CECT dataset. Four cases were illustrated to differentiate thymic epithelial tumors from other prevascular mediastinal tumors with our ML and 3D CNN classification models in the Figure 4.
Figure 4 Demonstration of the application of ML and 3D CNN classification models to analyze four distinct cases with varying pathologies. A case is classified as TET if the confidence score obtained from a ML model and 3D CNN using their UECT and CECT, respectively, is greater than 0.50, and classified as non-TET if it is less than or equal to 0.50. Yellow text indicates correct predictions, while blue text indicates incorrect predictions. (A) A 61 years old male with thymoma. Both LightGBM with Extra Tree and 3D CNN had correct prediction from UECT and CECT. (B) A 54 years old female with lymphoma. The LightGBM with ExtraTree had correct prediction from CECT. (C) A 50 years old male with malignant germ cell tumor. The LightGBM with Extra Tree had correct prediction from both UECT and CECT, while 3D CNN had correct prediction from UECT. (D) A 78 years old female with thymic cyst. The LightGBM with Extra Tree had correct prediction from both UECT and CECT, while 3D CNN had correct prediction from CECT.
Discussion
Our result showed radiomics with ensemble machine learning achieved better performance than 3D CNN in differentiating TETs from other PMTs. Deep learning (DL) model presented more stable shape than radiomics with ML model on ROC curve. Radiomics with ML and DL are active research in the field of oncology (17). Some studies showed that the DL model had better performance than the ML-based radiomics (18–20), some showed ML-based radiomics out-performed DL model (21), and some demonstrated DL-based radiomics model had the best performance (22, 23). Prior studies had performed radiomics based ML or DL to classify thymoma form other PMT (24–26). However, our study was the first to compare the performance of radiomics-based ML with DL using the same dataset to differentiated thymoma form other PMT. While it is well-known that with large datasets, the performance of DL model was superior to hand-crafted feature extraction, a large dataset is not always available in medicine and may be limited by factors such as disease incidence, prevalence, and obstacles in data procurement. For small dataset, studies have suggested feature engineering may be a more suitable machine learning strategy with notable advantages of radiomics for medical imaging analysis. At present, studies that directly compare radiomics and deep learning clinical model performance are relatively unexplored. In this study, we address these questions and further aim to enhance interpretability of such machine learning models (27).
There are a large number of vectors and associated computational cost in DL. We used 3D CNN in our study, which was relatively simple; however, the trainable parameters were up to 1,351,873. Because of the long training time, it would be of more difficulty for researchers to modify the algorithm repeatedly. In comparison, radiomics with ML had lower computational cost but was associated with more complicated process. After radiomic feature extraction, most feature vectors are redundant. Therefore, feature selection is demanded to build a model via ML. The method of ML has substantial impact on its performance. In our study, not all radiomics with ML method out-performed 3D CNN. In consistent with prior study, ensemble learning had the best result. Ensemble learning can also be applied in DL, with the cost of longer training time.
Our results revealed that dataset of CECT worked better than UECT in classifying thymoma from other PMTs using both radiomics models and 3D CNN model, which was consistent with our clinical experience. The imaging characters at chest CT scan of low-risk thymoma and thymic cyst showed round or oval shape, smooth contour, while high-risk thymoma showed irregular shape and contour. Nonseminomatous germ cell tumor demonstrated marked hemorrhagic necrosis, while teratoma revealed fat component (2). From the result of LASSO selection in UECT and CECT, sphericity in shape feature played an important role in two dataset model, consistent with the finding in our conventional CT scan. High resolution medical imaging contains many features that is difficult to discover by visual inspection. The ability of multi-scale and multiresolution in wavelet transform has been verified in many imaging studies, and often applied to image compression, edge detection, feature extraction, and texture analysis. Our study demonstrated that wavelet-based features were selected by two datasets, suggesting the importance of high order features in imaging identification. However, other shape features and original first order features are more important in classification in the UECT dataset than in the CECT dataset. Compared with CECT, UECT lacks contrast agent to demonstrate richer texture features of soft tissue, and the septa within the tumor or the range of necrosis are less clearly seen. This could be responsible for the reason that UECT had a different tendency of feature selection from CECT dataset.
In differentiating TET from cysts, radiologists primarily focus on the Hounsfield units (HU) changes between non-enhanced and contrast-enhanced scans. Previous studies have found that cysts have a mean attenuation value of around 23 HU and a maximal attenuation value of 58 HU (28). However, some thymic cysts may have increased CT attenuation if hemorrhage or infection occurs, and relying solely on non-enhanced scans may lead to misdiagnosis. According to our research approach, we have developed separate models for predicting TET from non-enhanced and contrast-enhanced CT scans. Interestingly, although contrast-enhanced CT has better predictive performance, using non-enhanced CT alone, whether based on radiomic-based machine learning or 3D convolutional neural network, achieves an AUC of > 0.9, which is close to the performance of the contrast-enhanced CT group. The difference of macro F1-score in all ML methods and 3D CNN in both UECT and CECT datasets was less than five percent. Tumor segmentation in UECT is sometimes difficult due to its proximity to adjacent vessel, heart, pericardial effusion, or consolidated lung. However, radiomics-based ML achieved an accuracy of 90%, indicating that our model had good performance using UECT. As the LDCT for lung cancer screening become more prevalent, there is increasing number of incidentally found asymptomatic PMTs. Once the UECT dataset provides high accuracy in differentiating PMTs, patients do not need to undergo CECT, and radiation dose and contrast agent exposure with the likelihood of kidney injury could therefore be minimized. Besides, previous literature has reported that approximately 22% to 68% of non-therapeutic thymectomies were unnecessary (28). In our dataset, 14% (53/376) of the patients had cysts or lymphoma and could have been otherwise managed instead of being operated on. Our model achieved an accuracy of 0.91. Therefore, we believe that increasing the accuracy of preoperative imaging diagnosis will help to reduce unnecessary invasive procedures.
Our study had several limitations. First, tumors were manually segmented; this could be user-dependent, time-consuming and labor-intensive. Prior studies have demonstrated DL-based tumor segmentation algorithm with robust performance. Automated tumor segmentation could probably be integrated into an automated processing pipeline to minimize subjectivity and facilitate large-scale studies. Second, after radiomics feature extraction, we used only LASSO regression with variable lambda value for feature selection. Adopting different feature selection methods could result in different outcome. Third, we used a relatively simple 3D CNN model for classification. Although complicated model is computationally expensive, future advancement in hardware (e.g., GPU or cloud computing) and algorithmic development is expectable. Fourth, because of the retrospective nature of this analysis, a selection bias was unavoidable. Lastly, CT images in our study were obtained using heterogeneous CT scanners, with various acquisition parameters, which can affect radiomic features and analysis. Nonetheless, the diagnostic performance of the radiomics model remained high in the validation cohort, which verified the good generalizability of the model.
As DL gradually becomes the mainstream of imaging study, some studies showed DL was superior to radiomics with ML in visual classification. However, it is susceptible to overfitting and takes a large number of data for model training and parameter tuning. Owing to the limited size of dataset, our proposed radiomics-based ML and 3D CNN scheme may be overfitting during training process. In rare disease with limited case number, radiomics-based ML may have better efficacy with lower computational cost. The best method of computing depends on the subject of study and size of dataset. Further studies are mandatory to evaluate the efficacy of ML and DL in the same dataset.
Conclusion
To conclude, an ensemble ML method with radiomic feature can be useful for differentiating TETs from other type of PMTs, performed slightly better than a 3D CNN, and demonstrated good generalizability across institutions.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving human participants were reviewed and approved by National Cheng Kung University Hospital (A-ER-111-211), E-Da Hospital (EMRP-110-145), and Kaohsiung Veteran General Hospital (VGHKS19-CT6-08). Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
(I) Conception and design: C-CC, Y-TY. (II) Administrative support: None. (III) Provision of study materials or patients: C-CC, E-KT, Y-FW, C-YL, F-ZW, M-TW, Y-SL, Y-TY and Y-LT. (IV) Collection and assembly of data: C-CC, E-KT, Y-FW, F-ZW and C-YL. (V) Data analysis and interpretation: C-CC, C-YL and M-CM. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the National Cheng Kung University Hospital of Taiwan [NCKUH-11103026 and NCKUH-11201007], the Ministry of Science and Technology of Taiwan [MOST 110-2314-B-006-103 and MOST 111-2314-B-006-106], and Higher Education Sprout Project, Ministry of Education to the Headquarters of University Advancement at National Cheng Kung University(NCKU).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1105100/full#supplementary-material
References
1. Araki T, Nishino M, Gao W, Dupuis J, Washko GR, Hunninghake GM, et al. Anterior mediastinal masses in the framingham heart study: Prevalence and CT image characteristics. Eur J Radiol Open (2015) 2:26–31. doi: 10.1016/j.ejro.2014.12.003
2. Nakazono T, Yamaguchi K, Egashira R, Mizuguchi M, Irie H. Anterior mediastinal lesions: CT and MRI features and differential diagnosis. Japanese J Radiol (2021) 39(2):101–17. doi: 10.1007/s11604-020-01031-2
3. Network, National Comprehensive Cancer. Thymomas and thymic carcinomas (2022). Available at: https://www.nccn.org/professionals/physician_gls/pdf/thymic.pdf.
4. Tomiyama N, Honda O, Tsubamoto M, Inoue A, Sumikawa H, Kuriyama K, et al. Anterior mediastinal tumors: diagnostic accuracy of CT and MRI. Eur J Radiol (2009) 69(2):280–8. doi: 10.1016/j.ejrad.2007.10.002
5. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer (2012) 48(4):441–6. doi: 10.1016/j.ejca.2011.11.036
6. Liu Z, Wang S, Dong D, Wei J, Fang C, Zhou X, et al. The applications of radiomics in precision diagnosis and treatment of oncology: Opportunities and challenges. Theranostics (2019) 9(5):1303–22. doi: 10.7150/thno.30309
7. Yasaka K, Akai H, Nojima M, Shinozaki-Ushiku A, Fukayama M, Nakajima J, et al. Quantitative computed tomography texture analysis for estimating histological subtypes of thymic epithelial tumors. Eur J Radiol (2017) 92:84–92. doi: 10.1016/j.ejrad.2017.04.017
8. Iannarelli A, Sacconi B, Tomei F, Anile M, Longo F, Bezzi M, et al. Analysis of CT features and quantitative texture analysis in patients with thymic tumors: correlation with grading and staging. Radiol Med (2018) 123(5):345–50. doi: 10.1007/s11547-017-0845-4
9. Rizzo S, Botta F, Raimondi S, Origgi D, Fanciullo C, Morganti AG, et al. Radiomics: the facts and the challenges of image analysis. Eur Radiol Exp (2018) 2(1):36. doi: 10.1186/s41747-018-0068-z
10. Lee JG, Jun S, Cho Y. W, Lee H, Kim GB, Seo JB, et al. Deep learning in medical imaging: General overview. Korean J Radiol (2017) 18(4):570–84. doi: 10.3348/kjr.2017.18.4.570
11. Singh SP, Wang L, Gupta S, Goli H, Padmanabhan P, Gulyas B. 3D deep learning on medical images: A review. Sensors (Basel) (2020) 20(18):5097. doi: 10.3390/s20185097
12. Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional neural networks: an overview and application in radiology. Insights Imaging (2018) 9(4):611–29. doi: 10.1007/s13244-018-0639-9
13. Biko DM, Lichtenberger JP. 3rd, Rapp JB, Khwaja A, Huppmann AR, Chung EM. Mediastinal masses in children: Radiologic-pathologic correlation. RadioGraphics (2021) 41(4):1186–207. doi: 10.1148/rg.2021200180
14. Court LE, Fave X, Mackin D, Lee J, Yang J, Zhang L. Computational resources for radiomics. Trans Cancer Res (2016) 5(4):340–8. doi: 10.21037/tcr.2016.06.17
15. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res (2017) 77(21):e104–7. doi: 10.1158/0008-5472.CAN-17-0339
16. Zunair H, Rahman A, Mohammed N, Cohen JP. Uniformizing techniques to process CT scans with 3D CNNs for tuberculosis prediction. (2020). doi: 10.1007/978-3-030-59354-4_15
17. Choi JY. Radiomics and deep learning in clinical imaging: What should we do? Nucl Med Mol Imaging (2018) 52(2):89–90. doi: 10.1007/s13139-018-0514-0
18. Abrol A, Fu Z, Salman M, Silva R, Du Y, Plis S, et al. Deep learning encodes robust discriminative neuroimaging representations to outperform standard machine learning. Nat Commun (2021) 12(1):353. doi: 10.1038/s41467-020-20655-6
19. Park YR, Kim YJ, Ju W, Nam K, Kim S, Kim KG. Comparison of machine and deep learning for the classification of cervical cancer based on cervicography images. Sci Rep (2021) 11(1):16143. doi: 10.1038/s41598-021-95748-3
20. Xu Q, Zhu Q, Liu H, Chang L, Duan S, Dou W, et al. Differentiating benign from malignant renal tumors using T2- and diffusion-weighted images: A comparison of deep learning and radiomics models versus assessment from radiologists. J Magn Reson Imaging (2022) 55(4):1251–9. doi: 10.1002/jmri.27900
21. Castillo TJ, Arif M, Starmans MPA, Niessen WJ, Bangma CH, Schoots IG, et al. Classification of clinically significant prostate cancer on multi-parametric MRI: A validation study comparing deep learning and radiomics. Cancers (Basel) (2021) 14(1):12. doi: 10.3390/cancers14010012
22. Xia X, Gong J, Hao W, Yang T, Lin Y, Wang S, et al. Comparison and fusion of deep learning and radiomics features of ground-glass nodules to predict the invasiveness risk of stage-I lung adenocarcinomas in CT scan. Front Oncol (2020) 10:418. doi: 10.3389/fonc.2020.00418
23. Astaraki M, Yang G, Zakko Y, Toma-Dasu I, Smedby O, Wang C. A comparative study of radiomics and deep-learning based methods for pulmonary nodule malignancy prediction in low dose CT images. Front Oncol (2021) 11:737368. doi: 10.3389/fonc.2021.737368
24. Wang X, You X, Zhang L, Huang D, Aramini B, Shabaturov L, et al. A radiomics model combined with XGBoost may improve the accuracy of distinguishing between mediastinal cysts and tumors: a multicenter validation analysis. Ann Transl Med (2021) 9(23):1737. doi: 10.21037/atm-21-5999
25. Kirienko M, Ninatti G, Cozzi L, Voulaz E, Gennaro N, Barajon I, et al. Computed tomography (CT)-derived radiomic features differentiate prevascular mediastinum masses as thymic neoplasms versus lymphomas. Radiol Med (2020) 125(10):951–60. doi: 10.1007/s11547-020-01188-w
26. Sui H, Liu L, Li X, Zuo P, Cui J, Mo Z. CT-based radiomics features analysis for predicting the risk of anterior mediastinal lesions. J Thorac Dis (2019) 11(5):1809–18. doi: 10.21037/jtd.2019.05.32
27. Wang H, Wang L, Lee EH, Zheng J, Zhang W, Halabi S, et al. Decoding COVID-19 pneumonia: comparison of deep learning and radiomics CT image signatures. Eur J Nucl Med Mol Imaging (2021) 48(5):1478–86. doi: 10.1007/s00259-020-05075-4
Keywords: radiomics, convolutional neural networks, deep learning, machine learning, prevascular mediastinal tumor
Citation: Chang C-C, Tang E-K, Wei Y-F, Lin C-Y, Wu F-Z, Wu M-T, Liu Y-S, Yen Y-T, Ma M-C and Tseng Y-L (2023) Clinical radiomics-based machine learning versus three-dimension convolutional neural network analysis for differentiation of thymic epithelial tumors from other prevascular mediastinal tumors on chest computed tomography scan. Front. Oncol. 13:1105100. doi: 10.3389/fonc.2023.1105100
Received: 22 November 2022; Accepted: 27 March 2023;
Published: 18 April 2023.
Edited by:
Alla Reznik, Lakehead University, CanadaReviewed by:
YuChuan Hu, Fourth Military Medical University, ChinaWei Yang, Beijing Cancer Hospital, Peking University, China
Copyright © 2023 Chang, Tang, Wei, Lin, Wu, Wu, Liu, Yen, Ma and Tseng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yi-Ting Yen, Yjg1NDAxMDY3QGdtYWlsLmNvbQ==; Mi-Chia Ma, bWNtYUBtYWlsLm5ja3UuZWR1LnR3
†These authors have contributed equally to this work and share first authorship