- 1Medical imaging center, People’s Hospital of Ningxia Hui Autonomous Region, Yinchuan, China
- 2Department of Radiology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- 3Department of Pathology, People’s Hospital of Ningxia Hui Autonomous Region, Yinchuan, China
- 4Department of Radiology, Bayer Healthcare, Wuhan, China
Objective: To develop and validate a deep learning signature for noninvasive prediction of spread through air spaces (STAS) in clinical stage I lung adenocarcinoma and compare its predictive performance with conventional clinical-semantic model.
Methods: A total of 513 patients with pathologically-confirmed stage I lung adenocarcinoma were retrospectively enrolled and were divided into training cohort (n = 386) and independent validation cohort (n = 127) according to different center. Clinicopathological data were collected and CT semantic features were evaluated. Multivariate logistic regression analyses were conducted to construct a clinical-semantic model predictive of STAS. The Swin Transformer architecture was adopted to develop a deep learning signature predictive of STAS. Model performance was assessed using area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive and negative predictive value, and calibration curve. AUC comparisons were performed by the DeLong test.
Results: The proposed deep learning signature achieved an AUC of 0.869 (95% CI: 0.831, 0.901) in training cohort and 0.837 (95% CI: 0.831, 0.901) in validation cohort, surpassing clinical-semantic model both in training and validation cohort (all P<0.01). Calibration curves demonstrated good agreement between STAS predicted probabilities using deep learning signature and actual observed probabilities in both cohorts. The inclusion of all clinical-semantic risk predictors failed to show an incremental value with respect to deep learning signature.
Conclusions: The proposed deep learning signature based on Swin Transformer achieved a promising performance in predicting STAS in clinical stage I lung adenocarcinoma, thereby offering information in directing surgical strategy and facilitating adjuvant therapeutic scheduling.
Introduction
Lung cancer remains the leading lethal malignancy, responsible for 12.4% of all newly-diagnosed cases worldwide in 2022 (1). As the predominant cause of lung cancer-related mortality, lung adenocarcinoma exhibits distinctive histological growth pattern and molecular genotyping (2). Spread through air spaces (STAS) is a unique invasion pattern separate from lymphatic-vascular and visceral pleural invasion, with a predisposition in lung adenocarcinoma. Initially introduced by Kadota et al. and explicitly defined in the World Health Organization Classification of Lung Cancer in 2015, STAS refers to the dissemination of tumor cells as solid nests, micropapillary clusters or single cells into the peritumoral alveolar airspaces (3). Multiply studies have consistently demonstrated that STAS serves as a well-established prognosticator for lung adenocarcinoma undergoing sublobectomy, indicating an increased risk of postoperative relapse and worse prognosis (4–6). STAS is recognized as a pathological indicator for identifying the beneficiaries of adjuvant chemotherapy among stage IB patients (7). Therefore, STAS is of great significance in identifying high-risk patients and guiding personalized therapeutic strategies.
However, intraoperative pathological assessment for STAS through rapid frozen sections has been proved to be of limited sensitivity and reproducibility (8). The shifting of tumor cells to the peritumoral alveolar airspaces caused by manual operations such as extrusion, blade cutting and tissue dysfixation were hardly distinguished from STAS cell clusters, thereby hindering the reliable application of this approach. Several scholars exploited CT semantic indicators for STAS by visual inspection or manual measurement, such as tumor diameter, ground-glass opacity (GGO) components, and pleural retraction (9, 10). Nevertheless, these indicators rely on subjective judgement and professional skills, making them unsuitable for widespread clinical practice due to inconsistent interpretation criteria. Several studies developed CT-based radiomics signature predictive of STAS, but the radiomics approach involves several sequential processing steps such as tumor delineation, dimension reduction and model building (11, 12). The efficiency of radiomics modeling is highly influenced by interobserver heterogeneity and handling quality at each step.
Deep learning is an end-to-end network architecture, characterized by the ingestion of data from the input end and the generation of prediction results from the output end. The error between prediction result and actual observation is iteratively propagated through each layer, facilitating model adjustment and convergence. On account of the advantages of automatically learning and extracting representative information, deep learning has achieved remarkable efficacy in distinguishing histological subtypes, evaluating treatment response, and predicting survival (13–15). In this study, we employed Swin Transformer, a deep learning framework exploited by Microsoft Research Asia, to construct and validate a CT-based deep learning predictive model for STAS in lung adenocarcinoma. This study also sought to investigate the incremental value of clinical characteristics and conventional CT semantic features over the deep learning signature.
Methods
Patients
This study was approved by the Ethics Committee and the requirement for informed consent was waived due to its retrospective nature. The patients who underwent radical resection at the main campus of Tongji Hospital (Center 1) from October 2021 to June 2022 were systematically reviewed. Inclusion criteria were: (1) invasive lung adenocarcinoma confirmed by pathology; (2) maximum tumor diameter on CT images ≤ 4 cm; (3) no radiological signs of locoregional lymph node invasion or distant metastasis; (4) no preoperative radiotherapy, chemotherapy or targeted therapy; (5) interval time of preoperative CT examination and operation within two weeks. The exclusion criteria were: (a) rare histological variants; (b) simultaneous or metachronous tumors; (c) unavailable thin-section CT images or obvious image artifacts; (d) insufficient peritumoral parenchyma reserved for STAS assessment; (e) subjected to other cancers. Tumor staging was based on the eighth edition of the TNM staging system. Following the same criteria, patients undergoing radical surgical resection at the Sino-Germany Guanggu Campus of Tongji Hospital (Center 2) from January 2022 to June 2022 were retrospectively enrolled. Clinical information including gender, age, smoking history, pack-year and serum CEA level were acquired from clinical electronic records. The recruitment workflow is illustrated in Figure 1.
Histological assessment
Pathological characteristics including histological subtype, Ki-67 labeling index (LI), visceral pleural invasion, lymphatic-vascular invasion, pathological TNM staging and STAS were documented. The excision specimen was fixed in 10% formalin and embedded in paraffin before sectioned. Hematoxylin-eosin staining, immunohistochemistry staining and elastic fiber staining were performed accordingly. Two pathologists with experiences of 5 years and 11 years independently interpreted STAS on the sections. Initially, tumor smooth interfaces were recognized by naked eyes and at low-magnification (×10). Subsequently, three areas with the most abundant STAS were selected for interpretation at high-magnification (×200). If any of the following forms of tumor cells are observed within peritumoral alveolar airspaces, it is judged to be STAS-positive: (1) micropapillary clusters without a central fibrovascular core; (2) solid tumor nests; (3) discrete single tumor cells. Ki-67 LI is determined by the percentage of cells with stained-brown nuclei among 1000 tumor cells via immunohistochemical staining. Invasive lung adenocarcinoma is categorized into five histological subtypes based on growth architecture: lepidic, acinar, papillary, micropapillary and solid predominant adenocarcinoma.
CT scanning protocol and semantic feature interpretation
The patients were examined using multi-slice spiral CT scanners including GE Discovery 750 HD, TOSHIBA Aquilion One TSX-301A, Philips Brilliance ICT 256 and GE Optima CT 660. The acquisition parameters were detailed in Supplementary Data Sheet 1. CT semantic features were independently evaluated by two radiologists with 12 and 7 years of experience, respectively, blinded to the clinicopathological information. The lung window (width: 1600 HU; level: -600 HU) and mediastinal window (width: 400 HU; level: 40 HU) were fixed, respectively. CT semantic features included affiliated lobe, location, attenuation type, tumor total diameter, tumor consolidation diameter, consolidation-to-tumor ratio (CTR), shape, boundary, lobulation, spiculation, cavity, vacuole, air bronchogram, and plural attachment. CTR is quantified by the ratio of tumor consolidation diameter and total diameter. The definitions of CT semantic features were elucidated in Supplementary Data Sheet 1. The interobserver agreement for categorical and continuous variables was evaluated using Cohen ‘s kappa coefficient and intraclass correlation coefficient (ICC), respectively. The average measured by two radiologists was taken as the final value for continuous variables. Consensus on divergent categorical variables was reached through discussion involving a third radiologist.
Tumor segmentation and deep learning signature development
The automatic virtual adversarial training segmentation algorithm, based on a three-dimensional U-shape convolutional neural network known as 3D U-Net, was employed to achieve tumor segmentation. The topology of U-net was showed in Supplementary Data Sheet 1. For modeling, we proposed a deep learning framework called Swin Transformer to develop a signature predictive of STAS. The overall architecture consists of four transformer stages comprising Patch Embedding/Merging and Swin Transformer Blocks in each stage as revealed in Figure 2 and Supplementary Data Sheet 1. To mitigate overfitting due to limited amounts of data, the model was pretrained in CT images of lung cancer from the Cancer Imaging Archive followed by fine-tuned in 13510 CT images of lung adenocarcinoma in the training cohort. Furthermore, to compare the efficacy of different deep learning methods in predicting STAS, we applied ResNet-50, EfficientNet and ConvNeXt for modeling denoted as ModelResNet-50, ModelEfficientNet and ModelConvNeXt. The original code for implementing Swin Transformer can be acquired at https://github.com/microsoft/Swin-Transformer. We implemented the neural network using PyTorch 1.4.1 library in Python 3.7.0 (https://pytorch.org).
Figure 2. The overall framework of STAS prediction model development and validation. (A) Patients with lung adenocarcinoma were respectively enrolled from Center 1 and Center 2. (B) Imaging preprocessing included isotropic resample, intensity normalization and tumor automatic segmentation. (C) Deep learning signature predictive of STAS was developed based on Swin Transformer. (D) Histological assessment and radiologist interpretation were conducted for all patients in Center 1 and Center 2, and then model performance comparisons were performed. STAS, spread through air spaces.
Clinical-semantic model construction
Univariate analysis was initially performed to identify statistically significant clinical characteristics and CT semantic features between STAS positive and negative subgroups (P < 0.05) in the training cohort. Afterwards, features with Spearman correlation coefficient > 0.7 were removed in view of multicollinearity inference. The remaining features as candidate variables were included in multivariate logistic regression analysis to determine the features independently associated with STAS. The features were combined linearly weighted by their corresponding regression coefficients to construct clinical-semantic model. Given that the inherent design of preoperative prediction, pathological indicators were not included in logistic regression analysis, but compared across different STAS subgroups.
Statistical analysis
Statistical analysis was performed using MATLAB (MathWorksInc., Natick, MA) and SPSS (IBM, ver.26.0). Shapiro-Wilk test and Levene test were used to analyze the normality and homogeneity of variance for continuous variables. The continuous variables were compared using the Student’s t-test and Mann-Whitney U test, as appropriate. The comparisons of categorical variables were conducted by Chi-square test or Fisher exact test. Pearson correlation analysis was used to evaluate the correlation between features. The area under receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were used to quantify model performance. The calibration curve and Hosmer-Lemeshow test were employed to evaluate the consistency between predicted probabilities by deep learning signature and actual observations. A double-tailed P<0.05 indicated statistical significance.
Results
Baseline characteristics
In total, 126 eligible STAS-positive and 260 STAS-negative patients from Center 1 were enrolled to construct a training cohort (n=386). Accordingly, a total of 45 STAS-positive and 82 STAS-negative patients from Center 2 constituted an independent validation cohort (n=127). As revealed in Table 1, all clinicopathological characteristics and CT semantic features exhibited a balanced distribution between the training cohort and validation cohort. Of 513 patients, 239 (46.6%) were male [median age (interquartile): 59.0 (53.0, 65.0)] and 274 (53.4%) were female [median age (interquartile): 61.0 (54.0, 68.0)]. Totally, there were 171 (33.3%) and 342 (66.7%) patients pathologically-confirmed to be STAS-positive and STAS-negative, respectively.
Table 1. The distribution of clinicopathological characteristics in training cohort and validation cohort.
The interobserver consistency assessment for CT semantic features
As shown in Table 2, ICC for tumor total diameter, tumor consolidation diameter and CTR were 0.988 (95% CI: 0.985, 0.990), 0.991 (95% CI: 0.990, 0.993) and 0.982 (95% CI: 0.979, 0.985), respectively. Cohen ‘s kappa coefficients for the categorical variables ranged from 0.808 to 0.992, indicative of satisfactory interobserver agreement in interpreting CT semantic features. The discrepant numbers (frequency) of categorical variables between two radiologists were also documented as revealed in Table 2.
The association of clinicopathological characteristics with STAS
As shown in Table 3, STAS was more likely occurred in patients with pack-year > 40 (P=0.002) and CEA > 5 ug/L (P<0.001), but it had no significant association with gender, age and smoking history. STAS was more frequently observed in micropapillary and solid predominant adenocarcinoma, but rarely occurred in lepidic predominant adenocarcinomas (P<0.001). Furthermore, STAS was closely related with visceral pleural invasion and lymphatic-vascular invasion (P<0.001 and P<0.001). Ki-67 LI in STAS-positive subgroup significantly exceeded that of STAS-negative subgroup (P<0.001). Additionally, lung adenocarcinoma with higher pathological T and N stages showed a higher prevalence of STAS (P<0.001 for both).
Table 3. The relationships of clinicopathological characteristics and CT semantic features with STAS in training cohort.
The association of CT semantic features with STAS
Tumor total diameter, tumor consolidation diameter and CTR in STAS-positive subgroup were significantly higher than those in STAS-negative subgroup (all P<0.001; Figures 3 and 4). Solid tumors, obscure boundary, spiculation, vacuole and pleural attachment were more frequent in STAS, but air bronchogram was less common in STAS (all P< 0.05). The tumor consolidation diameter and attenuation subtype were excluded from logistic regression analysis considering a strong correlation with CTR (r=0.839 and 0.913, P< 0.001). Finally, CEA (odds ratio [OR]: 2.022; 95% CI: 1.080, 3.784; P=0.028), vacuole (OR: 3.509; 95% CI: 1.488, 8.278; P=0.004), obscure boundary (OR: 2.716; 95% CI: 1.628, 4.529; P<0.001) and CTR (OR: 1.023; 95% CI: 1.014, 1.033; P<0.001) were included to construct the clinical-semantic model as the independent risk indicators for predicting STAS.
Figure 3. CT image and pathological image obtained from a 65-year-old man with spread though air spaces negative lung adenocarcinoma. (A) Th axial CT image (width, 1600 HU; level, -600 HU) shows a sub-solid nodule in the right lower lobe. (B) The photomicrograph of hematoxylin-eosin-stained histological section (magnification × 200) shows clean alveolar spaces (yellow polygon) beyond the boundary (dashed line) of the tumor (black star).
Figure 4. CT image and pathological image obtained from a 59-year-old woman with spread though air spaces positive lung adenocarcinoma. (A) Th axial CT image (width, 1600 HU; level, -600 HU) shows a solid nodule in the right upper lobe. (B) The photomicrograph of hematoxylin-eosin-stained histological section (magnification × 200) shows several solid nests of tumor cell (yellow arrow) beyond the boundary (dashed line) of the tumor (black star).
Model construction and efficacy evaluation
As shown in the Table 4 and Figure 5, the AUC for Swin Transformer based deep learning signature in the training cohort and validation cohort was 0.869 (95% CI: 0.831, 0.901) and 0.837 (95% CI: 0.761, 0.896), respectively. Encouragingly, Swin Transformer based deep learning signature achieved significantly higher AUC than ModelResNet-50, ModelEfficientNet and ModelConvNeXt in training cohort (0.869 vs. 0.800, 0.797 and 0.783; all P < 0.001), as well as than ModelEfficientNet and ModelConvNeXt in validation cohort (0.837 vs. 0.775 and 0.795; P = 0.025 and 0.027), as shown in Supplementary Table E2. Deep learning signature showed an improvement in predictive performance than ModelResNet-50 in validation cohort, but it did not reach statistical significance (0.837 vs. 0.799, P = 0.087).
Figure 5. The performance comparisons of deep learning signature, CTR and clinical-semantic model in predicting STAS. (A, B) The receiver operating characteristic curves of CTR, clinical-semantic model and deep learning signature in training cohort (A) and validation cohort (B). Number in parenthesis is the area under receiver operating characteristic curve. (C, D) The calibration curves depicted the good agreements between predicted probabilities by deep learning signature and actual observed probabilities of STAS in training cohort (C) and validation cohort (D).
Meanwhile, The AUC for CTR alone and clinical-semantic model was 0.709 (95% CI: 0.660, 0.754) and 0.764 (95% CI: 0.719, 0.806) in training cohort, as well as 0.734 (95% CI: 0.648, 0.808) and 0.714 (95% CI: 0.627, 0.790) in validation cohort, respectively. In the training cohort, deep learning signature performed far superior to CTR (0.869 vs. 0.709, P < 0.001) and clinical-semantic model (0.869 vs.0.764, P < 0.001), with a statistically significant difference. Notably, deep learning signature yielded significantly higher AUC than both CTR (0.837 vs. 0.734, P=0.006) and clinical-semantics model (0.837 vs. 0.714, P=0.002) in validation cohort. The sensitivity, specificity, PPV and NPV of deep learning signature in predicting STAS ranged from 0.578 to 0.706, 0.892 to 0.951, 0.761 to 0.867 and 0.804 to 0.862 across two cohorts, respectively. According to the Hosmer-Lemeshow test and calibration curve, the predicted STAS probabilities by deep learning signature revealed good agreement with the actual observations both in training cohort and validation cohort (P=0.600 and 0.082, respectively). Furthermore, when deep learning signature was incorporated into clinical-semantic model, all CT semantic risk predictors were eliminated from multivariate regression analysis, with merely deep learning signature remained. Pearson correlation analysis revealed a strong correlation between CTR and deep learning signature (r = 0.789, P < 0.001).
Discussion
This study revealed that CEA, tumor boundary, vacuolation and CTR are the independent clinical-semantic features associated with STAS in lung adenocarcinoma. The proposed deep learning model predictive of STAS based on Swin Transformer yielded an AUC of 0.869 (95% CI: 0.821, 0.908) and 0.837 (95% CI: 0.742, 0.908) in the training cohort and independent validation cohort, superior to conventional CTR and clinical-semantic model. Furthermore, neither CTR nor clinical-semantic model exhibited an incremental value over deep learning signature, further confirming its superior predictive value.
For early-stage patients, sublobectomy can preserve more pulmonary function, reduce surgical complications, and shorten hospitalization time, particularly with an equivalent therapeutic effect to lobectomy (16). However, sublobectomy is not appropriate for STAS-positive patients due to a higher risk of locoregional relapse and distant metastasis compared with lobectomy. Another study proved that STAS had negligible adverse effects on prognosis if surgical margin distance exceeded 2 cm in limited resection (17). Thus, anatomic lobectomy and sufficient surgical margin should be recommended for STAS-positive patients to prevent recurrence caused by STAS. Dai et al. also demonstrated that recurrence-free survival rates and overall survival rates of stage IA STAS-positive patients were comparable to those of stage IB patients (18). Furthermore, stage IB patients with STAS-positive can benefit from adjuvant chemotherapy (7, 19). Consequently, STAS serves as a pathological indicator for T upstaging and risk stratification, as well as an effective biomarker for identifying the beneficiaries of adjuvant chemotherapy in early-stage patients.
Currently, there is limited research on leveraging deep learning technique to predict STAS, and the predictive capacity remains modest. Tao et al. applied 3D convolutional neural network to predict STAS in NSCLC, yielding an AUC of 0.790 in validation cohort (20). Wang et al. presented SE-Resnet50 for risk estimation of STAS in solid or part-solid lung adenocarcinoma, resulting a highest AUC of 0.933 achieved so far in training cohort. Nevertheless, their model exhibited a substantial performance reduction in validation cohorts (AUC=0.783-0.806), which approximated the performance of our developed ModelResNet-50 in training and validation cohorts (AUC=0.799-0.800)This unfavorable generalization may attribute to model overfitting by reason of complicated architecture (21). Lin et al. enrolled 581 patients with tumor smaller than 3 cm and CTR less than 0.5 from two institutions. They extracted the deep learning features from solid components and the entire tumors respectively, thereby developing deep learning models with and without solid component gate (SCG). The results revealed deep learning model with SCG achieved higher AUCs than deep learning model without SCG (22). Thus, further investigation is required to develop deep learning signature with SCG based on Swin Transformer, in expectation to further improve the prediction efficacy. In this study, Swin Transformer was adopted as the backbone architecture in modeling, achieving a satisfactory and comparable performance in training and validation cohorts with AUC ranging from 0.837 to 0.869, which was superior to ModelResNet-50, ModelEfficientNet and ModelConvNeXt. This finding lent support to the potential of our proposed Swin Transformer in predicting STAS in lung adenocarcinoma. The state-of-the-art Swin Transformer is regarded as the new backbone of machine vision. With two key strengths of non-overlapping shifted windows and hierarchical structures, Swin Transformer can flexibly process images at various scales and reduce computational complexity from the exponential level to the linear level. Growing evidence validated the efficient processing capabilities of Swin Transformer in handling multitasking such as image classification and density detection (23–25). Our previously published study has affirmed the remarkable efficiency of Swin Transformer in predicting lymph node metastasis in lung adenocarcinoma (26). Aside from that, automatic tumor segmentation was conducted in this study using a 3D U-shape convolutional neural network. This deep learning architecture serves as a highly effective tool for accurate, robust, and efficient segmentation. It surpasses the time-consuming and labor-intensive manual delineation or semi-automated segmentation, as evidenced by the Dice similarity coefficients across multiple institutions (27, 28).
Further exploring the relationship between STAS and histopathological factors, micropapillary and solid predominant adenocarcinoma were more commonly observed in STAS. Our findings demonstrated a significant association between STAS and visceral pleural invasion, lympho-vascular invasion and higher pathological T stage, consistent with previous literature (29). Additionally, lymph node invasion was more frequently found in STAS-positive subgroup (34.9% vs 5.4%). In line with our results, Vaghjiani et al. also reported that STAS was an independent predictor of occult lymph node metastasis in clinical stage IA lung adenocarcinoma (30). Although the underlying mechanism of STAS remains unclear till now, it has been found that epithelial-mesenchymal transition (EMT) prominently promotes the occurrence of STAS (31). EMT is widely recognized as a biological process wherein polarized epithelial cells transform into loosely connected interstitial cells; this process is regarded as the key driver of tumor genesis, invasion and metastasis. This may account for the strong association between STAS and the aforementioned invasive histopathological factors.
In clinical-semantic model, tumor boundary, vacuolation and CTR were the independent CT semantic features in predicting STAS. As a reflection of tumor aggressiveness, CTR weighted heavily in regression analysis with a 1.25-fold increased risk of STAS for every 10% increase. In accordance with our finding, Ding et al. and Chen et al. confirmed that CTR was independently associated with STAS (32, 33). Unexpectedly, the inclusion of all clinical-semantic risk predictors failed to show an incremental value with respect to deep learning signature. We found a strong correlation between CTR and deep learning signature (r = 0.789, P < 0.001), which might account for this result. These findings also lead to speculation on whether deep learning signature contains biological information regarding tumor boundary and vacuolation, which should be explored by future in-depth research. We also found CTR and clinical-semantic model showed equivalent NPV and sensitivity to deep learning signature. Notably, in both training and validation cohorts, the deep learning signature exhibited far superior AUC, specificity, and PPV compared to CTR and the clinical-semantic model, which lent support to its predominant efficacy in predicting STAS.
There are some limitations to this study. First, data were retrospectively collected from different CT equipment, so heterogeneity in acquisition parameters and reconstruction protocols might be inevitable. The class-imbalance in sample should be addressed using resample techniques in the future. Second, it is necessary to expand sample size and enroll multi-institutional data to further affirm the repeatability and generalization of deep learning signature. Besides, long-term follow up and survival data should be warranted to affirm the prognostic value of STAS, as well as the relationship of deep learning signature with prognosis. Further investigation is required to enhance the biological interpretability of deep learning, which inherently possesses a black box nature, thereby facilitating its application in clinical practice. Common approaches involve employing the Grad-CAM algorithm for generating visualizations of deep learning and incorporating attentional mechanisms into deep learning networks to achieve the significance weight of diagnosis and decision-making based on attention regions. Additionally, exploring the associations between deep learning and genomics or proteomics can further improve the biological interpretability of deep learning. Last, given that biological behavior varies in different histological subtypes of lung cancer, future research needs to supplement the predictive value of the deep learning signature for STAS in other histological subtypes.
In conclusion, the proposed deep learning signature based on Swin Transformer offers a promising predictive performance for STAS in clinical stage I lung adenocarcinoma, surpassing the conventional clinical-semantic model. The end-to-end deep learning approach harbors the potential as a well-established tool for noninvasive estimation of STAS, directing surgical strategy and facilitating adjuvant therapeutic scheduling.
Data availability statement
I’m regretful that the generated dataset is inappropriate to be disclosed so far, because some work based on parts of this dataset have not yet published. Requests to access the datasets should be directed to bWF4aWFvbGluZzA0MTdAMTYzLmNvbQ==.
Ethics statement
The studies involving humans were approved by Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because Written informed consent was waived because of retrospective study.
Author contributions
XM: Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing, Conceptualization. WH: Conceptualization, Data curation, Methodology, Writing – review & editing, Validation, Visualization. CC: Conceptualization, Writing – review & editing, Data curation, Resources, Software, Visualization. FT: Conceptualization, Methodology, Writing – review & editing, Formal analysis, Investigation. JC: Conceptualization, Software, Visualization, Writing – review & editing, Methodology, Validation. LY: Formal analysis, Investigation, Methodology, Writing – review & editing. DC: Conceptualization, Methodology, Validation, Writing – review & editing, Formal analysis, Funding acquisition, Project administration, Supervision. LX: Formal Analysis, Investigation, Methodology, Writing – review & editing, Conceptualization, Data curation, Project administration, Resources, Supervision.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study has received funding from the Ningxia Natural Science Foundation (2024AAC03483 and 2024AAC03457), and the Pilot Experiment Project of the National Natural Science Foundation of China (2025GZRYSY003).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1482965/full#supplementary-material
References
1. Bray F, Laversanne M, Sung H, Ferlay J, Siegel RL, Soerjomataram I, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. (2024) 74:229–63. doi: 10.3322/caac.21834
2. Yan R, Fan X, Xiao Z, Liu H, Huang X, Liu J, et al. Inhibition of DCLK1 sensitizes resistant lung adenocarcinomas to EGFR-TKI through suppression of Wnt/β-Catenin activity and cancer stemness. Cancer Lett. (2022) 531:83–97. doi: 10.1016/j.canlet.2022.01.030
3. Kadota K, Nitadori JI, Sima CS, Ujiie H, Rizk NP, Jones DR, et al. Tumor Spread through Air Spaces is an Important Pattern of Invasion and Impacts the Frequency and Location of Recurrences after Limited Resection for Small Stage I Lung Adenocarcinomas. J Thorac Oncol. (2015) 10:806–14. doi: 10.1097/jto.0000000000000486
4. Laville D, Désage AL, Fournel P, Bayle-Bleuez S, Neifer C, Picot T, et al. Spread through air spaces in stage I to III resected lung adenocarcinomas: should the presence of spread through air spaces lead to an upstaging? Am J Surg Pathol. (2024) 48:596–604. doi: 10.1097/pas.0000000000002188
5. Si H, Xu L, Zhao Y, Su H, Dai C, Xie H, et al. Spread through air spaces in residual tumor classification for clinical IA lung adenocarcinoma. Ann Thorac Surg. (2024) 118:825–33. doi: 10.1016/j.athoracsur.2024.03.007
6. Chen X, Zhou H, Wu M, Xu M, Li T, Wang J, et al. Prognostic impact of spread through air spaces in patients with ≤2 cm stage IA lung adenocarcinoma. J Thorac Dis. (2024) 16:2432–42. doi: 10.21037/jtd-24-444
7. Lv Y, Li S, Liu Z, Ren Z, Zhao J, Tao G, et al. Impact of surgery and adjuvant chemotherapy on the survival of stage I lung adenocarcinoma patients with tumor spread through air spaces. Lung Cancer. (2023) 177:51–8. doi: 10.1016/j.lungcan.2023.01.009
8. Villalba JA, Shih AR, Sayo TMS, Kunitoki K, Hung YP, Ly A, et al. Accuracy and reproducibility of intraoperative assessment on tumor spread through air spaces in stage 1 lung adenocarcinomas. J Thorac Oncol. (2021) 16:619–29. doi: 10.1016/j.jtho.2020.12.005
9. Suh JW, Jeong YH, Cho A, Kim DJ, Chung KY, Shim HS, et al. Stepwise flowchart for decision making on sublobar resection through the estimation of spread through air space in early stage lung cancer(1). Lung Cancer. (2020) 142:28–33. doi: 10.1016/j.lungcan.2020.02.001
10. Qin L, Sun Y, Zhu R, Hu B, Wu J. Clinicopathological and CT features of tumor spread through air space in invasive lung adenocarcinoma. Front Oncol. (2022) 12:959113. doi: 10.3389/fonc.2022.959113
11. Liao G, Huang L, Wu S, Zhang P, Xie D, Yao L, et al. Preoperative CT-based peritumoral and tumoral radiomic features prediction for tumor spread through air spaces in clinical stage I lung adenocarcinoma. Lung Cancer. (2022) 163:87–95. doi: 10.1016/j.lungcan.2021.11.017
12. Chen LW, Lin MW, Hsieh MS, Yang SM, Wang HJ, Chen YC, et al. Radiomic values from high-grade subtypes to predict spread through air spaces in lung adenocarcinoma. Ann Thorac Surg. (2022) 114:999–1006. doi: 10.1016/j.athoracsur.2021.07.075
13. Wulaningsih W, Villamaria C, Akram A, Benemile J, Croce F, Watkins J. Deep learning models for predicting Malignancy risk in CT-detected pulmonary nodules: A systematic review and meta-analysis. Lung. (2024) 202:625–36. doi: 10.1007/s00408-024-00706-1
14. Peng J, Xie B, Ma H, Wang R, Hu X, Huang Z. Deep learning based on computed tomography predicts response to chemoimmunotherapy in lung squamous cell carcinoma. Aging Dis. (2024). doi: 10.14336/ad.2024.0169
15. Na KJ, Kim YT, Goo JM, Kim H. Clinical utility of a CT-based AI prognostic model for segmentectomy in non-small cell lung cancer. Radiology. (2024) 311:e231793. doi: 10.1148/radiol.231793
16. Zhang B, Liu R, Ren D, Li X, Wang Y, Huo H, et al. Comparison of lobectomy and sublobar resection for stage IA elderly NSCLC patients (≥70 years): A population-based propensity score matching’s study. Front Oncol. (2021) 11:610638. doi: 10.3389/fonc.2021.610638
17. Masai K, Sakurai H, Sukeda A, Suzuki S, Asakura K, Nakagawa K, et al. Prognostic impact of margin distance and tumor spread through air spaces in limited resection for primary lung cancer. J Thorac Oncol. (2017) 12:1788–97. doi: 10.1016/j.jtho.2017.08.015
18. Dai C, Xie H, Su H, She Y, Zhu E, Fan Z, et al. Tumor Spread through Air Spaces Affects the Recurrence and Overall Survival in Patients with Lung Adenocarcinoma >2 to 3 cm. J Thorac Oncol. (2017) 12:1052–60. doi: 10.1016/j.jtho.2017.03.020
19. Chen D, Wang X, Zhang F, Han R, Ding Q, Xu X, et al. Could tumor spread through air spaces benefit from adjuvant chemotherapy in stage I lung adenocarcinoma? A multi-institutional study. Ther Adv Med Oncol. (2020) 12:1758835920978147. doi: 10.1177/1758835920978147
20. Tao J, Liang C, Yin K, Fang J, Chen B, Wang Z, et al. 3D convolutional neural network model from contrast-enhanced CT to predict spread through air spaces in non-small cell lung cancer. Diagn Interv Imaging. (2022) 103:535–44. doi: 10.1016/j.diii.2022.06.002
21. Wang S, Liu X, Jiang C, Kang W, Pan Y, Tang X, et al. CT-based super-resolution deep learning models with attention mechanisms for predicting spread through air spaces of solid or part-solid lung adenocarcinoma. Acad Radiol. (2024) 31:2601–9. doi: 10.1016/j.acra.2023.12.034
22. Lin MW, Chen LW, Yang SM, Hsieh MS, Ou DX, Lee YH, et al. CT-based deep-learning model for spread-through-air-spaces prediction in ground glass-predominant lung adenocarcinoma. Ann Surg Oncol. (2024) 31:1536–45. doi: 10.1245/s10434-023-14565-2
23. Li L, Mei Z, Li Y, Yu Y, Liu M. A dual data stream hybrid neural network for classifying pathological images of lung adenocarcinoma. Comput Biol Med. (2024) 175:108519. doi: 10.1016/j.compbiomed.2024.108519
24. Lo CM, Wang CC, Hung PH. Interactive content-based image retrieval with deep learning for CT abdominal organ recognition. Phys Med Biol. (2024) 69. doi: 10.1088/1361-6560/ad1f86
25. Li C, Bagher-Ebadian H, Sultan R, Elshaikh M, Movsas B, Zhu D, et al. A new architecture combining convolutional and transformer-based networks for automatic 3D multi-organ segmentation on CT images. Med Phys. (2023) 50:6990–7002. doi: 10.1002/mp.16750
26. Ma X, Xia L, Chen J, Wan W, Zhou W. Development and validation of a deep learning signature for predicting lymph node metastasis in lung adenocarcinoma: comparison with radiomics signature and clinical-semantic model. Eur Radiol. (2023) 33:1949–62. doi: 10.1007/s00330-022-09153-z
27. Carles M, Kuhn D, Fechter T, Baltas D, Mix M, Nestle U, et al. Development and evaluation of two open-source nnU-Net models for automatic segmentation of lung tumors on PET and CT images with and without respiratory motion compensation. Eur Radiol. (2024) 34:6701–11. doi: 10.1007/s00330-024-10751-2
28. Park J, Kang SK, Hwang D, Choi H, Ha S, Seo JM, et al. Automatic lung cancer segmentation in [(18)F]FDG PET/CT using a two-stage deep learning approach. Nucl Med Mol Imaging. (2023) 57:86–93. doi: 10.1007/s13139-022-00745-7
29. Shih AR, Mino-Kenudson M. Updates on spread through air spaces (STAS) in lung cancer. Histopathology. (2020) 77:173–80. doi: 10.1111/his.14062
30. Vaghjiani RG, Takahashi Y, Eguchi T, Lu S, Kameda K, Tano Z, et al. Tumor spread through air spaces is a predictor of occult lymph node metastasis in clinical stage IA lung adenocarcinoma. J Thorac Oncol. (2020) 15:792–802. doi: 10.1016/j.jtho.2020.01.008
31. Niu Y, Han X, Zeng Y, Nanding A, Bai Q, Guo S, et al. The significance of spread through air spaces in the prognostic assessment model of stage I lung adenocarcinoma and the exploration of its invasion mechanism. J Cancer Res Clin Oncol. (2023) 149:7125–38. doi: 10.1007/s00432-023-04619-z
32. Ding Y, Chen Y, Wen H, Li J, Chen J, Xu M, et al. Pretreatment prediction of tumour spread through air spaces in clinical stage I non-small-cell lung cancer. Eur J Cardiothorac Surg. (2022) 62:ezac248. doi: 10.1093/ejcts/ezac248
Keywords: deep learning, lung adenocarcinoma, spread though air space, computer tomography, prediction
Citation: Ma X, He W, Chen C, Tan F, Chen J, Yang L, Chen D and Xia L (2025) A CT-based deep learning model for preoperative prediction of spread through air spaces in clinical stage I lung adenocarcinoma. Front. Oncol. 14:1482965. doi: 10.3389/fonc.2024.1482965
Received: 19 August 2024; Accepted: 17 December 2024;
Published: 08 January 2025.
Edited by:
Sandeep Kumar Mishra, Yale University, United StatesReviewed by:
Xiaoliang Shao, The Third Affiliated Hospital of Soochow University, ChinaXiaonan Shao, Third Affiliated Hospital of Soochow University, China
Nazreen Pallikkavaliyaveetil MohammedSheriff, University of Michigan, United States
Copyright © 2025 Ma, He, Chen, Tan, Chen, Yang, Chen and Xia. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Liming Xia, eGlhbGltaW5nMjAxN0BvdXRsb29rLmNvbQ==; Dazhi Chen, Y2hkYXpoaUBzaW5hLmNvbQ==