Skip to main content

ORIGINAL RESEARCH article

Front. Oncol.
Sec. Cancer Imaging and Image-directed Interventions
Volume 15 - 2025 | doi: 10.3389/fonc.2025.1475133
This article is part of the Research Topic Quantitative Imaging: Revolutionizing Cancer Management with biological sensitivity, specificity, and AI integration View all 23 articles

TriSwinUNETR Lobe Segmentation Model for Computing DIR-Free CT-Ventilation

Provisionally accepted
Gabriela Nomura Gabriela Nomura 1Aaron Luong Aaron Luong 1Ananya Prakash Ananya Prakash 1Annabelle Alemand Annabelle Alemand 1Tanish Bhowmick Tanish Bhowmick 1Alisa Ali Alisa Ali 1Jaimie Ren Jaimie Ren 1Basil Rehani Basil Rehani 1Girish Nair Girish Nair 2Richard Castillo Richard Castillo 3Yevgeniy Vinogradskiy Yevgeniy Vinogradskiy 4Edward Castillo Edward Castillo 1*
  • 1 The University of Texas at Austin, Austin, United States
  • 2 Oakland University William Beaumont School of Medicine, Rochester, Michigan, United States
  • 3 Department of Radiation Oncology, Winship Cancer Institute, Emory University, Atlanta, Georgia, United States
  • 4 Department of Radiation Oncology, Thomas Jefferson University, Philadelphia, Pennsylvania, United States

The final, formatted version of the article will be published soon.

    Purpose: Functional radiotherapy avoids the delivery of high-radiation dosages to high-ventilated lung areas. Methods to determine CT-ventilation imaging (CTVI) typically rely on deformable image registration (DIR) to calculate volume changes within inhale/exhale CT image pairs. Since DIR is a non-trivial task that can bias CTVI, we hypothesize that lung volume changes needed to calculate CTVI can be computed from AI-driven lobe segmentations in inhale/exhale phases, without DIR. We utilize a novel lobe segmentation pipeline (TriSwinUNETR), and the resulting inhale/exhale lobe volumes are used to calculate CTVI. Methods: Our pipeline involves three SwinUNETR networks, each trained on 6,501 CT image pairs from the COPDgene study. An initial network provides right/left lung segmentations used to define bounding boxes for each lung. Bounding boxes are resized to focus on lung volumes and then lobes are segmented with dedicated right and left SwinUNETR networks. Fine-tuning was conducted on CTs from 11 patients treated with radiotherapy for non-small cell lung cancer. Five-fold cross-validation was then performed on 51 LUNA16 cases with manually delineated ground truth. Breathing-induced volume change was calculated for each lobe using AI-defined lobe volumes from inhale/exhale phases, without DIR. Resulting lobar CTVI values were validated with 4DCT and positron emission tomography (PET)-Galligas ventilation imaging for 19 lung cancer patients. Spatial Spearman correlation between TriSwinUNETR lobe ventilation and ground-truth PET-Galligas ventilation was calculated for each patient.Results: TriSwinUNETR achieved a state-of-the-art mean Dice score of 93.72% (RUL: 93.49%, RML: 85.78%, RLL: 95.65%, LUL: 97.12%, LLL: 96.58%), outperforming best-reported accuracy of 92.81% for the lobe segmentation task. CTVI calculations yielded a median Spearman correlation coefficient of 0.9 across 19 cases, with 13 cases exhibiting correlations of at least 0.5, indicating strong agreement with PET-Galligas ventilation.Our TriSwinUNETR pipeline demonstrated superior performance in the lobe segmentation task, while our segmentation-based CTVI exhibited strong agreement with PET-Galligas ventilation. Moreover, as our approach leverages deep-learning for segmentation, it provides interpretable ventilation results and facilitates quality assurance, thereby reducing reliance on DIR.

    Keywords: Lobe segmentation, CT-Ventilation, artificial intelligence, Transformer networks, functional radiotherapy

    Received: 02 Aug 2024; Accepted: 27 Jan 2025.

    Copyright: © 2025 Nomura, Luong, Prakash, Alemand, Bhowmick, Ali, Ren, Rehani, Nair, Castillo, Vinogradskiy and Castillo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Edward Castillo, The University of Texas at Austin, Austin, United States

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.