Skip to main content

ORIGINAL RESEARCH article

Front. Oncol.
Sec. Cancer Imaging and Image-directed Interventions
Volume 15 - 2025 | doi: 10.3389/fonc.2025.1456563

Improving Diagnostic Precision in Thyroid Nodule Segmentation from Ultrasound Images with a Self-Attention Mechanism-based Swin U-Net Model

Provisionally accepted
Changan Yang Changan Yang 1*Muhammad Awais Ashraf Muhammad Awais Ashraf 2Mudassar Riaz Mudassar Riaz 3*Hongyuan Huang Hongyuan Huang 4*Yueqin Xu Yueqin Xu 4*
  • 1 Department of Thyroid and Breast Surgery, Jinjiang Municipal Hospital, Quanzhou, Fujian Province, 362200, China, Quanzhou, Fujian Province, China
  • 2 School of Information Engineering, Chang’an University, Xi’an, Shaanxi Province,710000, PR China, Xian, China
  • 3 Central South University, Changsha, China
  • 4 Shanghai Sixth People's Hospital Fujian Campus, Jinjiang City, Quanzhou, Fujian, China, Quanzhou, China

The final, formatted version of the article will be published soon.

    Background: Accurate segmentation of thyroid nodules in ultrasound imaging remains a significant challenge in medical diagnostics, primarily due to edge blurring and substantial variability in nodule size. These challenges directly affect the precision of thyroid disorder diagnoses, which are crucial for metabolic and hormonal regulation. Methods: This study proposes a novel segmentation approach utilizing a Swin U-Net architecture enhanced with a self-attention mechanism. The model integrates residual and multiscale convolutional structures in the encoder path, with long skip connections feeding into an attention module to improve edge preservation and feature extraction. The decoder path employs these refined features to achieve precise segmentation. Comparative evaluations were conducted against traditional models, including U-Net and DeepLabv3+. Results: The Swin U-Net model demonstrated superior performance, achieving an average Dice Similarity Coefficient (DSC) of 0.78, surpassing baseline models such as U-Net and DeepLabv3+. The incorporation of residual and multiscale convolutional structures, along with the use of long skip connections, effectively addressed issues of edge blurring and nodule size variability. These advancements resulted in significant improvements in segmentation accuracy, highlighting the model's potential for addressing the inherent challenges of thyroid ultrasound imaging.The enhanced Swin U-Net architecture exhibits notable improvements in the robustness and accuracy of thyroid nodule segmentation, offering considerable potential for clinical applications in thyroid disorder diagnosis. While the study acknowledges dataset size limitations, the findings demonstrate the effectiveness of the proposed approach. This method represents a significant step toward more reliable and precise diagnostics in thyroid disease management, with potential implications for enhanced patient outcomes in clinical practice.

    Keywords: Swin U-net, image segmentation, deep learning, image dataset, thyroid, Ultrasound images

    Received: 03 Jul 2024; Accepted: 02 Jan 2025.

    Copyright: © 2025 Yang, Ashraf, Riaz, Huang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Changan Yang, Department of Thyroid and Breast Surgery, Jinjiang Municipal Hospital, Quanzhou, Fujian Province, 362200, China, Quanzhou, Fujian Province, China
    Mudassar Riaz, Central South University, Changsha, China
    Hongyuan Huang, Shanghai Sixth People's Hospital Fujian Campus, Jinjiang City, Quanzhou, Fujian, China, Quanzhou, China
    Yueqin Xu, Shanghai Sixth People's Hospital Fujian Campus, Jinjiang City, Quanzhou, Fujian, China, Quanzhou, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.