Skip to main content

ORIGINAL RESEARCH article

Front. Comput. Sci.

Sec. Computer Vision

Volume 7 - 2025 | doi: 10.3389/fcomp.2025.1463006

Integrating Pyramid Vision Transformer and Topological Data Analysis for Brain Tumor

Provisionally accepted
Dhananjay Laxmikant Joshi Dhananjay Laxmikant Joshi 1*Bhupesh Kumar Singh Bhupesh Kumar Singh 1Kapil Kumar Kapil Kumar 2Nitin S Choubey Nitin S Choubey 3
  • 1 Amity University Jaipur, Jaipur, India
  • 2 Guru Ghasidas Vishwavidyalaya, Bilaspur, Chhattisgarh, India
  • 3 SVKM's Narsee Monjee Institute of Management Studies, Mumbai, Maharashtra, India

The final, formatted version of the article will be published soon.

    Accurate Brain Tumor (BT) classification is crucial due to the complex and varied nature of tumors. This paper presents a novel approach for BT classification that combines a Pyramid Vision Transformer (PVT) with an adaptive deformable attention mechanism and Topological Data Analysis (TDA) to address the complexities of BT detection. While PVT and deformable attention mechanisms have been explored in prior works, this study introduces several key innovations to enhance their performance for medical image analysis. Specifically, we propose an adaptive deformable attention mechanism that dynamically adjusts receptive fields based on tumor complexity, allowing for better focus on critical regions in MRI images. Additionally, we introduce an adaptive sampling rate with hierarchical dynamic position embeddings, enabling context-aware feature extraction across multiple scales. The framework further improves feature diversity through an offset group mechanism that partitions feature channels into specialized groups. To integrate local and global contexts, we employ a hierarchical deformable attention strategy, refining the feature representation. Topological insights from TDA, applied to preprocessed images, facilitate the extraction of meaningful patterns for classification. The BT classification is performed using a Random Forest Classifier. Extensive testing on the Figshare database demonstrates the effectiveness of the proposed methodology, yielding an accuracy of 99.2%, recall of 99.35%, precision of 98.9%, F1-score of 99.12%, Mathews Correlation Coefficient (MCC) of 0.98, LogLoss of 0.05 and an execution time of just 6 seconds. These results highlight the method's ability to combine detailed feature extraction with topological insights, significantly advancing the accuracy and efficiency of BT classification and offering a promising tool for improving diagnostic outcomes.

    Keywords: adaptive PVT, BT classification, brain tumor, Deep feature extraction, Giotto-TDA, Random forest classifier 2

    Received: 11 Jul 2024; Accepted: 11 Mar 2025.

    Copyright: © 2025 Joshi, Singh, Kumar and Choubey. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Dhananjay Laxmikant Joshi, Amity University Jaipur, Jaipur, India

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more