Introduction

AUTHOR=Zuo Qiankun , Shi Zhengkun , Liu Bo , Ping Na , Wang Jiangtao , Cheng Xi , Zhang Kexin , Guo Jia , Wu Yixian , Hong Jin 

TITLE=Multi-resolution visual Mamba with multi-directional selective mechanism for retinal disease detection

JOURNAL=Frontiers in Cell and Developmental Biology

VOLUME=12

YEAR=2024

URL=https://www.frontiersin.org/journals/cell-and-developmental-biology/articles/10.3389/fcell.2024.1484880

DOI=10.3389/fcell.2024.1484880

ISSN=2296-634X

ABSTRACT=<sec><title>Introduction</title><p>Retinal diseases significantly impact patients’ quality of life and increase social medical costs. Optical coherence tomography (OCT) offers high-resolution imaging for precise detection and monitoring of these conditions. While deep learning techniques have been employed to extract features from OCT images for classification, convolutional neural networks (CNNs) often fail to capture global context due to their focus on local receptive fields. Transformer-based methods, on the other hand, suffer from quadratic complexity when handling long-range dependencies.</p></sec><sec><title>Methods</title><p>To overcome these limitations, we introduce the Multi-Resolution Visual Mamba (MRVM) model, which addresses long-range dependencies with linear computational complexity for OCT image classification. The MRVM model initially employs convolution to extract local features and subsequently utilizes the retinal Mamba to capture global dependencies. By integrating multi-scale global features, the MRVM enhances classification accuracy and overall performance. Additionally, the multi-directional selection mechanism (MSM) within the retinal Mamba improves feature extraction by concentrating on various directions, thereby better capturing complex, orientation-specific retinal patterns.</p></sec><sec><title>Results</title><p>Experimental results demonstrate that the MRVM model excels in differentiating retinal images with various lesions, achieving superior detection accuracy compared to traditional methods, with overall accuracies of 98.98\% and 96.21\% on two public datasets, respectively.</p></sec><sec><title>Discussion</title><p>This approach offers a novel perspective for accurately identifying retinal diseases and could contribute to the development of more robust artificial intelligence algorithms and recognition systems for medical image-assisted diagnosis.</p></sec>