AUTHOR=Li Hao , Tao Xiang , Liang Tuo , Jiang Jie , Zhu Jichong , Wu Shaofeng , Chen Liyi , Zhang Zide , Zhou Chenxing , Sun Xuhua , Huang Shengsheng , Chen Jiarui , Chen Tianyou , Ye Zhen , Chen Wuhua , Guo Hao , Yao Yuanlin , Liao Shian , Yu Chaojie , Fan Binguang , Liu Yihong , Lu Chunai , Hu Junnan , Xie Qinghong , Wei Xiao , Fang Cairen , Liu Huijiang , Huang Chengqian , Pan Shixin , Zhan Xinli , Liu Chong TITLE=Comprehensive AI-assisted tool for ankylosing spondylitis based on multicenter research outperforms human experts JOURNAL=Frontiers in Public Health VOLUME=11 YEAR=2023 URL=https://www.frontiersin.org/journals/public-health/articles/10.3389/fpubh.2023.1063633 DOI=10.3389/fpubh.2023.1063633 ISSN=2296-2565 ABSTRACT=Introduction

The diagnosis and treatment of ankylosing spondylitis (AS) is a difficult task, especially in less developed countries without access to experts. To address this issue, a comprehensive artificial intelligence (AI) tool was created to help diagnose and predict the course of AS.

Methods

In this retrospective study, a dataset of 5389 pelvic radiographs (PXRs) from patients treated at a single medical center between March 2014 and April 2022 was used to create an ensemble deep learning (DL) model for diagnosing AS. The model was then tested on an additional 583 images from three other medical centers, and its performance was evaluated using the area under the receiver operating characteristic curve analysis, accuracy, precision, recall, and F1 scores. Furthermore, clinical prediction models for identifying high-risk patients and triaging patients were developed and validated using clinical data from 356 patients.

Results

The ensemble DL model demonstrated impressive performance in a multicenter external test set, with precision, recall, and area under the receiver operating characteristic curve values of 0.90, 0.89, and 0.96, respectively. This performance surpassed that of human experts, and the model also significantly improved the experts' diagnostic accuracy. Furthermore, the model's diagnosis results based on smartphone-captured images were comparable to those of human experts. Additionally, a clinical prediction model was established that accurately categorizes patients with AS into high-and low-risk groups with distinct clinical trajectories. This provides a strong foundation for individualized care.

Discussion

In this study, an exceptionally comprehensive AI tool was developed for the diagnosis and management of AS in complex clinical scenarios, especially in underdeveloped or rural areas that lack access to experts. This tool is highly beneficial in providing an efficient and effective system of diagnosis and management.