AUTHOR=Ma Yirong , Lai Junyu , Wan Qiang , Sun Liqiang , Wang Yang , Li Xingliang , Zhang Qinhe , Wu Jianguang TITLE=Exploring the common mechanisms and biomarker ST8SIA4 of atherosclerosis and ankylosing spondylitis through bioinformatics analysis and machine learning JOURNAL=Frontiers in Cardiovascular Medicine VOLUME=11 YEAR=2024 URL=https://www.frontiersin.org/journals/cardiovascular-medicine/articles/10.3389/fcvm.2024.1421071 DOI=10.3389/fcvm.2024.1421071 ISSN=2297-055X ABSTRACT=Background

Atherosclerosis (AS) is a major contributor to cerebrovascular and cardiovascular events. There is growing evidence that ankylosing spondylitis is closely linked to AS, often co-occurring with it; however, the shared pathogenic mechanisms between the two conditions are not well understood. This study employs bioinformatics approaches to identify common biomarkers and pathways between AS and ankylosing spondylitis.

Methods

Gene expression datasets for AS (GSE100927, GSE28829, GSE155512) and ankylosing spondylitis (GSE73754, GSE25101) were obtained from the Gene Expression Omnibus (GEO). Differential expression genes (DEGs) and module genes for AS and ankylosing spondylitis were identified using the Limma R package and weighted gene co-expression network analysis (WGCNA) techniques, respectively. The machine learning algorithm SVM-RFE was applied to pinpoint promising biomarkers, which were then validated in terms of their expression levels and diagnostic efficacy in AS and ankylosing spondylitis, using two separate GEO datasets. Furthermore, the interaction of the key biomarker with the immune microenvironment was investigated via the CIBERSORT algorithm, single-cell analysis was used to identify the locations of common diagnostic markers.

Results

The dataset GSE100927 contains 524 DEGs associated with AS, whereas dataset GSE73754 includes 1,384 genes categorized into modules specific to ankylosing spondylitis. Analysis of these datasets revealed an overlap of 71 genes between the DEGs of AS and the modular genes of ankylosing spondylitis. Utilizing the SVM-RFE algorithm, 15 and 24 central diagnostic genes were identified in datasets GSE100927 and GSE73754, respectively. Further validation of six key genes using external datasets confirmed ST8SIA4 as a common diagnostic marker for both conditions. Notably, ST8SIA4 is upregulated in samples from both diseases. Additionally, ROC analysis confirmed the robust diagnostic utility of ST8SIA4. Moreover, analysis through CIBERSORT suggested an association of the ST8SIA4 gene with the immune microenvironment in both disease contexts. Single-cell analysis revealed that ST8SIA4 is primarily expressed in Macrophages, Monocytes, T cells, and CMPs.

Conclusion

This study investigates the role of ST8SIA4 as a common diagnostic gene and the involvement of the lysosomal pathway in both AS and ankylosing spondylitis. The findings may yield potential diagnostic biomarkers and offer new insights into the shared pathogenic mechanisms underlying these conditions.