AUTHOR=Shu Xingyi , Zeng Chengcheng , Zhu Yanfei , Chen Yuqing , Huang Xiao , Wei Ruili TITLE=Screening of pathologically significant diagnostic biomarkers in tears of thyroid eye disease based on bioinformatic analysis and machine learning JOURNAL=Frontiers in Cell and Developmental Biology VOLUME=12 YEAR=2024 URL=https://www.frontiersin.org/journals/cell-and-developmental-biology/articles/10.3389/fcell.2024.1486170 DOI=10.3389/fcell.2024.1486170 ISSN=2296-634X ABSTRACT=Background

Lacrimal gland enlargement is a common pathological change in patients with thyroid eye disease (TED). Tear fluid has emerged as a new source of diagnostic biomarkers, but tear-based diagnostic biomarkers for TED with high efficacy are still lacking.

Objective

We aim to investigate genes associated with TED-associated lacrimal gland lesions. Additionally, we seek to identify potential biomarkers for diagnosing TED in tear fluid.

Methods

We obtained two expression profiling datasets related to TED lacrimal gland samples from the Gene Expression Omnibus (GEO). Subsequently, we combined the two separate datasets and conducted differential gene expression analysis and weighted gene co-expression network analysis (WGCNA) on the obtained integrated dataset. The genes were employed for Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The genes were intersected with the secretory proteins profile to get the potential proteins in the tear fluid. Machine learning techniques were then employed to identify optimal biomarkers and develop a diagnostic nomogram for predicting TED. Finally, gene set enrichment analysis (GSEA) and immune infiltration analysis were conducted on screened hub genes to further elucidate their potential mechanisms in TED.

Results

In our analysis of the integrated TED dataset, we identified 2,918 key module genes and 157 differentially expressed genes and finally obtained 84 lacrimal-associated key genes. Enrichment analysis disclosed that these 84 genes primarily pertain to endoplasmic reticulum organization. After intersecting with the secretory proteins, 13 lacrimal gland-associated secretory protein genes (LaSGs) were identified. The results from machine learning indicated the substantial diagnostic value of dyslexia associated gene (KIAA0319) and peroxiredoxin4 (PRDX4) in TED-associated lacrimal gland lesions. The two hub genes were chosen as candidate biomarkers in tear fluid and employed to establish a diagnostic nomogram. Furthermore, single-gene GSEA results and immune cell infiltration analysis unveiled immune dysregulation in the lacrimal gland of TED, with KIAA0319 and PRDX4 showing significant associations with infiltrating immune cells.

Conclusions

We uncovered the distinct pathophysiology of TED-associated lacrimal gland enlargement compared to TED-associated orbital adipose tissue enlargement. We have demonstrated the endoplasmic reticulum-related pathways involved in TED-associated lacrimal gland lesions and established a diagnostic nomogram for TED utilizing KIAA0319 and PRDX4 through integrated bioinformatics analysis. This contribution offers novel insights for non-invasive, prospective diagnostic approaches in the context of TED.