AUTHOR=He Juanjuan , Liu Ahui , Shen Haofei , Jiang Yanbiao , Gao Min , Yu Liulin , Du Wenjing , Zhang Xuehong , Fu Fen TITLE=Shared diagnostic genes and potential mechanisms between polycystic ovary syndrome and recurrent miscarriage revealed by integrated transcriptomics analysis and machine learning JOURNAL=Frontiers in Endocrinology VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/endocrinology/articles/10.3389/fendo.2024.1335106 DOI=10.3389/fendo.2024.1335106 ISSN=1664-2392 ABSTRACT=Objective

More and more studies have found that polycystic ovary syndrome (PCOS) is significantly associated with recurrent spontaneous abortion (RSA), but the specific mechanism is not yet clear.

Methods

Based on the GEO database, we downloaded the PCOS (GSE10946, GSE6798 and GSE137684) and RSA (GSE165004, GSE26787 and GSE22490) datasets and performed differential analysis, weighted gene co-expression network (WGCNA), functional enrichment, and machine learning, respectively, on the datasets of the two diseases, Nomogram and integrated bioinformatics analysis such as immune infiltration analysis. Finally, the reliability of the diagnostic gene was verified by external verification and collection of human specimens.

Results

In this study, PCOS and RSA datasets were obtained from Gene Expression Omnibus (GEO) database, and a total of 23 shared genes were obtained by differential analysis and WGCNA analysis. GO results showed that the shared genes were mainly enriched in the functions of lipid catabolism and cell cycle transition (G1/S). DO enrichment revealed that shared genes are mainly involved in ovarian diseases, lipid metabolism disorders and psychological disorders. KEGG analysis showed significant enrichment of Regulation of lipolysis in adipocytes, Prolactin signaling pathway, FoxO signaling pathway, Hippo signaling pathway and other pathways. A diagnostic gene FAM166 B was obtained by machine learning and Nomogram screening, which mainly played an important role in Cellular component. GSEA analysis revealed that FAM166B may be involved in the development of PCOS and RSA by regulating the cell cycle, amino acid metabolism, lipid metabolism, and carbohydrate metabolism. CIBERSORT analysis showed that the high expression of FAM166 B was closely related to the imbalance of multiple immune cells. Further verification by qPCR suggested that FAM166 B could be used as a common marker of PCOS and RSA.

Conclusions

In summary, this study identified FAM166B as a common biomarker for PCOS and RSA, and conducted in-depth research and analysis of this gene, providing new data for basic experimental research and early prognosis, diagnosis and treatment of clinical diseases.