Polycystic ovary syndrome (PCOS) is defined by oligo/anovulation, hyperandrogenism, and polycystic ovaries with uncertain pathogenesis. The proteome represents a substantial source of therapeutic targets, and their coding genes may elucidate the mechanisms underlying PCOS. However, reports on the profiles of the human plasma protein-coding genes and PCOS are limited. Here, we aimed to investigate novel biomarkers or drug targets for PCOS by integrating genetics and the human plasma proteome.
Our study acquired the protein quantitative trait loci from DECODE Genetics, offering 4,907 proteins in 35,559 individuals while obtaining PCOS summary statistics by accessing the FinnGen biobank (1,639 cases and 218,970 controls) and the genome-wide association study catalog (797 cases and 140,558 controls). Herein, we sequentially used two-sample Mendelian randomization (MR) analyses and colocalization to verify the causal link between candidate proteins, their coding genes, and PCOS. Further PCOS data download was conducted by accessing the Gene Expression Omnibus and Zenodo platforms. Gene expression level analysis, pathway enrichment analysis, immune cell infiltration, and transcription factor prediction were performed, aiming at detecting specific cell types with enriched expression and exploring potential optimized treatments for PCOS.
MR analysis revealed 243 protein-coding genes with a causal relationship to PCOS risk, of which 12 were prioritized with the most significant evidence. Through colocalization analysis, three key genes, CUB domain-containing protein 1 (
Overall, three protein-coding genes (