Ovarian cancer (OC) is still the leading aggressive and lethal disease of gynecological cancers, and platinum-based regimes are the standard treatments. However, nearly 20%–30% of patients with OC are initial platinum resistant (IPR), and there is a lack of valid tools to predict whether they will be primary platinum resistant or not prior to chemotherapy.
Transcriptome data from The Cancer Genome Atlas (TCGA) was downloaded as the training data, and transcriptome data of GSE15622, GSE102073, GSE19829, and GSE26712 were retrieved from Gene Expression Omnibus (GEO) as the validation cohorts. Differentially expressed genes (DEGs) were selected between platinum-sensitive and platinum-resistant patients from the training cohort, and multiple machine-learning algorithms [including random forest, XGboost, and least absolute shrinkage and selection operator (LASSO) regression] were utilized to determine the candidate genes from DEGs. Then, we applied logistic regression to establish the IPR signature based on the expression. Finally, comprehensive clinical, genomic, and survival feature were analyzed to understand the application value of the established IPR signature.
A total of 532 DEGs were identified between platinum-resistant and platinum-sensitive samples, and 11 of them were shared by these three-machine learning algorithms and utilized to construct an IPR prediction signature. The area under receiver operating characteristic curve (AUC) was 0.841 and 0.796 in the training and validation cohorts, respectively. Notably, the prediction capacity of this signature was stable and robust regardless of the patients’ homologous recombination deficiency (HRD) and mutation burden status. Meanwhile, the genomic feature was concordant between samples with high- or low-IPR signature, except a significantly higher prevalence of gain at Chr19q.12 (regions including
Our study established an IPR signature based on the expression of 11 genes that could stably and robustly distinguish OC patients with IPR and/or poor outcomes, which may guide therapeutic regimes tailoring.