- 1Department of Biomaterials, FUTURE, Center for Functional Tissue Reconstruction, Faculty of Dentistry, University of Oslo, Oslo, Norway
- 2Laboratory of Reproductive Biology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- 3Department of Obstetrics and Gynecology, Seventh People’s Hospital of Shanghai University of Traditional Chinese Medicine, Shanghai, China
Background: Ovarian Serous Adenocarcinoma is a malignant tumor originating from epithelial cells and one of the most common causes of death from gynecological cancers. The objective of this study was to develop a prediction model based on extracellular matrix proteins, using artificial intelligence techniques. The model aimed to aid healthcare professionals to predict the overall survival of patients with ovarian cancer (OC) and determine the efficacy of immunotherapy.
Methods: The Cancer Genome Atlas Ovarian Cancer (TCGA-OV) data collection was used as the study dataset, whereas the TCGA-Pancancer dataset was used for validation. The prognostic importance of 1068 known extracellular matrix proteins for OC were determined by the Random Forest algorithm and the Lasso algorithm establishing the ECM risk score. Based on the gene expression data, the differences in mRNA abundance, tumour mutation burden (TMB) and tumour microenvironment (TME) between the high- and low-risk groups were assessed.
Results: Combining multiple artificial intelligence algorithms we were able to identify 15 key extracellular matrix genes, namely, AMBN, CXCL11, PI3, CSPG5, TGFBI, TLL1, HMCN2, ESM1, IL12A, MMP17, CLEC5A, FREM2, ANGPTL4, PRSS1, FGF23, and confirm the validity of this ECM risk score for overall survival prediction. Several other parameters were identified as independent prognostic factors for OC by multivariate COX analysis. The analysis showed that thyroglobulin (TG) targeted immunotherapy was more effective in the high ECM risk score group, while the low ECM risk score group was more sensitive to the RYR2 gene-related immunotherapy. Additionally, the patients with low ECM risk scores had higher immune checkpoint gene expression and immunophenoscore levels and responded better to immunotherapy.
Conclusion: The ECM risk score is an accurate tool to assess the patient’s sensitivity to immunotherapy and forecast OC prognosis.
Introduction
Ovarian cancer (OC) is one of the most common gynaecological malignancies. According to the Global Cancer Observatory of the World Health organization (WHO) international agency for research on cancer, a total of 207,252 new fatalities due to ovarian cancer were reported in 2020, placing it 14th out of 36 different types of tissue cancers (World Health Organization International Agency for Research on Cancer, 2020). Most ovarian malignancies originate from epithelial cells, and the most prevalent histological subtype of epithelial ovarian cancer is ovarian serous adenocarcinoma (Heintz et al., 2006). Early OC detection is the best treatment scenario, but as OC presents with nonspecific symptoms and reflects detection, most patients are given a stage III diagnosis, indicating that the disease has spread throughout the peritoneum and/or has involved the lymph nodes (Prat and FIGO Committee on Gynecologic Oncology, 2014). A multi-stage evaluation is necessary to manage OC, to determine personalized treatment, and to predict the presence of distant metastases, tumour stage and prognosis.
As a new treatment option, immunosuppressants, address the tumour microenvironment (TME) (Pitt et al., 2016). For ovarian cancer, this cutting-edge therapeutical approach is recently being studied and applied (Yang et al., 2022, 2023). Despite the fact that many variables have been demonstrated to predict the therapeutic effectiveness of immunosuppressant’s, the accuracy of this strategy still needs to be improved (Gibney et al., 2016). Tumour development, spread and invasion are all dependent on the TME (Schreiber et al., 2011; Lei et al., 2020), which contains multiple cell types, including stroma, vasculature, secretory factors, surrounding stroma and the internal environment of the tumour cells. As the TME is primarily determined by the genomic landscape of the tumour, several algorithms have been developed to predict tumour purity and estimate the abundance of tumour-infiltrating immune cells based on gene expression profiles (Tamborero et al., 2018). These include CIBERSORT, MCP, Xcell, EPIC, ESTIMATE, Timer, IPS, and QuantiSeq.
As an essential component of TME, the non-cellular network surrounding the cells, known as the extracellular matrix (ECM), is tightly associated to the pathophysiology of healthy and cancerous tissue (Frantz et al., 2010; Henke et al., 2020; Zhu et al., 2022). This renders ECM a crucial study niche for the initiation, progression, dissemination, and furthermore treatment and prognosis of epithelial ovarian cancer (Ween et al., 2011). The metabolic disruption of various ECM protein-related factors derived from epithelial cells during tumorigenesis leads to the formation of a pro-tumorigenic microenvironment that favors tumor growth and metastasis. This is followed by tumour cell-mediated ECM remodelling, which ultimately promotes the survival of tumour cells at the expense of healthy tissue (Zigrino et al., 2005). Therefore, ECM proteins, which have bidirectional effects on the generation, recurrence and metastasis of tumour cells (Valmiki et al., 2021 should be considered key players to the treatment and prognosis of tumours (Donelan et al., 2022; Zhu et al., 2022).
An artificial intelligence algorithm, Random Forest (RF) has been recently employed to predict disease progression by virtue of its high performance and interpretability (Wu et al., 2021). A convincing predictive model can be constructed by combining analysis of gene expression data with diagnostic and therapeutic data. This model could be effective at forecasting patient survival, the course of the tumour, and recurrence following various types of treatment (Lin et al., 2022; Miao et al., 2022). Big data machine learning may also also be applied. Despite the recent advances in machine learning methods for ovarian cancer survival analysis, integration of multi-omics data with immunotherapy targeting is an approach that has not been thoroughly explored (Henderson et al., 2016; Belotti et al., 2022). This approach could be advantageous for the identification of potential therapeutic targets and may lead to improved outcomes for ovarian cancer patients.
In this study we used artificial intelligence algorithms to integrate multifaceted omics data with immunotherapy targets in ovarian cancer. Specifically, we employed the Random Forest and Lasso algorithms to process gene expression and survival data from the TCGA database. The tumour risk score was calculated to construct features for predicting OC prognosis and immunotherapy efficacy.
Materials and methods
Datasets and data quality control
Transcriptome expression profiles, somatic mutation data and clinical survival data were downloaded from the TCGA database (Supplementary Table S1). FPKM expression data from the UCSC XENA Project (https://xenabrowser.net/datapages/), which included the TCGA cancer gene expression sequencing data, were analysed together to increase the reliability of data analysis. Normal ovary tissue transcriptome sequencing data from the GTEx database (https://www.gtexportal.org/home/) were used as representative normal/healthy tissue data. We utilized the immune cell markers used in the Tumour MicroEnvironment (TME) analysis following the method described at Bindea et al. (2013) and ECM-related gene information was obtained from Naba et al. (2016). Following quality control measures on gene expression data and somatic mutations (SNPs and small INDELs), we filtered out 373 valid sample samples from the pool of 758 valid patient survival datasets of the TCGA-OV collection.
Construction and evaluation of an ECM risk score model related to survival
The TCGA-OV data were randomly partitioned into a training set (n = 298) and a test set (n = 75). We used the randomForestSRC package (3.1.1) (Ishwaran et al., 2022) to down-size the 1068 ECM genes including survival information of OC patients. Further dimensionality reduction was performed by the Lasso algorithm in the glmnet package (4.1–2) (Friedman et al., 2010). Survival analysis of key genes in OV was performed with multivariate COX regression in the survival package (3.2–10) (Therneau, 2015).
Differential expression and enrichment analyses
Two groups of patients with high and low-risk scores were generated. To calculate the differential gene expression between cancer data and normal tissue data we used DESeq2 v.1.36.0 (Love et al., 2014) in R (4.2.1). We performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses using ClusterProfiler v.3.14.3 (Yu et al., 2012) in R (3.6.3). To find BP term enrichment, the Gene Set Enrichment Analysis (GSEA) (Subramanian et al., 2005) of ranked lists of differentially expressed genes was carried out. Significant enrichment in GSEA analysis is achieved when the False discovery rate (FDR) is 0.25 and an adjusted p-value of 0.05.
Tumour microenvironment (TME) and somatic mutation analyses of the TCGA-OV dataset
We used the “maftools” package (2.12.0) (Mayakonda et al., 2018) for the calculation and evaluation of somatic mutations for each patient. The “drugInteractions” function was employed to analyse the correspondence between mutated genes and currently available genetic drugs based on the DGIdb database (Griffith et al., 2013). We further used multiple algorithms built into the IOBR package (0.99.9) (Zeng et al., 2021) to assess the immune cell infiltration level, including B cells, CD4+ T cells, CD8+ T cells, neutrophils, macrophages and dendritic cells. Then we explored the variations in immune infiltration and somatic mutation between groups with high and low-risk scores.
Statistical analysis
The differences between the two datasets were determined using the Mann-Whitney U test (also known as the Wilcoxon rank sum test) and independent t-test. To evaluate between-group differences, one-way analysis of variance (ANOVA) with the Kruskal–Wallis test and chi-square test were utilised. Correlation analysis was conducted using non-parametric Spearman correlation tests. The connection between potential genes and overall survival was examined using a single-variable Cox regression analysis (OS). The difference was shown to be statistically significant when p < 0.05 was used (p < 0.05 *; p < 0.01 **; p < 0.001 ***).
Results
Screening and validation of ECM-related prognostic key genes
The clinicopathological characteristics of 379 OC patients in the TCGA database, are summarized in Table 1. The random forest algorithm was used to decrease the training set. 147 genes were screened out of 1068 ECM-related genes, and the accuracy of this survival prediction model was validated using the test set. The receiver operating characteristic curve (ROC) for the training set and test set were plotted separately, with the area under the curve (AUC) of 0.810 for the training set and 0.684 for the test set (Figure 1A).
FIGURE 1. Construction of ECM risk score model (A) AUC for random forest train model (blue) and test model (red); (B) Lasso dimensionality reduction for random forest prognosis model; (C) Locus plot of all genes in random forest prognosis model; (D) Nomogram survival plot for 15 key prognosis genes; (E) KM survival plot for high/low ECM risk score group.
The results of “lambda.min” of the Lasso algorithm were employed and 15 key genes closely related to prognosis were obtained (Figures 1B, C). These were Ameloblastin (AMBN), Chemokine (C-X-C motif) ligand 11 (CXCL11), Peptidase inhibitor 3 (PI3), Chondroitin sulfate proteoglycan 5 (CSPG5), Transforming growth factor (TGFBI), Tolloid-like 1 (TLL1), Endothelial cell-specific molecule 1 (ESM1), Matrix metallopeptidase 17 (MMP17), Angiopoietin-like 4 (ANGPTL4), Fibroblast growth factor 23 (FGF23), Hemicentin 2 (HMCN2), Interleukin 12A (IL12A), C-type lectin domain family 5, member A (CLEC5A), FRAS1 related extracellular matrix protein 2 (FREM2), serine protease 1 (PRSS1), and the gen prediction model with the risk coefficient of 15 genes given by the Lasso algorithm was constructed:
The Cox model was used to verify the predictive ability of the 15 key genes for the 1-year, 3-year and 5-year overall survival (OS), and the key genes fit well with the ideal line at the three-time points (Figure 1D). The TCGA-OV sample was divided into two groups with high and low-risk scores based on the average risk score. Furthermore, the Kaplan Meier (KM) curves of high/low-risk score groups were plotted, showing a significant difference between the high and low-risk score groups (Figure 1E). Additional multifactorial Cox models were used to analyse the relationship between the 15 key genes and ovarian cancer OS (Table 2), and we found that AMBN, CXCL11, CLEC5A, CSPG5 FREM2, MMP17, and PI3 were independent prognostic factors for ovarian cancer.
Differential expression analysis and functional enrichment of high and low ECM risk score groups
The ECM risk score, survival information, and one-to-one correspondence to the expression of the 15 key genes for each sample in TCGA-OV are presented in Figure 2A. The results of differential gene expression of high vs. the low-risk score group showed that 1004 genes were significantly upregulated (logFC > 0.4, adj. p < 0.05), and 378 genes were significantly downregulated (Figure 2B).
FIGURE 2. Differential analysis of function between high/low ECM risk score group (A) Information of sample group based on ECM risk score and 15 key prognosis gene expression heatmap. (B) Volcano map of differential gene expression analysis in TCGA-OV. (C) Principal component analysis (PCA cluster) based on the 15 key prognostic genes in TCGA-OV. (D) 15 key genes expressed in normal ovary and TCGA-OV. (E) 15 key genes expression in ECM high/low-risk score groups. The genes BCRA2, MUC1 and MUC16 (in red frames) have been functionally characterized in other studies, for their prognostic role in OC, and used as a reference.
We documented that there is a substantial difference in the extracellular matrix-related processes in the enrichment of GOKEGG functions (Supplementary Figure S1A). Additionally, it was shown that biological processes associated with immune cells differed dramatically (GO:0071621, GO:0043030 et al.). Two immune-related pathways were revealed to be blocked in the high ECM risk score group in the GSEA results (Supplementary Figure S1C). Based on the 15 key prognostic genes in the TCGA-OV at the principal component analysis (PCA cluster), there was little difference between the high and low-risk subgroups in the PC1 and PC2 dimensions (Figure 2C).
We also examined the expression of the 15 key genes in normal ovarian tissue vs. the TCGA-OV collection, and in low/high ECM risk score groups (Figures 2D, E). In the expression analysis of normal vs. tumour tissues, only PRSS1 was not significantly differentially expressed in normal versus tumour tissues. The expression levels of TLL1, HMCN2, FREM2 and MMP17 were significantly higher in normal ovarian tissues than in tumour tissues. AMBN, TGFBI, CSPG5, PI3, CXCL11, ESM1, FGF23, ANGPTL4, CLEC5A and IL12A all showed significantly higher expression in tumour tissues samples.
In the differential expression analysis of low/high ECM risk score groups, AMBN, IL12A and FREM2 were not statistically different, whereas PI3, TGFBI, TLL1, HMCN2, MMP17, CLEC5A, ANGPTL4, FGF23 had higher expression in the high-risk group and CXCL11, CSPG5, ESM1, PRSS1 were highly expressed in the low-risk group.
The genes BCRA2, MUC1 and MUC16 (in the red-framed rectangles in Figures 1C, D, and the heat map), were also assessed to supplement our analysis with three genes from the same dataset, previously functionally characterized for their prognostic role in OC (Wang et al., 2007; Zhai et al., 2020; Custódio et al., 2022). The genes are also.
Assessing the role of ECM risk score in tumour immune cell infiltration and immunotherapy response
The analysis using almost all algorithms, documented that CD8 T cells showed a significant difference, with lower levels of infiltration in the high-risk group than in the low-risk group. In the high-risk group, the signature score of CD4 T memory resting cells was higher, and lower in all other T cells (Figure 3A). Neutrophils scored variable results among the four algorithms: there were group differences in the infiltration levels of neutrophils in the CIBERSORT and MCPcounter algorithms, with both algorithms showing higher levels in the high-risk group (Figure 3B). According to the EPIC, MCPcounter, and xCell algorithms, the high-risk group had higher numbers of cancer-associated fibroblasts (CAFs) (Figure 3C).
FIGURE 3. Comparison of immune cell infiltration between high/low ECM risk score group groups in TCGA-OV. Comparison with several algorithms for T cell (A) infiltration, Neutrophil (B), cancer-associated fibroblast (CAFs) (C), Immune microenvironment score (D) infiltration, expression levels of 8 immune checkpoint genes in high/low ECM risk score group in TCGA-OV (E), and IPS score in high/low ECM risk score group in TCGA-OV (F).
Both the xCell and ESTIMATE algorithms indicated a lower immune microenvironment score for the high-risk group when computing the immune microenvironment score. The high-risk group displayed a higher stromal score in the ESTIMATE algorithm, indicating the presence of more stromal cells (Figure 3D).
The analysis of B cells also showed high variation among the algorithms used: significant differences between groups were only documented by the xCell algorithm; naive B cells and plasma cells showed group differences under the CIBERSORT algorithm but not when the xCell algorithm was employed (Supplementary Figure S2). There was no discernible difference between the two groups in monocytes (Supplementary Figure S2). Only xCell revealed group differences in DC cells (Supplementary Figure S2). NK cells only showed between-group differences under the MCPcounter and quantiseq algorithms, but there was an opposite trend: NK cells showed relatively low levels in the high-risk group under the MCPcounter algorithm but relatively high levels in the high-risk group under the quantiseq algorithm (Supplementary Figure S2). The high-risk group’s Macrophage M1 levels were only marginally different to the low-risk group according to the CIBERSORT and xCell results, and there was no difference between groups when the EPIC method was used (Supplementary Figure S2).
We also extracted the expression levels of eight immunological checkpoint genes (Figure 3E). The expression levels of six immune checkpoint genes (TIGIT, CD274, PDCD1, CTLA4, LAG3, and PDCD1LG2) was higher in the low-risk group than in the high-risk group, with the exception of SIGLEC15 and HAVCR2. The IPS score for the major histocompatibility complex (MHC), and for senescent cells (SC) was greater in the low-risk group than in the high-risk group. Endothelial cells (EC) IPS score did not differ statistically significantly between the two groups. However, the high-risk group had a higher Classical Pathway (CP) IPS score than the low-risk group. Both the aggregated z-score (AZ) and the weighted total IPS showed that the low-risk group was higher than the high-risk group (Figure 3F).
Validation of the prognostic function of ECM risk score in the TCGA-pan-cancer dataset
TCGA pan-cancer data with survival information were used to validate the ECM risk score. A total of 9162 “Primary Solid Tumour” data with both gene expression data and survival data were included in the analysis. We screened all adenocarcinoma expression data and survival data as a validation dataset. 2084 samples meeting the criteria were extracted, of which 1580 samples carried information on initial treatment outcome. We found that the low-risk group had a higher initial treatment Complete Response (CR) and Partial Response (PR), there was no significant difference in the number of patients with Progressive Disease (PD) between the low- and high-risk groups, and the number of patients with Stable Disease (SD) was significantly higher in the high-risk group than in the low-risk group (Figure 4A). We also analysed the expression levels of eight immune checkpoint genes in the high/low ECM risk score group (Figure 4B). Only four immune checkpoint genes showed significant differences between the two groups. Fifteen ECM’ key genes were extracted from all gene expression data, and the ECM risk score was calculated for each patient. The expression levels of 15 ECM key genes were differentially expressed in both the high and low-risk groups (Figure 4C). Computes the predicted survivor function for a Cox proportional hazards model and plots the KM curve (Figure 4D). The ECM risk score was found to be a good predictive tool for overall survival prognosis in the adenocarcinoma data. However, it was not adequate in effectively predicting for OS between the high and low-risk groups after 4000 days.
FIGURE 4. Validation of ECM risk key genes in TCGA Adenocarcinoma. First-course treatment outcome of high/low ECM risk score group in TCGA-Adenocarcinoma (A). Expression levels of 8 immune checkpoint genes (B) and 15 ECM key genes (C) in high/low ECM risk score group in TCGA-Adenocarcinoma. KM survival plot for high/low ECM risk score group in TCGA-Adenocarcinoma (D).
Relationship between ECM risk score and tumour mutation burden (TMB)
We further investigated the connection between the ECM risk score and TMB because OC is marked by a high degree of somatic mutation. Missense Mutation was the main component of somatic mutation in the high/low ECM risk score group. In the single nucleotide variant (SNV) analysis showed that the highest rate of C > T was observed in both groups. Variants per sample of the high ECM risk score group were lower than in the low ECM risk score group (Supplementary Figures S3A, B). In both high/low ECM risk score groups, CSMD3, TTN, TP53, FLG2, MUC16, FLG and FAT3 were found in the top 20 mutated genes (Supplementary Figures S3C, D). In comparison to the high-risk group, the mutation rates in TP53, TTN and RYR2 were higher in the low-risk group, while those of CSMD3, USH2A and FLG2 were nearly identical (Supplementary Figures S3E, F).
In the mutation Exclusive/Co-occurring analysis of the top 20 mutation genes, we found 11 pairs of genes with Co-occurring relationships in each of the high/low ECM risk score groups but 2 pairs of Exclusive in the low ECM risk score group (Figures 5A, B). We also examined the mutation status of the elements of eight signalling pathways that have been demonstrated to be crucial in the development of tumours (Sanchez-Vega et al., 2018). We found that the number of genes affected by somatic mutations in most signalling pathways was approximately the same between the two groups, except for RTK-RAS, NOTCH, WNT, and PIK2 (Figures 5C, D). The expression levels of all somatic mutations were analysed between the high/low ECM risk score groups, and we found statistically significant differences in CSMD1, FRMPD1, IL1PARL2 and PKHD1. All four genes showed a higher mutation rate in the low ECM risk score group (Figures 5D, E). Based on the differences in somatic mutations between the two groups, we enriched the analysis for drug-gene interactions, and the “Druggable Genome: was found to be highly enriched within both groups (Figure 5F, G).
FIGURE 5. Differential analysis of somatic mutations between high/low ECM risk score group in TCGA-OV. The relation of the top 25 mutated genes in the high (A) and low (B) ECM risk score groups. Analysis of cancer-related key pathway components affected by somatic gene mutations in the high (C) and low (D) ECM risk score groups. Differential analysis of somatic gene mutatios between high/low ECM risk score group (E). Oncogenic signalling pathways enrichment analysis in high (F) and low (G) ECM risk score group.
Additionally, we documented that the low ECM risk score group was more responsive to RYR2 gene-related immunotherapy, whereas the high ECMs risk score group may be more responsive to TG-related immunotherapeutics (Supplementary Table S2). We also analysed the mutations in 15 key genes (Supplementary Figure S4).
Discussion
Ovarian cancer is a malignant tumour with a high mortality rate. Because of its insidious onset, it is usually late-stage when obvious clinical symptoms appear (Prat J and FIGO Committee on Gynecologic Oncology, 2014). Patient prognosis is thus based on an accurate and reliable assessment. As the extracellular matrix is closely linked to epithelial cells, we aimed to construct a prognostic prediction model based on extracellular matrix proteins to evaluate the prognostic survival of patients with ovarian serous adenocarcinomas. We successfully filtered 15 key genes out of 1068 extracellular matrix-associated proteins, with excellent predictive ability on ovarian serous adenocarcinoma prognosis using the Random Forest and the Lasso algorithms.
We used the random forest algorithm as is a popular machine learning technique with documented outstanding performance in a wide range of predictive modeling tasks, including cancer prognosis prediction (Toth et al., 2019; Li Y et al., 2020). When analyzing tumor sequencing data, the relationship between variables and outcomes can be complex due to the volume of data. Random forest can handle a large number of input variables without overfitting, which has been a challenge for traditional models such as logistic regression or decision trees (Maroco et al., 2011; Lan et al., 2020). Additionally, traditional linear models are unable to handle the nonlinear relationship between a large amount of sequencing data and survival outcomes, but random forest can address this issue (Lee and Lim, 2019). Compared to support vector machines (SVM), random forest is less sensitive to outliers and missing data, which is important in cancer prognosis prediction, where data quality may vary and missing data is common (Pelckmans et al., 2005; Lee and Lim, 2019).
Among the 15 key genes generated by our our analysis, CSPG5, CXCL11 and ESM1 mRNAs were abundantly expressed in OC tissue compared to normal tissue. Cancer cells, fibroblasts, endothelial cells, and immune cells such as leukocytes, monocytes, and dendritic cells are primarily responsible for CXCL11 production (Gao and Zhang, 2021) CXCL11 is an effector chemokine regulating T cell recruitment that promotes effector immune cells (e.g., CD8T cells, Th1 cells, TH17 cells and antigen-presenting cells). Studies have shown that induction of CXCL10 and CXCL11 expression in breast cancer cells enhances the infiltration of CD8 T cells (Liu et al., 2011; Gao et al., 2019). In addition to its influence on tumour progression through its angiostatic effects (Strieter et al., 2006), CXCL11 is part of the CXCR7/CXCL11 axis that was shown to induce the epithelial–mesenchymal transition and metastatic behaviour of OC cells under ERα control (Benhadjeba et al., 2018). ESM1 is a soluble proteoglycan expressed by vascular endothelial cells and is associated with inflammatory cell recruitment (Hung et al., 2020). Vascular endothelial dysfunction can be brought on by high levels of ESM1 (Kalantaridou et al., 2006; Rocha et al., 2014; Sun et al., 2019), whereas it has also been shown to be closely correlated with OC development and progression (Li et al., 2023). Our analysis showed that the tumour tissue had significantly higher ESM1 expression levels than normal ovarian tissue, which may be associated with abnormally elevated cell proliferation and tumour tissue revascularization. Furthermore, CSPG5, also known as Neuroglycan C (NGC), is a protein originally associated with extracellular matrix production in the nervous system (Pintér et al., 2020) and shown to decrease first and then increase following ischemic and hypoxic injury, presumably associated with ECM damage repair (Matsui et al., 2005). Of note, two recent studies showed that expression of CSPG5 was significantly correlated with the prognosis of patients with epithelial OC (Su et al., 2021; Wang et al., 2023). In our study, higher CSPG5, CXCL11 and ESM1 expression and inflammatory cell infiltration, especially of CD8 T cells, were present in the low-risk group. The high level of immune cell environment may explain the better prognosis for overall survival in the low-risk group.
Interestingly, two proteins commonly associated with bone biology and development were identified among the ECM prognostic markers. AMBN was discovered as a tooth enamel matrix protein, playing an important role in enamel, cranial and long bone development. It was however demonstrated that AMBN was among four factors that were potential independent prognostic factors for prostate cancer (Xu et al., 2018). In our multifactorial Cox analysis, AMBN was shown to be one of the independent prognostic factors for ovarian serous adenocarcinomas. Furthermore, the analysis revealed a prognostic role for FGF23, the bone-derived hormone secreted by osteoblasts and osteocytes (Guo and Yuan, 2015). Previous studies have shown its expression alterations in breast cancer (Cekin et al., 2020) and identified that serum or plasma FGF23 concentrations are elevated in patients with advanced stage epithellal ovarian cancer (Tebben et al., 2005). According to our data, FGF23 was expressed in the tumours, and its expression levels were higher in the high-risk group.
MMP17, PI3, TLL1, ANGPTL4, and TGFBI have all been previously associated with cancer. Our analysis found that transcripts of all these five genes were expressed at higher levels in the high-risk group than in the low-risk group. MMP17 has been associated with the maintenance of normal physiological function in vascular smooth muscle (Martín-Alonso et al., 2015) and a promotive effect on tumour cells (Paye et al., 2014). Additionally, it was shown that its expression was much higher in EOC patients than in pericarcinomatous tissues (Xiao et al., 2022). PI3, also known as elafin, is an elastase-specific inhibitor that directly affects tumour suppression by inhibiting elastase (Hunt et al., 2013). High levels of PI3 are associated with severe disease severity in various cancers (Hunt et al., 2013; Longatto-Filho et al., 2021), while another TCGA-OV analysis showed its prognostic value in OC (Li J et al., 2020). Additionally, high elafin expression has been associated with unfavorable OS but better immunotherapy responses (Lu et al., 2023). Recent studies have found that TGFBI CpG islands are hypermethylated in adjacent normal colon tissue, with the corresponding sequences showing hypomethylation in colon cancer tissue, and that higher TGFBI levels are associated with poorer prognosis (Zhang H. et al., 2019). In mammals, Tolloid-like (mTLL)-1 is a BMP-1-associated protease, and BMP1/TLL1 is involved in the process of tissue remodelling in the ovary, assisting in the maturation of pre-collagen molecules and the deposition of collagen fibres (Ohnishi et al., 2005). In hepatocellular carcinoma (Matsuura et al., 2017), SNP may impact the splicing of TLL1 mRNA and result in short variants with high catalytic activity, speeding up the development of liver fibrosis and cancer. In a recent study, TGFBI, PI3, TLL1 and MMP17 were predicted to be among the 19 genes that comprise the TME-related high grade serous ovarian carcinoma prognostic genetic panel (Belotti et al., 2022). ANGPTL4 is regulated by peroxisome proliferator-activated receptor γ (PPARγ) (Aryal et al., 2019), who has been observed to be significantly increased in malignant ovarian tumours (grade 1, 2 and 3) compared to benign and borderline tumours (Zhang et al., 2005). It was also recently identified in a scRNA-seq study of ovarian cancer CAF ligands to epithelial cells (Carvalho et al., 2022). Finally, ANGPTL4 and TGFBI were identified both in a hypoxia risk model constructed to reflect the OC immune microenvironment in and predict prognosis (Wei et al., 2021), and among the genes that comprise an OC glycolysis-related gene signature (Zhang et al., 2021).
To further assess the relationship between the ECM risk score and the immune microenvironment, we assessed the abundance of multiple inflitrating cells in the immune microenvironment of these patients using multiple algorithms. Patients with more infiltrating and activated immune cells in TME may have better immunotherapeutic outcomes (Li et al., 2016). We observed higher levels of CD8 T cell infiltration in the low-risk group, suggesting enhanced immune surveillance via CD8+ T cells, while this implies a possible enhanced susceptibility to PD-1/PD-L1-targeted immune checkpoint therapies (Strickland et al., 2016; Iyer et al., 2021). Furthermore, the degree of macrophage infiltration was similar to both CD8 and CD4 T cell infiltration, with both showing high levels in the low-risk group. This is consistent with previous studies describing a positive correlation between T cells and macrophage infiltration levels (Desbois et al., 2020).
Previous studies have also shown that cancer cells regulate their local microenvironment to promote tumour survival, chemoresistance and evasion of immune surveillance (Kim et al., 2007) and that there is a tight association between malignant tumour cells and CAFs in promoting tumour growth and survival (Xing et al., 2010; Karagiannis et al., 2012; Chen et al., 2021). scRNA-seq analysis of high grade serous ovarian cancers, also showed that CAFs induce epithelial-mesenchymal transition (EMT) of tumor cells via TGFβ signaling, with consequent effects on chemoresistance and metastasis (Kan et al., 2020). In accordance with these, our results also showed that higher levels of CAFs were found in high-risk groups with poor prognosis.
It is known that the immune microenvironment immune cells, immunomodulating factors and immune checkpoint molecules are crucial for the immune escape of tumour cells (Charoentong et al., 2017; Zhang Y. et al., 2019). We thus developed an immunophenoscore (IPS) based on immune subpopulation infiltration and expression of immune regulatory molecules using the random forest to identify determinants of immunogenicity. Among several IPS subtypes tested, we found that the low-risk group had higher IPS and could benefit during treatment with immune checkpoint inhibitors. As the immunophenoscore is a surrogate to patients’ immunotherapeutic outcomes, our IPS results of the ECMs risk scores may only be considered of predictive value, and future studies will confirm their clinical importance.
We also explored the tumour mutation burden (TMB) changes in the TCGA-OV cohort. The ovarian cancer genome exhibits high levels of instability, as evidenced by functional cells (Stewart et al., 2011), copy number changes (Schwarz et al., 2015), and status of somatic mutations (Bashashati et al., 2013). TMB is the total number of substitutions and insertion/deletion mutations per megabase in the exon-coding region of the gene under evaluation in the tumour cell genome (Stenzinger et al., 2019). Somatic mutations may result in tumourigenesis and many somatic mutations can generate neoantigens facilitating anti-tumour immunity (Gubin et al., 2015). In a study on immunotherapy for lung cancer, researchers discovered that patients with PD-L1 1% but a subgroup of 10 mutations/Mb in the combination chemotherapy group had a better objective response rate (ORR) and median progression-free survival (median PFS, mPFS) with the immune combination regimen CheckMate 227 (Hellmann et al., 2018). This suggests that in the higher TMB population, PFS was better in the combination immunotherapy group than in the chemotherapy alone group, irrespective of PD-L1 expression. In our study, the mean TMB values were higher in the low-risk group than in the high-risk group, implying that the low-risk group may have more potential for immunotherapy. TP53, the gene encoding the tumour suppressor protein p53, is one of the most commonly mutated genes in human cancers, and driver mutations are prevalent in high-grade ovarian plasmacytoma (Ahmed et al., 2010). Chalmers and coworkers have shown that TP53 mutations were associated with high TMB (Chalmers et al., 2017). Our study similarly confirmed that in the TCGA-OV datasets TP53 mutations were the most frequent in the high- and low-risk groups, and that TP53 mutations were higher in the low-risk group than in the high-risk group. However, there are limitations to cohort-based studies. Most mutated genes are unique to each case, and in clinical treatment, patients should be treated based on their mutation sequencing results. Our analysis may provide theoretical support for the selection of immunological agents.
Our study aimed to investigate the role of the matrisome and the gene changes in the ECM in a widely studied, publicly available ovarian cancer transcriptomic and clinicopathological collection of patients. The role of matrisome has been regretfully understudied in this type or cancer that commonly goes undetected till it reaches high grades, as the ECM can influence drug resistance. We used bioinformatics and machine learning methods to investigate the TCGA-OV collection and identified several prognostic genes, some of which have also been identified by previous studies. Given the current cost-effectiveness of biotechnological approaches, rapid genetic testing tools are commonly promoted and widely applied in clinical diagnostics and treatment (Young and Argáez, 2019). These tools have improved accuracy and testing times have significantly shortened. Targeted multigene tests and genetic screening can be thus rapidly employed to assist diagnosis postoperatively and develop more effective treatment plans.
Conclusion
In conclusion, this study developed an ECM risk score prediction model to enable prognosis of patients with ovarian serous adenocarcinoma. We further identified the tumour microenvironment and somatic mutations using the TCGA-OV collection datasets. These results should be further validated with targeted future studies to evaluate their real predictive function and their use in personalized immunotherapy applications.
Data availability statement
Publicly available datasets were analyzed in this study. This data can be found here: https://xenabrowser.net/datapages/?cohort=GDC%20TCGA%20Ovarian%20Cancer%20(OV)&removeHub=https%3A%2F%2Fxena.treehouse.gi.ucsc.edu%3A443.
Ethics statement
Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
Conceptualization, TG, MZ and JER, methodology and data visualization TG and YW, software and data curation TG, formal analysis TG, MZ, and AS, draft preparation TG, MZ and AS, review, editing, supervision, project administration, and funding acquisition AS and JR. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the Research Council of Norway (287953).
Acknowledgments
TG would like to acknowledge financial support of the China Scholarship Council (CSC), the Faculty of Dentistry, University of Oslo and the Xiantao academic platform.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2023.1200354/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Enrichment analysis for different expression genes (A) GOKEGG results of TCGA-OV. Top 5 Head (B) and Tail (C) aggregation of GSEA results in TCGA-OV DEGs.
SUPPLEMENTARY FIGURE S2 | Comparison of immune cell infiltration between high/low ECM risk score group groups in TCGA-OV.
SUPPLEMENTARY FIGURE S3 | Summary of somatic mutation analysis in high/low ECM risk score group in TCGAOV Summary of mutations in the high (A) and low (B) ECM risk score groups. Mutation profile of TCGAOV patients in high (C) and low (D) ECM risk score group and percentage of top 6 mutated genes in high/low ECM risk score group (E). Comparison of mutation load among high/low ECM risk score group and TCGA cohorts (F).
SUPPLEMENTARY FIGURE S4 | Details of ECM risk score in the 15 key gene mutations.
SUPPLEMENTARY TABLE S1 | ECM-related gene list.
SUPPLEMENTARY TABLE S2 | Drug-gene interactions.
References
Ahmed, A. A., Etemadmoghadam, D., Temple, J., Lynch, A. G., Riad, M., Sharma, R., et al. (2010). Driver mutations in TP53 are ubiquitous in high grade serous carcinoma of the ovary. J. Pathol. 221, 49–56. doi:10.1002/path.2696
Aryal, B., Price, N. L., Suarez, Y., and Fernández-Hernando, C. (2019). ANGPTL4 in metabolic and cardiovascular disease. Trends Mol. Med. 25, 723–734. doi:10.1016/j.molmed.2019.05.010
Bashashati, A., Ha, G., Tone, A., Ding, J., Prentice, L. M., Roth, A., et al. (2013). Distinct evolutionary trajectories of primary high-grade serous ovarian cancers revealed through spatial mutational profiling. J. Pathol. 231, 21–34. doi:10.1002/path.4230
Belotti, Y., Lim, E. H., and Lim, C. T. (2022). The role of the extracellular matrix and tumor-infiltrating immune cells in the prognostication of high-grade serous ovarian cancer. Cancers 14, 404. doi:10.3390/cancers14020404
Benhadjeba, S., Edjekouane, L., Sauvé, K., Carmona, E., and Tremblay, A. (2018). Feedback control of the CXCR7/CXCL11 chemokine axis by estrogen receptor α in ovarian cancer. Mol. Oncol. 12 (10), 1689–1705. Epub 2018 Aug 23. doi:10.1002/1878-0261.12362
Bindea, G., Mlecnik, B., Tosolini, M., Kirilovsky, A., Waldner, M., Obenauf, A. C., et al. (2013). Spatiotemporal dynamics of intratumoral immune cells reveal the immune landscape in human cancer. Immunity 39, 782–795. doi:10.1016/j.immuni.2013.10.003
Carvalho, R. F., do Canto, L. M., Abildgaard, C., Aagaard, M. M., Tronhjem, M. S., Waldstrøm, M., et al. (2022). Single-cell and bulk RNA sequencing reveal ligands and receptors associated with worse overall survival in serous ovarian cancer. Cell Commun. Signal 20 (1), 176. doi:10.1186/s12964-022-00991-4
Cekin, R., Arici, S., Atci, M. M., Secmeler, S., and Cihan, S. (2020). The clinical i?mportance of fibroblast growth factor 23 on breast cancer patients. EJMI 4 (4), 471–476.
Chalmers, Z. R., Connelly, C. F., Fabrizio, D., Gay, L., Ali, S. M., Ennis, R., et al. (2017). Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 9, 34. doi:10.1186/s13073-017-0424-2
Charoentong, P., Finotello, F., Angelova, M., Mayer, C., Efremova, M., Rieder, D., et al. (2017). Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep. 18, 248–262. doi:10.1016/j.celrep.2016.12.019
Chen, Y., McAndrews, K. M., and Kalluri, R. (2021). Clinical and therapeutic relevance of cancer-associated fibroblasts. Nat. Rev. Clin. Oncol. 18 (12), 792–804. doi:10.1038/s41571-021-00546-5
Custódio, N., Savisaar, R., Carvalho, C., Bak-Gordon, P., Ribeiro, M. I., Tavares, J., et al. (2022). Expression profiling in ovarian cancer reveals coordinated regulation ofBRCA1/2and homologous recombination genes. Biomedicines 10 (2), 199. doi:10.3390/biomedicines10020199
Desbois, M., Udyavar, A. R., Ryner, L., Kozlowski, C., Guan, Y., Dürrbaum, M., et al. (2020). Integrated digital pathology and transcriptome analysis identifies molecular mediators of T-cell exclusion in ovarian cancer. Nat. Commun. 11, 5583. doi:10.1038/s41467-020-19408-2
Donelan, W., Dominguez-Gutierrez, P. R., and Kusmartsev, S. (2022). Deregulated hyaluronan metabolism in the tumor microenvironment drives cancer inflammation and tumor-associated immune suppression. Front. Immunol. 13, 971278. doi:10.3389/fimmu.2022.971278
Frantz, C., Stewart, K. M., and Weaver, V. M. (2010). The extracellular matrix at a glance. J. Cell Sci. 123, 4195–4200. doi:10.1242/jcs.023820
Friedman, J., Hastie, T., and Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22. doi:10.18637/jss.v033.i01
Gao, Q., Wang, S., Chen, X., Cheng, S., Zhang, Z., Li, F., et al. (2019). Cancer-cell-secreted CXCL11 promoted CD8+ T cells infiltration through docetaxel-induced-release of HMGB1 in NSCLC. J. Immunother. Cancer 7, 42. doi:10.1186/s40425-019-0511-6
Gao, Q., and Zhang, Y. (2021). CXCL11 signaling in the tumor microenvironment. Adv. Exp. Med. Biol. 1302, 41–50. doi:10.1007/978-3-030-62658-7_4
Gibney, G. T., Weiner, L. M., and Atkins, M. B. (2016). Predictive biomarkers for checkpoint inhibitor-based immunotherapy. Lancet Oncol. 17, e542–e551. doi:10.1016/S1470-2045(16)30406-5
Griffith, M., Griffith, O. L., Coffman, A. C., Weible, J. V., McMichael, J. F., Spies, N. C., et al. (2013). DGIdb: Mining the druggable genome. Nat. Methods 10, 1209–1210. doi:10.1038/nmeth.2689
Gubin, M. M., Artyomov, M. N., Mardis, E. R., and Schreiber, R. D. (2015). Tumor neoantigens: Building a framework for personalized cancer immunotherapy. J. Clin. Invest. 125, 3413–3421. doi:10.1172/JCI80008
Guo, Y.-C., and Yuan, Q. (2015). Fibroblast growth factor 23 and bone mineralisation. Int. J. Oral Sci. 7, 8–13. doi:10.1038/ijos.2015.1
Heintz, A., Odicino, F., Maisonneuve, P., Quinn, M. A., Benedet, J. L., Creasman, W. T., et al. (2006). Carcinoma of the ovary. Int. J. Gynaecol. Obstet. Off. Organ Int. Fed. Gynaecol. Obstet. 95 (1), S161–S192. doi:10.1016/S0020-7292(06)60033-7
Hellmann, M. D., Ciuleanu, T.-E., Pluzanski, A., Lee, J. S., Otterson, G. A., Audigier-Valette, C., et al. (2018). Nivolumab plus ipilimumab in lung cancer with a high tumor mutational burden. N. Engl. J. Med. 378, 2093–2104. doi:10.1056/NEJMoa1801946
Henderson, M. P. A., Hirte, H., Hotte, S. J., and Kavsak, P. A. (2016). Cytokines and cell adhesion molecules exhibit distinct profiles in health, ovarian cancer, and breast cancer. Heliyon 2, e00059. doi:10.1016/j.heliyon.2015.e00059
Henke, E., Nandigama, R., and Ergün, S. (2020). Extracellular matrix in the tumor microenvironment and its impact on cancer therapy. Front. Mol. Biosci. 6, 160. doi:10.3389/fmolb.2019.00160
Hung, T.-W., Chu, C.-Y., Yu, C.-L., Lee, C.-C., Hsu, L.-S., Chen, Y.-S., et al. (2020). Endothelial cell-specific molecule 1 promotes endothelial to mesenchymal transition in renal fibrosis. Toxins 12, 506. doi:10.3390/toxins12080506
Hunt, K. K., Wingate, H., Yokota, T., Liu, Y., Mills, G. B., Zhang, F., et al. (2013). Elafin, an inhibitor of elastase, is a prognostic indicator in breast cancer. Breast Cancer Res. BCR 15, R3. doi:10.1186/bcr3374
Ishwaran, H., Kogalur, U. B., and Kogalur, M. U. B. (2022). Package ‘randomForestSRC. ’ breast 6, 1.
Iyer, S., Zhang, S., Yucel, S., Horn, H., Smith, S. G., Reinhardt, F., et al. (2021). Genetically defined syngeneic mouse models of ovarian cancer as tools for the discovery of combination immunotherapy. Cancer Discov. 11, 384–407. doi:10.1158/2159-8290.CD-20-0818
Kalantaridou, S. N., Naka, K. K., Bechlioulis, A., Makrigiannakis, A., Michalis, L., and Chrousos, G. P. (2006). Premature ovarian failure, endothelial dysfunction and estrogen-progestogen replacement. Trends Endocrinol. Metab. Tem. 17, 101–109. doi:10.1016/j.tem.2006.02.003
Kan, T., Wang, W., Ip, P. P., Zhou, S., Wong, A. S., Wang, X., et al. (2020). Single-cell EMT-related transcriptional analysis revealed intra-cluster heterogeneity of tumor cell clusters in epithelial ovarian cancer ascites. Oncogene 39 (21), 4227–4240. doi:10.1038/s41388-020-1288-2
Karagiannis, G. S., Poutahidis, T., Erdman, S. E., Kirsch, R., Riddell, R. H., and Diamandis, E. P. (2012). Cancer-associated fibroblasts drive the progression of metastasis through both paracrine and mechanical pressure on cancer tissue. Mol. Cancer Res. 10 (11), 1403–1418. doi:10.1158/1541-7786.MCR-12-0307
Kim, R., Emi, M., and Tanabe, K. (2007). Cancer immunoediting from immune surveillance to immune escape. Immunology 121, 1–14. doi:10.1111/j.1365-2567.2007.02587.x
Kim, S.-H., Park, Y.-Y., Kim, S.-W., Lee, J.-S., Wang, D., and DuBois, R. N. (2011). ANGPTL4 induction by prostaglandin E2 under hypoxic conditions promotes colorectal cancer progression. Cancer Res. 71, 7010–7020. doi:10.1158/0008-5472.CAN-11-1262
Lan, T., Hu, H., Jiang, C., Yang, G., and Zhao, Z. (2020). A comparative study of decision tree, random forest, and convolutional neural network for spread-F identification. Adv.Space Res. 65, 2052–2061. doi:10.1016/j.asr.2020.01.036
Lee, S., and Lim, H. (2019). Review of statistical methods for survival analysis using genomic data. Genomics Inf. 17, e41. doi:10.5808/GI.2019.17.4.e41
Lei, X., Lei, Y., Li, J.-K., Du, W.-X., Li, R.-G., Yang, J., et al. (2020). Immune cells within the tumor microenvironment: Biological functions and roles in cancer immunotherapy. Cancer Lett. 470, 126–133. doi:10.1016/j.canlet.2019.11.009
Li, B., Severson, E., Pignon, J.-C., Zhao, H., Li, T., Novak, J., et al. (2016). Comprehensive analyses of tumor immunity: Implications for cancer immunotherapy. Genome Biol. 17, 174. doi:10.1186/s13059-016-1028-7
Li J, J., Tian, Y., Zhu, Y., Zhou, T., Li, J., Ding, K., et al. (2020). A multicenter random forest model for effective prognosis prediction in collaborative clinical research network. Artif. Intell. Med. 103, 101814. doi:10.1016/j.artmed.2020.101814
Li Y, Y., Li, H., Yang, B., Wei, J., Zhen, C., and Feng, L. (2020). Clinical significance of PI3 and HLA-DOB as potential prognostic predicators for ovarian cancer. Transl. Cancer Res. 9 (2), 466–476. doi:10.21037/tcr.2019.11.30
Li, Y. K., Zeng, T., Guan, Y., Liu, J., Liao, N. C., Wang, M. J., et al. (2023). Validation of ESM1 related to ovarian cancer and the biological function and prognostic significance. Int. J. Biol. Sci. 19 (1), 258–280. doi:10.7150/ijbs.66839
Lin, J., Yin, M., Liu, L., Gao, J., Yu, C., Liu, X., et al. (2022). The development of a prediction model based on random survival forest for the postoperative prognosis of pancreatic cancer: A SEER-based study. Cancers 14, 4667. doi:10.3390/cancers14194667
Liu, M., Guo, S., and Stiles, J. K. (2011). The emerging role of CXCL10 in cancer (Review). Oncol. Lett. 2, 583–589. doi:10.3892/ol.2011.300
Longatto-Filho, A., Fregnani, J. H., Mafra da Costa, A., de Araujo-Souza, P. S., Scapulatempo-Neto, C., Herbster, S., et al. (2021). Evaluation of elafin immunohistochemical expression as marker of cervical cancer severity. Acta Cytol. 65, 165–174. doi:10.1159/000512010
Love, M. I., Huber, W., and Anders, S. (2014). Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550. doi:10.1186/s13059-014-0550-8
Lu, W., Xie, B., Tan, G., Dai, W., Ren, J., Pervaz, S., et al. (2023). Elafin is related to immune infiltration and could predict the poor prognosis in ovarian cancer. Front. Endocrinol. (Lausanne) 14, 1088944. doi:10.3389/fendo.2023.1088944
Maroco, J., Silva, D., Rodrigues, A., Guerreiro, M., Santana, I., and de Mendonça, A. (2011). Data mining methods in the prediction of dementia: A real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests. BMC Res. Notes 4, 299. doi:10.1186/1756-0500-4-299
Martín-Alonso, M., García-Redondo, A. B., Guo, D., Camafeita, E., Martínez, F., Alfranca, A., et al. (2015). Deficiency of MMP17/MT4-MMP proteolytic activity predisposes to aortic aneurysm in mice. Circ. Res. 117, e13–e26. doi:10.1161/CIRCRESAHA.117.305108
Matsui, F., Kakizawa, H., Nishizuka, M., Hirano, K., Shuo, T., Ida, M., et al. (2005). Changes in the amounts of chondroitin sulfate proteoglycans in rat brain after neonatal hypoxia-ischemia. J. Neurosci. Res. 81, 837–845. doi:10.1002/jnr.20603
Matsuura, K., Sawai, H., Ikeo, K., Ogawa, S., Iio, E., Isogawa, M., et al. (2017). Genome-wide association study identifies TLL1 variant associated with development of hepatocellular carcinoma after eradication of hepatitis C virus infection. Gastroenterology 152, 1383–1394. doi:10.1053/j.gastro.2017.01.041
Mayakonda, A., Lin, D.-C., Assenov, Y., Plass, C., and Koeffler, H. P. (2018). Maftools: Efficient and comprehensive analysis of somatic variants in cancer. Genome Res. 28, 1747–1756. doi:10.1101/gr.239244.118
Miao, Y., Liu, J., Liu, X., Yuan, Q., Li, H., Zhang, Y., et al. (2022). Machine learning identification of cuproptosis and necroptosis-associated molecular subtypes to aid in prognosis assessment and immunotherapy response prediction in low-grade glioma. Front. Genet. 13, 951239. doi:10.3389/fgene.2022.951239
Naba, A., Clauser, K. R., Ding, H., Whittaker, C. A., Carr, S. A., and Hynes, R. O. (2016). The extracellular matrix: Tools and insights for the “omics” era. Matrix Biol. J. Int. Soc. Matrix Biol. 49, 10–24. doi:10.1016/j.matbio.2015.06.003
Ohnishi, J., Ohnishi, E., Shibuya, H., and Takahashi, T. (2005). Functions for proteinases in the ovulatory process. Biochim. Biophys. Acta 1751, 95–109. doi:10.1016/j.bbapap.2005.05.002
Paye, A., Truong, A., Yip, C., Cimino, J., Blacher, S., Munaut, C., et al. (2014). EGFR activation and signaling in cancer cells are enhanced by the membrane-bound metalloprotease MT4-MMP. Cancer Res. 74, 6758–6770. doi:10.1158/0008-5472.CAN-13-2994
Pelckmans, K., De Brabanter, J., Suykens, J. A. K., and De Moor, B. (2005). Handling missing values in support vector machine classifiers. Neural Netw. 18, 684–692. doi:10.1016/j.neunet.2005.06.025
Pintér, A., Hevesi, Z., Zahola, P., Alpár, A., and Hanics, J. (2020). Chondroitin sulfate proteoglycan-5 forms perisynaptic matrix assemblies in the adult rat cortex. Cell. Signal. 74, 109710. doi:10.1016/j.cellsig.2020.109710
Pitt, J. M., Marabelle, A., Eggermont, A., Soria, J.-C., Kroemer, G., and Zitvogel, L. (2016). Targeting the tumor microenvironment: Removing obstruction to anticancer immune responses and immunotherapy. Ann. Oncol. Off. J. Eur. Soc. Med. Oncol. 27, 1482–1492. doi:10.1093/annonc/mdw168
Prat, J.FIGO Committee on Gynecologic Oncology (2014). Staging classification for cancer of the ovary, fallopian tube, and peritoneum. Int. J. Gynaecol. Obstet. Off. Organ Int. Fed. Gynaecol. Obstet. 124, 1–5. doi:10.1016/j.ijgo.2013.10.001
Rocha, S. F., Schiller, M., Jing, D., Li, H., Butz, S., Vestweber, D., et al. (2014). Esm1 modulates endothelial tip cell behavior and vascular permeability by enhancing VEGF bioavailability. Circ. Res. 115, 581–590. doi:10.1161/CIRCRESAHA.115.304718
Sanchez-Vega, F., Mina, M., Armenia, J., Chatila, W. K., Luna, A., La, K. C., et al. (2018). Oncogenic signaling pathways in the cancer genome Atlas. Cell 173, 321–337.e10. e10. doi:10.1016/j.cell.2018.03.035
Schreiber, R. D., Old, L. J., and Smyth, M. J. (2011). Cancer immunoediting: Integrating immunity’s roles in cancer suppression and promotion. Science 331, 1565–1570. doi:10.1126/science.1203486
Schwarz, R. F., Ng, C. K. Y., Cooke, S. L., Newman, S., Temple, J., Piskorz, A. M., et al. (2015). Spatial and temporal heterogeneity in high-grade serous ovarian cancer: A phylogenetic analysis. PLoS Med. 12, e1001789. doi:10.1371/journal.pmed.1001789
Stenzinger, A., Allen, J. D., Maas, J., Stewart, M. D., Merino, D. M., Wempe, M. M., et al. (2019). Tumor mutational burden standardization initiatives: Recommendations for consistent tumor mutational burden assessment in clinical samples to guide immunotherapy treatment decisions. Genes Chromosom. Cancer 58 (8), 578–588. doi:10.1002/gcc.22733
Stewart, J. M., Shaw, P. A., Gedye, C., Bernardini, M. Q., Neel, B. G., and Ailles, L. E. (2011). Phenotypic heterogeneity and instability of human ovarian tumor-initiating cells. Proc. Natl. Acad. Sci. U. S. A. 108, 6468–6473. doi:10.1073/pnas.1005529108
Strickland, K. C., Howitt, B. E., Shukla, S. A., Rodig, S., Ritterhouse, L. L., Liu, J. F., et al. (2016). Association and prognostic significance of BRCA1/2-mutation status with neoantigen load, number of tumor-infiltrating lymphocytes and expression of PD-1/PD-L1 in high grade serous ovarian cancer. Oncotarget 7, 13587–13598. doi:10.18632/oncotarget.7277
Strieter, R. M., Burdick, M. D., Mestas, J., Gomperts, B., Keane, M. P., and Belperio, J. A. (2006). Cancer CXC chemokine networks and tumour angiogenesis. Eur. J. Cancer 42, 768–778. doi:10.1016/j.ejca.2006.01.006
Su, T., Zhang, P., Zhao, F., and Zhang, S. (2021). A novel immune-related prognostic signature in epithelial ovarian carcinoma. Aging (Albany NY) 13 (7), 10289–10311. doi:10.18632/aging.202792
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A. 102, 15545–15550. doi:10.1073/pnas.0506580102
Sun, H., Fang, F., Li, K., Zhang, H., Zhang, M., Zhang, L., et al. (2019). Circulating ESM-1 levels are correlated with the presence of coronary artery disease in patients with obstructive sleep apnea. Respir. Res. 20, 188. doi:10.1186/s12931-019-1143-6
Tamborero, D., Rubio-Perez, C., Deu-Pons, J., Schroeder, M. P., Vivancos, A., Rovira, A., et al. (2018). Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 10, 25. doi:10.1186/s13073-018-0531-8
Tebben, P. J., Kalli, K. R., Cliby, W. A., Hartmann, L. C., Grande, J. P., Singh, R. J., et al. (2005). Elevated fibroblast growth factor 23 in women with malignant ovarian tumors. Mayo Clin. Proc. 80 (6), 745–751. doi:10.1016/S0025-6196(11)61528-0
The Cancer Genome Atlas Research Network (2011). Integrated genomic analyses of ovarian carcinoma. Nature 474, 609–615. doi:10.1038/nature10166
Toth, R., Schiffmann, H., Hube-Magg, C., Büscheck, F., Höflmayer, D., Weidemann, S., et al. (2019). Random forest-based modelling to detect biomarkers for prostate cancer progression. Clin. Epigenetics 11, 148. doi:10.1186/s13148-019-0736-8
Valmiki, S., Aid, M. A., Chaitou, A. R., Zahid, M., Valmiki, M., Fawzy, P., et al. (2021). Extracellular matrix: A treasure trove in ovarian cancer dissemination and chemotherapeutic resistance. Cureus 13, e13864. doi:10.7759/cureus.13864
Wang, H., Li, S., Wang, Q., Jin, Z., Shao, W., Gao, Y., et al. (2021). Tumor immunological phenotype signature-based high-throughput screening for the discovery of combination immunotherapy compounds. Sci. Adv. 7, eabd7851. doi:10.1126/sciadv.abd7851
Wang, L., Ma, J., Liu, F., Yu, Q., Chu, G., Perkins, A. C., et al. (2007). Expression of MUC1 in primary and metastatic human epithelial ovarian cancer and its therapeutic significance. Gynecol. Oncol. 105 (3), 695–702. doi:10.1016/j.ygyno.2007.02.004
Wang, X., Zhao, J., Zhang, Y., Liu, Y., Wang, J., Shi, R., et al. (2023). Molecular mechanism of Wilms’ tumor (Wt1) (+/−KTS) variants promoting proliferation and migration of ovarian epithelial cells by bioinformatics analysis. J. Ovarian Res. 16, 46. doi:10.1186/s13048-023-01124-2
Ween, M. P., Oehler, M. K., and Ricciardelli, C. (2011). Role of versican, hyaluronan and CD44 in ovarian cancer metastasis. Int. J. Mol. Sci. 12, 1009–1029. doi:10.3390/ijms12021009
Wei, C., Liu, X., Wang, Q., Li, Q., and Xie, M. (2021). Identification of hypoxia signature to assess the tumor immune microenvironment and predict prognosis in patients with ovarian cancer. Int. J. Endocrinol. 2021, 4156187. doi:10.1155/2021/4156187
World Health Organization International Agency for Research on Cancer (WHO-IARC) (2020). Globocan 2020: Estimated cancer incidence, mortality and prevalence worldwide in 2020. Ovary Avaialable at: https://gco.iarc.fr/today/data/factsheets/cancers/25-Ovary-fact-sheet.pdf (Accessed May 5, 2023).
Wu, Q.-W., Xia, J.-F., Ni, J.-C., and Zheng, C.-H. (2021). Gaerf: Predicting lncRNA-disease associations by graph auto-encoder and random forest. Brief. Bioinform 22, bbaa391. doi:10.1093/bib/bbaa391
Xiao, C., Wang, Y., Cheng, Q., and Fan, Y. (2022). Increased expression of MMP17 predicts poor clinical outcomes in epithelial ovarian cancer patients. Med. Baltim. 101 (34), e30279. doi:10.1097/MD.0000000000030279
Xing, F., Saidou, J., and Watabe, K. (2010). Cancer associated fibroblasts (CAFs) in tumor microenvironment. Front. Biosci. (Landmark Ed. 15 (1), 166–179. doi:10.2741/3613
Xu, N., Wu, Y.-P., Yin, H.-B., Xue, X.-Y., and Gou, X. (2018). Molecular network-based identification of competing endogenous RNAs and mRNA signatures that predict survival in prostate cancer. J. Transl. Med. 16, 274. doi:10.1186/s12967-018-1637-x
Yang, J., Wang, C., Zhang, Y., Cheng, S., Wu, M., Gu, S., et al. (2023). A novel autophagy-related gene signature associated with prognosis and immune microenvironment in ovarian cancer. J. Ovarian Res. 16 (1), 86. doi:10.1186/s13048-023-01167-5
Yang, Y., Zhao, T., Chen, Q., Li, Y., Xiao, Z., Xiang, Y., et al. (2022). Nanomedicine strategies for heating “cold” ovarian cancer (OC): Next evolution in immunotherapy of OC. Adv. Sci. Weinh. Baden-Wurtt. Ger. 9, e2202797. doi:10.1002/advs.202202797
Young, C., and Argáez, C. (2019). Rapid genome-wide testing: A review of clinical utility, cost-effectiveness, and guidelines. Ottawa (ON): Canadian Agency for Drugs and Technologies in Health Available at: http://www.ncbi.nlm.nih.gov/books/NBK549546/ (Accessed May 2, 2023).
Yu, G., Wang, L.-G., Han, Y., and He, Q.-Y. (2012). clusterProfiler: an R package for comparing biological themes among gene clusters. Omics J. Integr. Biol. 16, 284–287. doi:10.1089/omi.2011.0118
Zeng, D., Ye, Z., Shen, R., Yu, G., Wu, J., Xiong, Y., et al. (2021). Iobr: Multi-Omics immuno-oncology biological research to decode tumor microenvironment and signatures. Front. Immunol. 12, 687975. doi:10.3389/fimmu.2021.687975
Zhai, Y., Lu, Q., Lou, T., Cao, G., Wang, S., and Zhang, Z. (2020). MUC16affects the biological functions of ovarian cancer cells and induces an antitumor immune response by activating dendritic cells. Ann. Transl. Med. 8 (22), 1494. doi:10.21037/atm-20-6388
Zhang, D., Li, Y., Yang, S., Wang, M., Yao, J., Zheng, Y., et al. (2021). Identification of a glycolysis-related gene signature for survival prediction of ovarian cancer patients. Cancer Med. 10 (22), 8222–8237. doi:10.1002/cam4.4317
Zhang, G. Y., Ahmed, N., Riley, C., Oliva, K., Barker, G., Quinn, M. A., et al. (2005). Enhanced expression of peroxisome proliferator-activated receptor gamma in epithelial ovarian carcinoma. Br. J. Cancer 92 (1), 113–119. doi:10.1038/sj.bjc.6602244
Zhang, H., Dong, S., and Feng, J. (2019a). Epigenetic profiling and mRNA expression reveal candidate genes as biomarkers for colorectal cancer. J. Cell. Biochem. 120, 10767–10776. doi:10.1002/jcb.28368
Zhang, Y., Xu, J., Zhang, N., Chen, M., Wang, H., and Zhu, D. (2019b). Targeting the tumour immune microenvironment for cancer therapy in human gastrointestinal malignancies. Cancer Lett. 458, 123–135. doi:10.1016/j.canlet.2019.05.017
Zhu, Y., Li, X., Wang, L., Hong, X., and Yang, J. (2022). Metabolic reprogramming and crosstalk of cancer-related fibroblasts and immune cells in the tumor microenvironment. Front. Endocrinol. 13, 988295. doi:10.3389/fendo.2022.988295
Keywords: extracellular matrix, ovarian serous adenocarcinoma, artificial intelligence, prognosis, immunity therapy
Citation: Geng T, Zheng M, Wang Y, Reseland JE and Samara A (2023) An artificial intelligence prediction model based on extracellular matrix proteins for the prognostic prediction and immunotherapeutic evaluation of ovarian serous adenocarcinoma. Front. Mol. Biosci. 10:1200354. doi: 10.3389/fmolb.2023.1200354
Received: 04 April 2023; Accepted: 31 May 2023;
Published: 14 June 2023.
Edited by:
Wei Zhang, University of Southern California, United StatesReviewed by:
Guangyu Yao, Southern Medical University, ChinaOscar Maiques, Queen Mary University of London, United Kingdom
Copyright © 2023 Geng, Zheng, Wang, Reseland and Samara. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Athina Samara, YXRoaW5hLnNhbWFyYUBvZG9udC51aW8ubm8=