- 1Institute of Blood Transfusion, Chinese Academy of Medical Sciences and Peking Union Medical College, Chengdu, China
- 2Institute of Oral Basic Research, School and Hospital of Stomatology, Cheeloo College of Medicine, Shandong University, Jinan, China
- 3Department of Obstetrics and Gynecology, The First Hospital of Jilin University, Changchun, China
- 4National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
- 5School of Public Health, Anhui Medical University, Hefei, China
Background: Cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) are among the most prevalent gynecologic malignancies globally. The prognosis is abysmal once cervical cancer progresses to lymphatic metastasis. Anoikis, a specialized form of apoptosis induced by loss of cell adhesion to the extracellular matrix, plays a critical role. The prediction model based on anoikis-related genes (ARGs) expression and clinical data could greatly aid clinical decision-making. However, the relationship between ARGs and CESC remains unclear.
Methods: ARGs curated from the GeneCards and Harmonizome portals were instrumental in delineating CESC subtypes and in developing a prognostic framework for patients afflicted with this condition. We further delved into the intricacies of the immune microenvironment and pathway enrichment across the identified subtypes. Finally, our efforts culminated in the creation of an innovative nomogram that integrates ARGs. The utility of this prognostic tool was underscored by Decision Curve Analysis (DCA), which illuminate its prospective benefits in guiding clinical interventions.
Results: In our study, We discerned a set of 17 survival-pertinent, anoikis-related differentially expressed genes (DEGs) in CESC, from which nine were meticulously selected for the construction of prognostic models. The derived prognostic risk score was subsequently validated as an autonomous prognostic determinant. Through comprehensive functional analyses, we observed distinct immune profiles and drug response patterns among divergent prognostic stratifications. Further, we integrated the risk scores with the clinicopathological characteristics of CESC to develop a robust nomogram. DCA corroborated the utility of our model, demonstrating its potential to enhance patient outcomes through tailored clinical treatment strategies.
Conclusion: The predictive signature, encompassing nine pivotal genes, alongside the meticulously constructed nomogram developed in this research, furnishes clinicians with a sophisticated tool for tailoring treatment strategies to individual patients diagnosed with CESC.
1 Introduction
Cervical carcinoma ranks as the fourth most common malignancy among women on a global scale (1). Data sourced from the Global Cancer Observatory, encompassing 185 countries, indicated that in 2018, there were approximately 570,000 active cases and 311,000 fatalities attributable to cervical cancer (2). Notably, breast and cervical carcinomas are among the trio of cancers witnessing a significant surge in incidence among women, particularly in developing nations. Furthermore, China and India collectively bear over a third of the worldwide cervical cancer burden (3). In developed countries, due to the popularization of Pap smear screening, the incidence of cervical cancer has been dramatically reduced (4).
Chronic infection with high-risk human papillomavirus (HPV) is recognized as a primary risk factor for cervical cancer development, with ≥90% of such malignancies being associated with high-risk HPV types (5, 6). Early-stage cervical cancer patients typically achieve reasonable local control yet are prone to distant recurrence, maintaining an overall survival rate of approximately 90%. Conversely, individuals with locally advanced cervical cancer face a higher risk of both distant and local failure, evidenced by a mortality rate of roughly 35% (7–9). Consequently, there is an imperative need for the identification of novel biomarkers that can accurately predict the prognosis of early-stage CESC, thereby facilitating timely and effective clinical interventions (10).
Anoikis refers to a distinct form of programmed cell death precipitated by cellular detachment from the extracellular matrix or neighboring cells (11). This process is pivotal in maintaining tissue homeostasis and preventing anomalous cell growth and attachment in inappropriate locations. Studies have found that most tumor cells have anti-apoptotic properties, whereby pro-apoptotic proteins are inhibited, the internal and external pathways of apoptosis are blocked, and cell survival factors are upregulated, which collectively facilitate the survival, invasion, and metastasis of tumor cells (12). However, research on the relationship between anoikis and distant metastasis in CESC is insufficient.
In this investigation, we explored the prognostic significance of ARGs in CESC and formulated an ARGs-based prognostic scoring model. Additionally, we executed a comprehensive examination of the variations within the tumor microenvironment (TME) and specifically scrutinized the immune-related aspects of the TME based on the risk score categorization.
2 Methods
2.1 Gene expression profile acquisition and patient data analysis
Gene expression profiles were meticulously selected based on predefined criteria from two distinct sources: 300 samples of CESC tissues were obtained from the Gene Expression Omnibus (GEO: GSE44001), and an additional 304 profiles, inclusive of three samples from normal adjacent tissues, were sourced from The Cancer Genome Atlas (TCGA-CESC). The inclusion criteria for these datasets were as follows: The cohort size needs to be large enough, including patients of different ages and stages, regardless of race, to be included in this study. All data underwent standardized normalization and batch effect removal processes to ensure consistency and reliability in subsequent analyses.
2.2 Anoikis-related gene identification
A comprehensive dataset of 514 ARGs was acquired from the GeneCards database (https://www.genecards.org/) and the Harmonizome portal, referenced in publications (13, 14). Following the initial data acquisition, a comprehensive analysis was performed on the TCGA-CESC cohort using the ‘limma’ package (Version: 3.58.1), a component of the Bioconductor software suite (Version: 3.18). This analytical process resulted in the identification of 170 differentially expressed genes (DEGs), achieved through a comparative evaluation of the expression profiles of the 514 ARGs between CESC tumor tissues and adjacent normal tissues.
2.3 Harmonized gene cluster analysis
To elucidate distinct patterns in anoikis regulation, we employed an advanced consensus clustering algorithm, utilizing the k-means method to categorize anoikis-related gene expression profiles. Subsequently, sophisticated techniques for dimensionality reduction were used, specifically t-distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP). These methods facilitated the validation of the clustering’s robustness and reliability, utilizing the comprehensive capabilities of the R package ‘ggplot2’ for intricate data visualization and analysis.
2.4 Gene functional enrichment analysis
Gene Set Variation Analysis (GSVA) was chosen for its ability to provide a comprehensive assessment of gene set enrichment variations within CESC tissues, offering insights into the underlying molecular mechanisms. The gene set file ‘c2.cp.kegg.v7.4.symbols.gmt’ required for this analysis was obtained from the Molecular Signatures Database (MSigDB). The enrichment analysis was then meticulously executed using the ‘GSVA’ R package (version 1.50.0). Subsequently, the enrichment analysis was meticulously executed using the ‘GSVA’ R package (version 1.50.0), ensuring a rigorous and precise evaluation of the gene sets (15).
2.5 Elaboration and verification of anoikis-related prognostic signatures
The patient cohort was systematically partitioned into two distinct groups: a training set for the initial construction of the risk model and a validation set dedicated to its subsequent verification. Initial identification of genes associated with survival outcomes commenced with univariate Cox regression analysis. Subsequently, a refined analysis employing a minor absolute shrinkage and selection operator (LASSO) regression technique was performed, utilizing the ‘glmnet’ package (Version: 4.1–8) in R for implementation (16). Determination of the optimal penalty regularization parameter λ was achieved through an exhaustive 10-fold cross-validation methodology. In the ensuing stage, a sophisticated multivariate Cox regression approach was employed to delineate essential genes and ascertain their respective coefficients meticulously. A selection of nine pivotal anoikis-associated gene signatures was made to formulate the risk signatures, guided by the most favorable lambda values and their corresponding coefficients. The computation of each patient’s ARG signature risk score was .
In this model, ‘Coefi’ denotes the risk coefficient for each gene, while ‘Expi’ represents the gene’s expression level. To ascertain the predictive efficacy of this model, we utilized Kaplan-Meier (KM) survival analysis and time-dependent Receiver Operating Characteristic (ROC) curves. The dual approach facilitated a thorough evaluation of the model’s aptitude in forecasting patient outcomes. Then, a set of nine anoikis-related DEGs exhibiting a significant correlation with overall survival (OS) were discerned through meticulous multivariate Cox regression and LASSO analyses within both the GSE44001 and TCGA-CESC cohorts.
2.6 Analysis of risk score and immune cell correlation
In this research, the CIBERSORT algorithm and single-sample Gene Set Enrichment Analysis (ssGSEA) R scripts were employed for the quantitative assessment of the relative abundances of immune cell subsets infiltrating the tumor microenvironment (17). Specifically, CIBERSORT was utilized to ascertain the composition of various immune cell types within the groups with lower and higher prognostic risk. This method guaranteed that the aggregate score attributed to all inferred immune cell types in each specimen was normalized to one. Additionally, Spearman’s rank correlation methodology was utilized to elucidate the relationships between the computed risk scores and the degrees of immune cell infiltration.
2.7 Formulation and appraisal of predictive nomogram
An intricately crafted predictive nomogram integrating clinicopathological features with risk scores has been developed. A calibration plot is employed for internal validation purposes to enhance the model’s accuracy. Additionally, the nomogram’s prognostic capabilities were verified through the application of a time-cumulative index, providing a robust measure of its predictive performance. Furthermore, a decision curve analysis (DCA) was conducted meticulously to evaluate the net clinical benefits of the nomogram (18).
2.8 Tumor immune microenvironment single-cell analysis
The Tumor Immune Single-Cell Hub (TISCH; http://tisch.comp-genomics.org), a comprehensive online repository specializing in scRNA-sequencing data pertinent to the TME, was utilized as a pivotal resource in this study (19). Subsequently, Hallmark analysis was employed to assess apoptosis level changes within various cell clusters. This extensive database facilitated a systematic exploration of the heterogeneity within the TME across diverse datasets and cellular phenotypes.
2.9 Protein expression verification using human protein atlas
To meticulously corroborate the differential expression of the nine selected genes, including BCL2, BAX, IGF1, PLAU, EDA2R, ABL1, MIR200A, FASN, and NTRK3, in control versus cervical cancer tissues, immunohistochemical staining data from the Human Protein Atlas (HPA, http://www.proteinatlas.org) were systematically analyzed (20). This approach enabled a comprehensive visualization of the distinct expression patterns of these genes in tumorous tissue compared to normal tissue. The immunohistochemical analysis provided clear evidence of aberrant gene expression in the cancerous specimens, thereby substantiating the potential role of these genes in cervical carcinogenesis.
2.10 Cell culture
The Ect1/e6e7 and HeLa cell lines used in this study were purchased from the Shanghai Cell Bank (Shanghai, China). The cells were cultured in a complete medium (DMEM + 10% fetal bovine serum + 1% penicillin-streptomycin) and maintained in a 37°C incubator with 5% CO2 and saturated humidity. The medium was refreshed every 2–3 days. Cells were passaged when they reached over 90% confluence.
2.11 Reverse transcriptase-quantitative polymerase chain reaction
Cells were first rinsed with PBS after discarding the culture medium to prepare for total RNA extraction. RNA was isolated using the Trizol reagent (AG21102, Precision Biotechnology). The cDNA synthesis was performed with the Evo M-MLV RT Reverse Transcription Kit II (AG11711, Accurate Biotechnology). RT-qPCR was carried out using the SYBR Green Pro Taq HS premixed qPCR kit (AG11701, Accurate Biotechnology) on a LightCycler® 96 system (Roche Ltd, Switzerland). Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) served as the internal control. Gene expression fold changes were calculated using the 2-ΔΔCT method. Primer sequences are listed in Table 1.
2.12 Validating the accuracy of prognostic indicators using the GEPIA data platform
Gene Expression Profiling Interactive Analysis (GEPIA) (http://gepia.cancer-pku.cn/) has RNA sequencing data of tumor tissues and normal tissues from TCGA and Genotype-Tissue Expression Project (GTEx), through which the platform not only allows access to gene expression in different tumors but also allows single-gene survival analysis (21). TCGA tumors vs TCGA normal + GTEx normal were compared using one-way ANOVA with a p-value cutoff of 0.05.
2.13 Comprehensive statistical methodology
Comprehensive statistical analyses were conducted utilizing R software (Version: 4.3.2). Specific statistical tests, including t-tests and ANOVA, were applied to meticulously analyze the data. The threshold for statistical significance was established at a P value of less than 0.05, and the false discovery rate (FDR) was rigorously controlled with a cut-off set at q<0.05. The experimental data are presented as mean ± standard deviation.
3 Results
3.1 Identification of prognosis-related ARGs in CESC
Initially, 563 ARGs were identified from the GeneCards and Harmonizome portals and narrowed down to 514 ARGs through Venn diagram analysis (Figure 1A). Comparison with normal adjacent tissues revealed 170 DEGs in CESC samples. Data from the TCGA-CESC cohort and GSE44001 were integrated, forming a consolidated ‘CESC-GSE44001’ cohort with 17,073 genes. Univariate Cox regression analysis identified 33 out of these 170 DEGs as significantly associated with patient survival (P < 0.05) (Figure 1B). Apart from BCL2, IGF1, GLI2, ITGA8, NTRK3, and ONECUT1, which indicated varied prognoses, the remaining 27 genes were predominantly correlated with poor prognosis. A network diagram revealed tight interconnections among these 33 genes (Figure 1C). Considering the frequent chromosomal aberrations in CESC, copy number variation data was extracted from the TCGA database (Figure 1D). Chromosomal changes in ARGs were thoroughly analyzed to locate each gene accurately. Chromosome one exhibited the highest occurrence of ARGs, including CLIC4, SLC2A1, KIF14, NRAS, IRF6, and PARP1. A significant genomic ‘GAIN’ was observed in COL4A2 on chromosome 13, coupled with a notable ‘LOSS’ of the BRCA2 gene on the same chromosome (Figure 1E).
Figure 1 Differentially expressed ARGs in CESC and their correlation with prognosis were identified. (A) Identification of 514 genes associated with anoikis from study GSE44001 and the TCGA-CESC dataset. (B) A forest plot illustrating the most significant 33 ARGs (P < 0.05) identified through univariate Cox regression analysis. (C) A network diagram depicting the interrelationships among the top 33 ARGs. (D) Analysis of copy number variations (CNVs) in the 33 ARGs within the TCGA-CESC cohort. (E) Examination of chromosomal locations and alterations in the ARGs.
3.2 Molecular subgroup distinctions in CESC via clustering of 33 ARGs
To elucidate the functional implications of ARGs in CESC, we utilized the Consensus Cluster Plus R package (Version: 1.66.0) to perform consensus clustering of 33 prognosis-related DEGs (P <0.05). Optimal stratification into two distinct subtypes was achieved at k = 2 (Figure 2A). Subsequent overall survival (OS) analysis (P < 0.001) revealed a statistically notable prognostic disparity between these subtypes (Figure 2B). The validity of this bifurcation was corroborated through t-SNE and UMAP analyses, confirming the distinct segregation at k = 2 (Figures 2C, D). Heat maps delineating ARGs expression across these subtypes, alongside their clinicopathological attributes, indicated ITGA8 as a potential prognostic enhancer (Figure 2E). Furthermore, KEGG pathway enrichment analysis conducted for both subgroups identified significant enrichment of the histidine metabolism pathway in subgroup B, which exhibited a favorable prognosis (Figure 2F). Additionally, Gene Set Enrichment Analysis (GSEA) underscored reduced cytokine-cytokine receptor interactions, enrichment of focal adhesion, and JAK-STAT signaling pathways in subgroup B, pivotal pathways implicated in tumor cell migration and colonization of new anchorage sites (Figure 2F).
Figure 2 ARGs-based subgroup categorization in CESC. (A) A consensus matrix was established for k=2, utilizing consensus clustering. (B) The analysis of overall survival revealed significant differences between the two identified subgroups (P < 0.001). (C, D) Employing tSNE and UMAP techniques, the two subtypes were discerned based on ARG expression patterns. (E) A comprehensive heat map displayed the expression profiles of ARGs alongside the clinicopathological attributes of the two subtypes. (F) GSVA was conducted to pinpoint the distinct KEGG pathway enrichments contrasting between cluster B and cluster A.
3.3 Differential gene expression and immune cell infiltration patterns in subgroups
Box plots show the expression patterns of ARGs in the two subgroups (Figure 3A). IGF1, GLI2, MIR200A, ITGA8, and NTRK3 levels were lower in subgroup A than in subgroup B. Conversely, the other genes were highly expressed in subgroup A. Given their correlation with overall survival, these DEGs may encode pivotal molecules that significantly influence the prognosis of patients with CESC. Furthermore, they present as potential targets for developing targeted therapeutic strategies. There were also significant differences in the extent of immune cell infiltration, and the proportions of activated CD4+ T cells, dendritic cells, and other immune cells in subgroup A were significantly higher than those in subgroup B (Figure 3B).
Figure 3 Expression patterns of ARGs and immune infiltration in distinct subtype clusters. (A) Analysis of ARGs expression across the two different subtype clusters. (B) Examination of immune infiltration characteristics within these two subtype clusters. Significance: *P < 0.05, **P < 0.01, ***P < 0.001.
3.4 Construction and validation of prognostic signals related to anoikis
In our investigation into the clinical relevance of ARGs, 33 ARGs (p < 0.05) were subjected to Lasso-penalized Cox regression analysis (Figures 4A, B). The optimal lambda (λ) value for Lasso regression, corresponding to the minimum cross-validation error, was determined to be 17. To refine our predictive model, multivariate Cox regression was employed, leading to the selection of 9 pivotal ARGs from the initial 17. These were instrumental in constructing the prognostic model. The resultant risk score, based on this nine-ARG signature, was designated as the “ARG score” with corresponding correlation coefficients detailed in Supplementary Table S1. The Prognostic Index (PI) was calculated as follows: PI = (0.929 × BAX expression) + (0.177 × PLAU expression) + (0.729 × EDA2R expression) + (0.782 × ABL1 expression) + (0.197 × MIR200A expression) + (0.317 × FASN expression) − (0.616 × BCL2 expression) − (0.362 × IGF1 expression) − (0.978 × NTRK3 expression).
Figure 4 Development of an ARGs-based prognostic signature. (A) Identification of prognostic ARGs using LASSO regression with 10-fold cross-validation. (B) Visualization of coefficients for seventeen key prognostic ARGs. (C, D) Kaplan-Meier curves showing survival variations in different risk subtypes. (E, F) Time-dependent ROC analysis for OS at 1, 3, and 5 years. (G) Risk score evaluation in two established clusters. (H) Alluvial diagram depicting transitions between subtypes and survival status.
Kaplan-Meier analysis indicated a poorer prognosis for patients in the high-risk group, a trend corroborated in the TCGA-CESC validation cohort (Figures 4C, D). Time-dependent ROC curves for the model, assessing 1-, 3-, and 5-year overall survival, demonstrated its robust predictive accuracy (Figures 4E, F). A significant disparity in risk scores was observed between the previously identified subtypes (Figures 4G, H). An alluvial diagram was constructed to illustrate transitions among ARG clusters, changes in ARG scores, and survival status.
3.5 Gene set enrichment analysis and immune activity with the different risk score
We investigated the differences in the TME of CESC patients classified into high- and low-risk categories. We conducted a quantitative analysis of immune cell proportions using CIBERSORT R Scripts, ordering CESC samples from lowest to highest risk scores to reflect the variability in immune cell types (Figure 5A). Notably, a rising trend in activated mast cells correlated with increasing risk scores (R = 0.25) (Figure 5B). Additionally, eight other immune cell types showed correlations with risk scores (Supplementary Figure S1). In high-risk patients, activated mast cells constituted a more significant portion of the TME (Figure 5C), suggesting their influence on adverse prognoses. This association of various immune cells with CESC risk categories enhances our understanding of TME composition in these tumors (Figure 5D). The nine-gene ARG score model demonstrated distinct expression patterns between risk groups, corresponding to variations in immune cell infiltration (Figures 5E, F). Differential stromal and immune scores, derived from expression profile evaluations, were observed between the high- and low-risk groups (Figure 5G). Moreover, using the ‘oncoPredict’ R package (Version: 0.2), we identified differential drug susceptibility profiles across these groups, highlighting potential therapeutic avenues (Supplementary Table S2; Supplementary Figures S2, S3).
Figure 5 Exploring the immune landscape in CESC’s TME across varying risk scores. (A) Analysis of the composition of immune cell infiltration at different risk levels. (B) Investigating the relationship between risk scores and the abundance of activated Mast cells in CESC. (C) Comparison of immune cell profiles in high-risk and low-risk groups. (D) Examination of interrelations among different immune cells. (E) A heatmap presenting the expression patterns of nine central ARGs. (F) Study of the associations between immune cells and these nine pivotal ARGs. (G) Comparative analysis of estimate scores based on expression profiles in high-risk versus low-risk groups. Significance: *P < 0.05, **P < 0.01, ***P < 0.001.
3.6 Establishment of a prognostic nomogram with high efficacy
We integrated the ARG score with the clinical and pathological characteristics of CESC patients to develop a comprehensive nomogram (Figure 6A). The accuracy and prognostic efficacy of this nomogram were confirmed by the calibration plot (Figure 6B). The cumulative risk curve (Figure 6C) showed a progressive increase in overall survival risk for CESC patients with higher nomogram scores, underscoring its prognostic value. Decision Curve Analysis demonstrated the ability of the nomogram to predict short- and long-term survival outcomes in CESC patients (Figure 6D). The forest plot identified the T-stage, N-stage, and risk score as crucial risk factors within the nomogram (Figure 6E). Survival analyses comparing T1N0 vs. T1N1 and T2N0 vs. T2N1 indicated significantly lower survival rates for patients with N1 during T1 compared to N0 (P<0.001) (Figure 6F). These findings collectively validate our ARG-based nomogram as a robust tool for clinical prognosis in CESC patients, significantly contributing to personalized patient care strategies.
Figure 6 Creation of a nomogram for predicting outcomes in CESC. (A) Construction of a nomogram integrating ARG-score with various clinicopathological parameters. (B) A calibration curve is employed to verify the accuracy of the nomogram. (C) A cumulative hazard curve depicting survival probabilities over time for patients. (D) DCA is used to evaluate the nomogram’s effectiveness at 1, 3, and 5 years for OS in CESC. (E) A forest plot summarizing the outcomes of multivariable Cox regression analyses, including clinical features and risk scores in CESC. (F) Survival analyses of T1N0 vs. T1N1, T2N0 vs T2N1. Significance: *P < 0.05, **P < 0.01, ***P < 0.001.
3.7 BAX and PLAU are closely associated with lower apoptosis levels in tumor cells
Mapping the expression of anoikis-associated genes to different cell types in the tumor microenvironment elucidated the interactions between these genes in tumor, stromal, and immune cells, crucial for understanding anoikis resistance and cancer progression. We analyzed the expression profiles of eight ARGs in the TME using the CESC_GSE168652 single-cell dataset from TISCH. This dataset identified 21 cell clusters across seven distinct cell types (Figure 7A). ABL1, BCL2, and IGF1 exhibited high expression in fibroblasts and smooth muscle cells (SMCs). BAX and PLAU were highly expressed in malignant cells, while FASN showed broad expression across multiple cell types. EDA2R had low expression in various cell types, whereas NTRK3 was highly expressed in endometrial stromal cells (Figures 7B, C). Hallmark analysis revealed lower apoptosis levels in tumor cells, marked by BAX and PLAU, indicating their unique roles (Figure 7A). Since MIR200A was not expressed in the cell clusters, it was excluded from the analysis.
Figure 7 Analysis of ARGs in CESC’s TME cells using scRNA-seq database. (A) Cell type identification, quantification, and Hallmark analysis in dataset GSE168652. (B, C) Analysis of the percentages and expression levels of crucial ARGs, including ABL1, BAX, BCL2, EDA2R, FASN, IGF1, NTRK3, and PLAU.
3.8 Verification results of ARGs align with expression trends in the risk prediction model
Immunohistochemical analysis utilizing the HPA database revealed a notable disparity in gene expression within cervical cancer tissues. Specifically, the genes ABL1, BAX, FASN, and PLAU exhibited abnormally high expression levels in cervical cancer specimens, in stark contrast to BCL2, IGF1, and NTRK3, which were markedly under-expressed (Figure 8A). This differential expression pattern was further corroborated by survival analysis. The survival curves, based on the expression of each gene, provided additional evidence supporting the observed expression trends in relation to patient survival outcomes (Figure 8B). Further in vitro experiments indicated that, compared to Ect1/e6e7 cells, Hela cells exhibited upregulated mRNA levels of BAX, FASN, and PLAU, while BCL2, IGF1, and NTRK3 mRNA levels were downregulated. ABL1 and EDA2R did not show significant differences (Figure 9A). To validate these findings further, we analyzed GEPIA data, which showed similar results. Compared with normal tissue, BAX, FASN, and PLAU were significantly upregulated in CESC, but ABL1 and EDA2R did not achieve the expected results (Figure 9B). Meanwhile, BCL2, IGF1, and NTRK3 expression were significantly reduced in the tumors.
Figure 8 Expression of ARGs in risk-stratified groups and their prognostic significance. (A) Comparison of ARGs expression through immunohistochemical staining in both control and pathological tissues. (B) Prognostic implications of ARGs for patients categorized into high and low-risk groups.
Figure 9 Verification results of ARGs. (A) Comparison of ARGs mRNA expression in normal and tumor cell lines using RT-qPCR (with GAPDH as the internal control). * p<0.05, **p<0.01, ***p<0.001. Data are presented as mean ± SD. (B) Prognostic indicators expressed in TCGA tumors vs TCGA normal + GTEx normal. The “ns” means “no significance”.
4 Discussion
In this study, we identified 33 differentially expressed ARGs associated with the prognosis of CESC and explored their clinical significance via consensus clustering. Analyzing these genes, we investigated the dynamic tumor microenvironment of CESC, particularly the immune milieu. Using Lasso-penalized Cox regression, we developed a prognostic model based on nine ARGs (BCL2, BAX, IGF1, PLAU, EDA2R, ABL1, FASN, NTRK3, and MIR200A), revealing significant clinical potential. This model demonstrated the association of these genes with the tumor immune microenvironment, notably T-cells and mast cells, and validated expression differences both in vivo and in vitro. Therefore, this model provides valuable insights for early screening, prognostic prediction, and clinical treatment of CESC.
Identifying specific biomarkers is crucial for early detection, prognosis, and personalized treatment (22). Understanding mechanisms like anti-apoptosis, cell invasion, and immune evasion is vital for developing targeted therapies. Although models based on ferroptosis, pyroptosis, and N6-methyladenosine have been proposed, their predictive efficacy is insufficient, necessitating new models (23–25). Anoikis serves as a natural barrier against metastasis by preventing the colonization of tumor cells at non-native sites. However, malignant tumor cells, mainly those capable of distant metastasis, often develop resistance to anoikis. They employ various strategies to circumvent this process, including the autocrine release of growth factors like IL-6, paracrine signaling with molecules such as VEGFa/VEGFR2, activation of pro-survival pathways like ERK and PI3K, and alterations in integrin expression patterns (26–28).
These adaptive mechanisms enable tumor cells to survive and thrive in new microenvironments, facilitating metastasis (29). Anoikis-based predictive models have broad applications. For instance, Tianlei X et al. (30) developed the four-gene feature “Ascore,” which demonstrated superior predictive capability for bladder cancer immunotherapy response, surpassing PD-L1. Additionally, Junyi L et al. (31) found that ARGs are closely associated with the drug resistance of clear cell renal cell carcinoma. The aberrant expression of ARGs is notably linked to the distant metastasis of cervical cancer, underscoring the need for more in-depth research in this area (32). In this study, ARGs-related nomograms demonstrated excellent prognostic predictive capability and were closely associated with the biological mechanisms of CESC development, consistent with previous research findings.
In the present study, we identified nine ARGs, including BCL2, BAX, IGF1, PLAU, EDA2R, ABL1, FASN, NTRK3, and MIR200A, which are closely associated with tumor development. The BCL2 protein family predominantly orchestrates the intrinsic apoptotic cascade, and perturbations within the TP53 pathway are frequently implicated in the oncogenesis of various tumors. Our findings substantiate this, revealing a conspicuous reduction in BCL2 expression within the high-risk cohort (33). Insulin-like growth factor-1 (IGF-1), essential for cellular proliferation, when inhibited, leads to significant shifts, including increased cellular accumulation at the G2M/S phase, augmented apoptosis, and reduced invasive capabilities of tumor cells (34). EDA2R mediates the activation of NF-κB and JNK pathways and is closely associated with cancer cachexia (35). Additionally, the ABL1 gene regulates cytoskeletal dynamics and is linked to tumor drug resistance and cell migration (36, 37). FASN encodes a fatty acid synthase that catalyzes the conversion of acetyl coenzyme A and malonyl coenzyme A to palmitate (38). Overexpression and hyperactivity of FASN are typically associated with malignant cells. Mutations in the NTRK gene, which encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family, promote myeloid medulloblastoma and secretory breast cancer (39). Although machine learning proved that these genes are closely related to CESC, further analytical biology experiments are needed to confirm their correlation with CESC.
In the present study, single-cell sequencing analysis revealed that BAX and PLAU are closely associated with lower apoptosis levels within CESC. BAX, a member of the pro-apoptotic protein family, is a crucial effector of mitochondrial apoptosis induced by most BH3 mimetics and chemotherapeutic agents (40). Recent studies have shown that co-targeting BAX and BCL-XL proteins can overcome cancer resistance to apoptosis (41). The present study indicated that high BAX expression is associated with poor prognosis, suggesting that using BAX activators might yield better clinical outcomes. Additionally, concurrent research has highlighted the pronounced upregulation of PLAU in cervical carcinoma cells. PLAU is known to be closely related to tumor diagnosis, therapeutic targeting, and patient prognosis (42). A recent study demonstrated that overexpressed PLAU in tumor tissues synergizes with FOXM1 to promote gastric cancer progression (43). Rigorous in vitro analyses have shown that targeted attenuation of PLAU expression significantly reduces the migratory and invasive capabilities of HeLa and HT3 cell lines. Furthermore, the core promoter of PLAU was delineated to reveal transcriptional regulation by YinYang 1 (YY1), a crucial modulator of PLAU mRNA expression (44). This study emphasizes the promising clinical applications of targeting BAX and PLAU in tumor cells. However, considering that the mechanisms of anoikis resistance primarily occur in distant metastatic tumors, our subsequent research will focus on single-cell sequencing results from both primary and distant metastatic tumors, which may provide more valuable insights.
Various methodologies for sample classification based on predefined gene expression profiles have been documented in the literature, employing diverse analytical techniques (45–47). In our investigation, we developed a nomogram predicated upon a selected array of ARGs, facilitating the stratification of patients into distinct prognostic categories. We observed marked variances in the expression of these ARGs among the identified subgroups, correlating significantly with patient prognoses. It underscores the efficacy of our nine-gene nomogram in prognostication, thereby aiding clinicians in devising tailored treatment strategies. Moreover, the DCA indicated that the nomogram, based on these nine genes, potentially offers clinical benefits to patients with CESC at one-, three-, and five-years post-diagnosis. Future research will aim to apply this model in clinical practice.
The study extended the examination to the TME, significantly influencing tumor metastasis and the efficacy of targeted therapies, building on the previously discussed nomogram-based classification (48). Our analysis included the infiltration levels of 21 immune cell types across different patient subtypes. High-risk subgroups with lower survival rates showed a significant increase in activated mast cell infiltration. Mast cells promote angiogenesis and neovascularization by releasing pro-angiogenic factors, including VEGF, FGF-2, PDGF, and IL-6, and non-classical factors like trypsin-like and chymotrypsin (49). This underscores their critical role in CESC progression. Among the nine identified risk genes, PLAU had the strongest correlation with activated mast cell levels, highlighting the PLAU/activated mast cell axis for further exploration. Additionally, severe dysregulation of the T-cell population, including decreased CD8+ T cells, was identified. Recent studies have shown that many patients do not benefit from immune checkpoint blockade therapy due to low CD8+ T cell infiltration in the TME (50). Furthermore, the specific biological mechanisms of ARGs and the TME, as well as whether targeted therapies can reduce tumor development, resistance, and distant metastasis, require more extensive in vivo studies, such as xenografts or allografts.
In conclusion, our study has successfully established a nine-gene model that exhibits remarkable accuracy in prognosticating outcomes for patients with CESC. The nomogram, derived from this model, serves as a valuable tool in clinical practice, enabling physicians to tailor personalized treatment strategies for CESC patients. However, it is crucial to delve deeper into the molecular mechanisms that underpin these gene signatures, particularly at the single-cell level, to gain a more comprehensive understanding of their role in tumor progression and patient prognosis. Furthermore, the expansion of our research to include larger patient cohorts and the conduction of prospective randomized clinical trials are imperative for validating and refining our model. Such endeavors will undoubtedly contribute significantly to the field of precision medicine, potentially leading to more effective and individualized therapeutic approaches for CESC patients.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
M-W: Data curation, Formal analysis, Methodology, Visualization, Writing – original draft, Writing – review & editing. Q-Y: Investigation, Project administration, Validation, Writing – original draft, Writing – review & editing. RD: Methodology, Writing – original draft. Y-X: Validation, Writing – original draft. JW: Writing – review & editing. Y-P: Writing – review & editing. BP: Writing – review & editing. G-X: Writing – review & editing. ZL: Conceptualization, Funding acquisition, Resources, Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. The CAMS Innovation Fund provided funding for this research for Medical Sciences under grant number 2021-I2M-1-060.
Acknowledgments
We thank the CAMS Innovation Fund for Medical Sciences for assistance.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1352638/full#supplementary-material
References
1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA: Cancer J Clin. (2021) 71:7–33. doi: 10.3322/caac.21654
2. Arbyn M, Weiderpass E, Bruni L, de Sanjosé S, Saraiya M, Ferlay J, et al. Estimates of incidence and mortality of cervical cancer in 2018: a worldwide analysis. Lancet Global Health. (2020) 8:e191–203. doi: 10.1016/S2214-109X(19)30482-6
3. Buskwofie A, David-West G, Clare CA. A review of cervical cancer: incidence and disparities. J Natl Med Assoc. (2020) 112:229–32. doi: 10.1016/j.jnma.2020.03.002
5. Goodman A. HPV testing as a screen for cervical cancer. Bmj. (2015) 350:h2372–2. doi: 10.1136/bmj.h2372
6. Schiffman M, Castle PE, Jeronimo J, Rodriguez AC, Wacholder S. Human papillomavirus and cervical cancer. Lancet. (2007) 370:890–907. doi: 10.1016/S0140-6736(07)61416-0
7. Barbera L, Thomas G. Management of early and locally advanced cervical cancer. Semin Oncol. (2009) 36:155–69. doi: 10.1053/j.seminoncol.2008.12.007
8. Cibula D, Dostalek L, Hillemanns P, Scambia G, Jarkovsky J, Persson J, et al. Completion of radical hysterectomy does not improve survival of patients with cervical cancer and intraoperatively detected lymph node involvement: ABRAX international retrospective cohort study. Eur J Cancer. (2021) 143:88–100. doi: 10.1016/j.ejca.2020.10.037
9. Mereu L, Pecorino B, Ferrara M, Tomaselli V, Scibilia G, Scollo P. Neoadjuvant chemotherapy plus radical surgery in locally advanced cervical cancer: retrospective single-center study. Cancers (Basel). (2023) 15:5207. doi: 10.3390/cancers15215207
10. Arip M, Tan LF, Jayaraj R, Abdullah M, Rajagopal M, Selvaraja M. Exploration of biomarkers for the diagnosis, treatment and prognosis of cervical cancer: a review. Discovery Oncol. (2022) 13:91. doi: 10.1007/s12672-022-00551-9
12. Raeisi M, Zehtabi M, Velaei K, Fayyazpour P, Aghaei N, Mehdizadeh A. Anoikis in cancer: The role of lipid signaling. Cell Biol Int. (2022) 46:1717–28. doi: 10.1002/cbin.11896
13. Rebhan M, Chalifa-Caspi V, Prilusky J, Lancet D. GeneCards: integrating information about genes, proteins and diseases. Trends Genet. (1997) 13:163. doi: 10.1016/S0168-9525(97)01103-7
14. Rouillard AD, Gundersen GW, Fernandez NF, Wang Z, Monteiro CD, McDermott MG, et al. The harmonizome: a collection of processed datasets gathered to serve and mine knowledge about genes and proteins. Database (Oxford). (2016) 2016:baw100. doi: 10.1093/database/baw100
15. Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinf. (2013) 14:7. doi: 10.1186/1471-2105-14-7
16. Simon N, Friedman J, Hastie T, Tibshirani R. Regularization paths for cox's proportional hazards model via coordinate descent. J Stat Softw. (2011) 39:1–13. doi: 10.18637/jss.v039.i05
17. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. (2015) 12:453–7. doi: 10.1038/nmeth.3337
18. Vickers AJ, Cronin AM, Elkin EB, Gonen M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. (2008) 8:53. doi: 10.1186/1472-6947-8-53
19. Sun D, Wang J, Han Y, Dong X, Ge J, Zheng R, et al. TISCH: a comprehensive web resource enabling interactive single-cell transcriptome visualization of tumor microenvironment. Nucleic Acids Res. (2021) 49:D1420–30. doi: 10.1093/nar/gkaa1020
20. Ponten F, Jirstrom K, Uhlen M. The Human Protein Atlas–a tool for pathology. J Pathol. (2008) 216:387–93. doi: 10.1002/path.2440
21. Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic Acids Res. (2017) 45:W98–W102. doi: 10.1093/nar/gkx247
22. Bedell SL, Goldstein LS, Goldstein AR, Goldstein AT. Cervical cancer screening: past, present, and future. Sex Med Rev. (2020) 8:28–37. doi: 10.1016/j.sxmr.2019.09.005
23. Han S, Wang S, Lv X, Li D, Feng Y. Ferroptosis-related genes in cervical cancer as biomarkers for predicting the prognosis of gynecological tumors. Front Mol Biosci. (2023) 10:1188027. doi: 10.3389/fmolb.2023.1188027
24. Chen D, Guo W, Yu H, Yang J. Construction and validation of prognostic prediction established on N6-methyladenosine related genes in cervical squamous cell carcinoma. Transl Cancer Res. (2022) 11:3064–79. doi: 10.21037/tcr
25. Zhang M, Wu P. Construction of prognosis model of pyroptosis gene and characterization of tumor microenvironment infiltration in cervical cancer. Medicine. (2022) 101:e31599. doi: 10.1097/MD.0000000000031599
26. Mehner C, Miller E, Hockla A, Coban M, Weroha SJ, Radisky DC, et al. Targeting an autocrine IL-6-SPINK1 signaling axis to suppress metastatic spread in ovarian clear cell carcinoma. Oncogene. (2020) 39:6606–18. doi: 10.1038/s41388-020-01451-4
27. Prasad CB, Singh D, Pandey LK, Pradhan S, Singh S, Narayan G. VEGFa/VEGFR2 autocrine and paracrine signaling promotes cervical carcinogenesis via beta-catenin and snail. Int J Biochem Cell Biol. (2022) 142:106122. doi: 10.1016/j.biocel.2021.106122
28. Weems AD, Welf ES, Driscoll MK, Zhou FY, Mazloom-Farsibaf H, Chang BJ, et al. Blebs promote cell survival by assembling oncogenic signalling hubs. Nature. (2023) 615:517–25. doi: 10.1038/s41586-023-05758-6
29. Taddei ML, Giannoni E, Fiaschi T, Chiarugi P. Anoikis: an emerging hallmark in health and diseases. J Pathol. (2012) 226:380–93. doi: 10.1002/path.3000
30. Xie T, Peng S, Liu S, Zheng M, Diao W, Ding M, et al. Multi-cohort validation of Ascore: an anoikis-based prognostic signature for predicting disease progression and immunotherapy response in bladder cancer. Mol Cancer. (2024) 23:23–30. doi: 10.1186/s12943-024-01945-9
31. Li J, Cao Q, Tong M. Deciphering anoikis resistance and identifying prognostic biomarkers in clear cell renal cell carcinoma epithelial cells. Sci Rep. (2024) 14:12044. doi: 10.1038/s41598-024-62978-0
32. Zhang M, Hong X, Ma N, Wei Z, Ci X, Zhang S. The promoting effect and mechanism of Nrf2 on cell metastasis in cervical cancer. J Transl Med. (2023) 21:433. doi: 10.1186/s12967-023-04287-0
33. Wang Y, Wang C, Liu N, Hou J, Xiao W, Wang H. HOXC6 promotes cervical cancer progression via regulation of Bcl-2. FASEB J. (2019) 33:3901–11. doi: 10.1096/fj.201801099RR
34. Javed S, Bhattacharyya S, Bagga R, Srinivasan R. Insulin growth factor-1 pathway in cervical carcinoma cancer stem cells. Mol Cell Biochem. (2020) 473:51–62. doi: 10.1007/s11010-020-03807-6
35. Bilgic SN, Domaniku A, Toledo B, Agca S, Weber BZC, Arabaci DH, et al. EDA2R-NIK signalling promotes muscle atrophy linked to cancer cachexia. Nature. (2023) 617:827–34. doi: 10.1038/s41586-023-06047-y
36. Ge G, Wen Y, Li P, Guo Z, Liu Z. Single-cell plasmonic immunosandwich assay reveals the modulation of nucleocytoplasmic localization fluctuation of ABL1 on cell migration. Anal Chem. (2023) 95:17502–12. doi: 10.1021/acs.analchem.3c02593
37. Kumar S, Talluri S, Zhao J, Liao C, Potluri LB, Buon L, et al. ABL1 kinase plays an important role in spontaneous and chemotherapy-induced genomic instability in multiple myeloma. Blood. (2024) 143:996–1005. doi: 10.1182/blood.2023021225
38. Ye M, Hu C, Chen T, Yu P, Chen J, Lu F, et al. FABP5 suppresses colorectal cancer progression via mTOR-mediated autophagy by decreasing FASN expression. Int J Biol Sci. (2023) 19:3115–27. doi: 10.7150/ijbs.85285
39. Yuan L, Bing Z, Yan P, Li R, Wang C, Sun X, et al. Integrative data mining and meta-analysis to investigate the prognostic role of microRNA-200 family in various human Malignant neoplasms: A consideration on heterogeneity. Gene. (2019) 716:144025. doi: 10.1016/j.gene.2019.144025
40. Garner TP, Lopez A, Reyna DE, Spitz AZ, Gavathiotis E. Progress in targeting the BCL-2 family of proteins. Curr Opin Chem Biol. (2017) 39:133–42. doi: 10.1016/j.cbpa.2017.06.014
41. Lopez A, Reyna DE, Gitego N, Kopp F, Zhou H, Miranda-Roman MA, et al. Co-targeting of BAX and BCL-XL proteins broadly overcomes resistance to apoptosis in cancer. Nat Commun. (2022) 13:1199. doi: 10.1038/s41467-022-28741-7
42. Liu Q, Li W, Yang S, Liu Z. High expression of uPA related to p38MAPK in esophageal cancer indicates poor prognosis. Onco Targets Ther. (2018) 11:8427–34. doi: 10.2147/OTT
43. Ai C, Zhang J, Lian S, Ma J, Gyorffy B, Qian Z, et al. FOXM1 functions collaboratively with PLAU to promote gastric cancer progression. J Cancer. (2020) 11:788–94. doi: 10.7150/jca.37323
44. Gao Y, Ma X, Lu H, Xu P, Xu C. PLAU is associated with cell migration and invasion and is regulated by transcription factor YY1 in cervical cancer. Oncol Rep. (2023) 49:25. doi: 10.3892/or.2022.8462
45. Li L, Weinberg CR, Darden TA, Pedersen LG. Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics. (2001) 17:1131–42. doi: 10.1093/bioinformatics/17.12.1131
46. Voyle N, Keohane A, Newhouse S, Lunnon K, Johnston C, Soininen H, et al. A pathway based classification method for analyzing gene expression for alzheimer's disease diagnosis. J Alzheimers Dis. (2016) 49:659–69. doi: 10.3233/JAD-150440
47. Benso A, Di Carlo S, Politano G, Savino A, Hafeezurrehman H. Building gene expression profile classifiers with a simple and efficient rejection option in R. BMC Bioinf. (2011) 12 Suppl 13:S3. doi: 10.1186/1471-2105-12-S13-S3
48. Lei X, Lei Y, Li JK, Du WX, Li RG, Yang J, et al. Immune cells within the tumor microenvironment: Biological functions and roles in cancer immunotherapy. Cancer Lett. (2020) 470:126–33. doi: 10.1016/j.canlet.2019.11.009
49. Komi DEA, Redegeld FA. Role of mast cells in shaping the tumor microenvironment. Clin Rev Allergy Immunol. (2020) 58:313–25. doi: 10.1007/s12016-019-08753-w
Keywords: cervical cancer, anoikis-related genes, immune microenvironment, decision curve analysis, prognostic nomogram
Citation: Wang M-, Ying Q-, Ding R, Xing Y-, Wang J, Pan Y-, Pan B, Xiang G- and Liu Z (2024) Elucidating prognosis in cervical squamous cell carcinoma and endocervical adenocarcinoma: a novel anoikis-related gene signature model. Front. Oncol. 14:1352638. doi: 10.3389/fonc.2024.1352638
Received: 08 December 2023; Accepted: 10 June 2024;
Published: 26 June 2024.
Edited by:
Penelope Jayne Duerksen-Hughes, Loma Linda University, United StatesReviewed by:
Haitao Wang, National Cancer Institute (NIH), United StatesGiuseppe Scibilia, Gynecology and Obstetrics Department, Italy
Basilio Pecorino, Kore University of Enna, Italy
Copyright © 2024 Wang, Ying, Ding, Xing, Wang, Pan, Pan, Xiang and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhong Liu, bGl1ekBpYnQucHVtYy5lZHUuY24=
†These authors have contributed equally to this work