- 1Department of Gynecology and Obstetrics, Sir Run Run Shaw Hospital, Medical School of Zhejiang University, Hangzhou, China
- 2Biomedical Informatics Research Lab, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
- 3Cancer Genomics Research Center, School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, Nanjing, China
- 4Big Data Research Institute, China Pharmaceutical University, Nanjing, China
- 5Department of Medical Oncology, Sir Run Run Shaw Hospital, Medical School of Zhejiang University, Hangzhou, China
- 6Department of Rehabilitation Medicine, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- 7Department of Gynecology and Obstetrics, Women's Hospital, School of Medicine, Zhejiang University, Hangzhou, China
Background: Human papillomavirus-positive (HPV+) cervical cancers are highly heterogeneous in clinical and molecular characteristics. Thus, an investigation into their heterogeneous immunological profiles is meaningful in providing both biological and clinical insights into this disease.
Methods: Based on the enrichment of 29 immune signatures, we discovered immune subtypes of HPV+ cervical cancers by hierarchical clustering. To explore whether this subtyping method is reproducible, we analyzed three bulk and one single cell transcriptomic datasets. We also compared clinical and molecular characteristics between the immune subtypes.
Results: Clustering analysis identified two immune subtypes of HPV+ cervical cancers: Immunity-H and Immunity-L, consistent in the four datasets. In comparisons with Immunity-L, Immunity-H displayed stronger immunity, more stromal contents, lower tumor purity, proliferation potential, intratumor heterogeneity and stemness, higher tumor mutation burden, more neoantigens, lower levels of copy number alterations, lower DNA repair activity, as well as better overall survival prognosis. Certain genes, such as MUC17, PCLO, and GOLGB1, showed significantly higher mutation rates in Immunity-L than in Immunity-H. 16 proteins were significantly upregulated in Immunity-H vs. Immunity-L, including Caspase-7, PREX1, Lck, C-Raf, PI3K-p85, Syk, 14-3-3_epsilon, STAT5-α, GATA3, Src_pY416, NDRG1_pT346, Notch1, PDK1_pS241, Bim, NF-kB-p65_pS536, and p53. Pathway analysis identified numerous immune-related pathways more highly enriched in Immunity-H vs. Immunity-L, including cytokine-cytokine receptor interaction, natural killer cell-mediated cytotoxicity, antigen processing and presentation, T/B cell receptor signaling, chemokine signaling, supporting the stronger antitumor immunity in Immunity-H vs. Immunity-L.
Conclusion: HPV+ cervical cancers are divided into two subgroups based on their immune signatures' enrichment. Both subgroups have markedly different tumor immunity, progression phenotypes, genomic features, and clinical outcomes. Our data offer novel perception in the tumor biology as well as clinical implications for HPV+ cervical cancer.
Introduction
The human papillomavirus (HPV) infection is a leading etiology for cervical cancer, the most common gynecological malignancy (1). Among 15 high-risk HPV strains, HPV16 and HPV18 infection are associated with 70% of HPV-related cervical cancers (2). Squamous cell carcinomas and adenocarcinomas constitute 90 and 10% of cervical cancers, respectively. Furthermore, cervical squamous cell carcinomas are predominantly HPV-positive (HPV+), compared to 25% of cervical adenocarcinomas being HPV-negative (HPV-) (3). Prior investigations have demonstrated the high heterogeneity of cervical cancer (4). For example, The Cancer Genome Atlas (TCGA) network classified cervical cancers into four subtypes, namely HPV clade A9, A7, HPV-negative, and other, based on the HPV status (4). TCGA also identified three subtypes of cervical cancers: hormone, epithelial-mesenchymal transition (EMT), and PI3K-AKT, based on protein expression profiles (4). By an integrative analysis of multi-omics profiles, including mRNA and microRNA expression, copy number alterations (CNAs), and DNA methylation, TCGA uncovered three subtypes of cervical cancer: keratin-high squamous, keratin-low squamous, and adenocarcinoma-rich (4). Li et al. (5) identified six cervical cancer subtypes by consensus clustering, of which two immune clusters-associated gene signatures were valuable prognostic factors.
Surgery, radiotherapy, and chemotherapy are the major therapeutic approaches for cervical cancer to date. However, these approaches often provide only limited efficiency for late stage of cervical cancers (6). Immunotherapies, particularly immune checkpoint inhibitors (ICIs), have recently demonstrated successes in treating various malignancies, including melanoma, lung cancer, bladder cancer, head and neck cancer, kidney cancer, liver cancer, breast cancer, cervical cancer, prostate cancer, and the cancers with high tumor mutation loads or mismatch repair deficiency (dMMR). Studies have shown that ICIs may result in the remission of virus infection-associated malignancies, including HPV infection-associated cervical (7) and head and neck (8) cancers. Nevertheless, the response rates to ICIs are relatively low, with currently only around 20% of cancer patients responding to ICIs (9). Thus, identifying the determinants of immunotherapy responses may aid in improving the cancer immunotherapeutic efficiency. Accumulating evidence has demonstrated that, certain molecular features, e.g., PD-L1 expression (10), high tumor mutation burden (TMB) (11), and dMMR (12), are associated with better immunotherapeutic responsiveness. In addition, the “hot” tumors with strong immune infiltration likely respond better to immunotherapy than the “cold” tumors with weak immune infiltration (13). Thus, differentiating “hot” tumors from “cold” tumors may facilitate the optimal selection of cancer patients for immunotherapy. With the recent emergence of massive multi-omics data in cancer research, many algorithms have been developed to uncover “hot” and “cold” tumor subtypes, such as unsupervised machine learning algorithms (14–16).
In the present study, we identified immune-related subtypes of HPV+ cervical cancers based on the enrichment of 29 immune signatures by unsupervised machine learning. This analysis identified two subtypes of HPV+ cervical cancers, which exhibited high and low enrichment of immune signatures, respectively, reproducibly in four different cohorts. Furthermore, we comprehensively compared clinical and molecular characteristics between both subtypes of HPV+ cervical cancers. This study could offer new understanding of the tumor biology as well as clinical implications for the management of HPV+ cervical cancer.
Methods
Datasets
We downloaded multi-omics data for cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) in TCGA, termed TCGA-CESC (4), from the genomic data commons (GDC) data portal (https://portal.gdc.cancer.gov/). These multi-omics data included mRNA gene expression profiles (normalized by RSEM), somatic mutations (“maf” file), somatic copy number alterations (SCNAs) (“SNP6” files), protein expression profiles (Reverse Phase Protein Array (RPPA), normalized), and clinical data. We obtained a CESC transcriptomic dataset (SGCX) from a prior publication (17), and a CESC transcriptomic dataset (GSE29570) (18) from the NCBI gene expression omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/). We also downloaded a single-cell transcriptomic dataset (GSE171894) for CESC from the NCBI GEO. A description of these datasets is shown in Supplementary Table S1.
Single-sample gene set enrichment analysis
We employed the single-sample gene set enrichment analysis (ssGSEA) (19) to assess the enrichment levels of immune signatures, biological processes, and pathways in tumors. The ssGSEA evaluates the enrichment score of a gene set in a sample based on the expression profiles of their marker or pathway genes (19). The marker or pathway genes of immune signatures, pathways, and biological processes we analyzed are listed in Supplementary Table S2. The “GSVA” R package (19) was utilized to perform the ssGSEA.
Clustering analysis
We identified immune-related subtypes of HPV+ cervical cancers by hierarchical clustering based on the enrichment scores of 29 immune signatures. These immune signatures included antigen presenting cell (APC) co-inhibition, APC co-stimulation, B cells, cytokine and cytokine receptor (CCR), CD8+ T cells, immune checkpoint, cytolytic activity, dendritic cells (DCs), activated DCs (aDCs), immature DCs (iDCs), plasmacytoid dendritic cells (pDCs), human leukocyte antigen (HLA), inflammation-promoting, macrophages, mast cells, major histocompatibility complex (MHC) class I, neutrophils, natural killer (NK) cells, parainflammation, T cell co-inhibition, T cell co-stimulation, T helper cells, T follicular helper (Tfh), T helper 1 (Th1) cells, T helper 2 (Th2) cells, tumor infiltrating lymphocytes (TILs), regulatory T (Treg) cells, Type I interferon (IFN) response, and Type II IFN response. Before clustering, we performed normalization of the ssGSEA scores by Z-score and transformed them into distance matrices by the R function “dist” with the parameter: method = “euclidean.” The hierarchical clustering was implemented with the function “hclust” in the R package “Stats” with the parameters: method = “ward.D2” and members = NULL.
Class prediction
We employed the Random Forest (RF) algorithm to predict the immune-related subtypes of HPV+ cervical cancers based on the ssGSEA scores of immune signatures. The number of trees was 500 and the predictors were the 29 immune signatures in the RF. The accuracy and weighted F-score were reported as the prediction performance. The RF algorithm was implemented by using the “randomForest” R package (20).
Survival analysis
We utilized the Kaplan-Meier (K-M) model (21) to compare overall survival (OS) and disease-free survival (DFS) time between the immune subtypes of HPV+ cervical cancers. K-M curves were plotted to display the survival time differences, and the log-rank test was utilized to assess whether the survival time differences were significant with a threshold of P < 0.05.
Calculation of TMB, SCNA, intratumor heterogeneity, immune scores, stromal content, and tumor purity in tumors
We determined a tumor's TMB as the total number of its somatic mutations. With the input of “SNP6” files, we employed GISTIC2 (22) to calculate G-scores in tumors. The G-score represents the amplitude of the SCNA and the frequency of its occurrence across a class of samples (22). We utilized the DITHER algorithm (23) to score ITH, which measures ITH based on both somatic mutation and SCNA profiles. The ESTIMATE algorithm (24) was used to calculate immune scores, stromal scores, and tumor purity for bulk tumors, which represent immune infiltration levels, stromal contents, and proportions of tumor cells in bulk tumors.
Pathway analysis
To identify pathways more enriched in one subgroup relative to another subgroup, we first identified upregulated genes in the subgroup vs. another subgroup by Student's t-test with a threshold of false discovery rate (FDR) < 0.05 and fold change (FC) > 1.5. By inputting the upregulated genes into the GSEA web tool (25), we obtained the KEGG (26) pathways more enriched in the subgroup with a threshold of FDR < 0.05.
Single-cell RNA sequencing data analysis
We analyzed a scRNA-seq transcriptomic dataset (GSE171894) for HPV+ cervical cancers. The gene expression values in single cells were normalized by transcripts per million (TPM). We performed hierarchical clustering of HPV+ cervical cancer single cells based on immune signatures' scores to identify their subtypes. In addition, the single-cell consensus clustering (SC3) method (27) were utilized to cluster single cells in each immune subtype, respectively.
Statistical analysis
We used the two-tailed Student's t-test to compare two classes of normally-distributed data, including gene expression values, protein expression values, and the enrichment ratios of two immune signatures. The ratios were the log2-transformed values of the average expression levels of all marker genes in an immune signature over those of all marker genes in another immune signature. When comparing two classes of non-normally distributed data, including ssGSEA scores, TMB, neoantigens, ITH, immune scores, stromal scores, and tumor purity, we used the one-tailed Mann–Whitney U-test. We used the Spearman method to assess the correlation between pathway enrichment scores (ssGSEA scores) and immune scores. The Fisher's exact test was utilized to analyze contingency tables. To adjust for P-values in multiple tests, we calculated FDR by using the Benjamini and Hochberg method (28). We performed all statistical analyses and visualizations with the R programming (version 3.6.1).
Results
Unsupervised clustering identifies two immune subtypes of HPV+ cervical cancer
Based on the enrichment scores of 29 immune signatures, we identified two immune subtypes of HPV+ cervical cancer, consistently in three datasets (TCGA-CESC, GSE29570, and SGCX) (Figure 1A). Both subtypes had high and low enrichment of these immune signatures, termed Immunity-H and Immunity-L, respectively. Of note, both immunostimulatory signatures (such as CD8+ T cells, T cell co-stimulation, APC co-stimulation, cytolytic activity, and NK cells) and immunosuppressive signatures (such as CD4+ regulatory T cells, APC co-inhibition, T cell co-inhibition, and immune checkpoint molecules) showed significantly higher enrichment levels in Immunity-H than in Immunity-L (Figure 1A). Nevertheless, the ratios of immunostimulatory to immunosuppressive signatures (CD8+/CD4+ regulatory T cells, pro-/anti-inflammatory cytokines, and M1/M2 macrophages) were significantly higher in Immunity-H than in Immunity-L (two-tailed Student's t-test, P < 0.05) (Figure 1B). Collectively, these results supported that Immunity-H had more active antitumor immune responses than Immunity-L.
Figure 1. Clustering analysis identifies two immune subtypes of HPV+ cervical cancer. (A) Based on the enrichment levels of 29 immune signatures, hierarchical clustering identifies two immune subtypes: Immunity-H and Immunity-L, which have high and low immunity, respectively, consistently in three datasets. (B) Immunity-H has significantly higher ratios of immunostimulatory to immunosuppressive signatures than Immunity-L. The two-tailed Student's t-test P-values are shown. (C) Prediction of the immune subtypes of HPV+ cervical cancer by Random Forest based on the enrichment scores of 29 immune signatures. The 10-fold cross-validation results in the training set and prediction results in the other test sets are shown.
To explore whether this subtyping method is predictable, we used one of the three datasets as the training set and the other two datasets as test sets in turn to predict both subtypes with RF. The 10-fold CV accuracies and weighted F-scores in the training sets were >77%. The prediction accuracies and weighted F-scores in test sets were all above 80% (Figure 1C). These results support that the subtyping is predictable and robust.
The immune subtypes of HPV+ cervical cancer have significantly different clinical and molecular characteristics
We compared 5-year OS and DFS prognosis between both subtypes in TCGA-CESC, which had related data available. Kaplan-Meier curves showed that Immunity-H had significantly higher OS rates than Immunity-L (log-rank test, P < 0.05) (Figure 2A). We found that the proportion of tumor-free patients was significantly higher in Immunity-H than in Immunity-L (Fisher's exact test, P = 0.009; odds ratio (OR) = 0.46) (Figure 2B). It confirmed that Immunity-H had a better prognosis than Immunity-L. We further compared several phenotypic or molecular features associated with tumor progression, including tumor proliferation, stemness, and ITH. Of note, these features were more enriched in Immunity-L than in Immunity-H (P < 0.05) (Figure 2C). Furthermore, tumor purity was likely higher in Immunity-L than in Immunity-H, while stromal content was more enriched in Immunity-H than in Immunity-L (Figure 2D).
Figure 2. Comparisons of clinical and molecular features between the immune subtypes of HPV+ cervical cancer. (A) Kaplan–Meier curves showing 5-year overall survival and disease-free survival time differences between the immune subtypes. The log-rank test P-values are shown. (B) The proportion of tumor-free patients is significantly higher in Immunity-H than in Immunity-L The Fisher's exact-test P-value is shown. (C) Immunity-H displays significantly lower enrichment scores of tumor proliferation, stemness, and ITH than in Immunity-L. (D) Comparisons of tumor purity and stromal content between the immune subtypes. The one-tailed Mann–Whitney U-test P-values are shown in (C,D). ITH, intratumor heterogeneity.
Immunity-H had significantly higher TMB (P = 0.008) and thus more predicted neoantigens (29) (P = 0.034) than Immunity-L (Figure 3A). In contrast, Immunity-H had significantly lower levels of tumor aneuploidy, also known as CNA, than Immunity-L, as evidenced by markedly lower CNA scores (30) and G-scores of copy number amplifications and deletions (Figure 3B). These results conform to previous findings that TMB and tumor aneuploidy correlate positively and negatively with antitumor immune responses, respectively (31). Furthermore, we compared the enrichment of nine major DNA damage response (DDR) pathways between both subtypes. These pathways included base excision repair, mismatch repair, nucleotide excision repair, the Fanconi anemia (FA) pathway, homology-dependent recombination, non-homologous DNA end joining, direct damage reversal/repair, translesion DNA synthesis, and damage sensor (30). Of note, six of the nine pathways displayed significantly higher enrichment in Immunity-L than in Immunity-H (P < 0.05) (Figure 3C). It indicates that Immunity-L has a stronger DDR than Immunity-H. It is justified since Immunity-L displays higher genomic instability than Immunity-H.
Figure 3. Comparisons of somatic mutation, copy number alteration, and protein expression profiles between the immune subtypes of HPV+ cervical cancer. (A) Immunity-H has significantly higher tumor mutation burden (TMB) and more predicted neoantigens than Immunity-L. (B) Immunity-H has significantly lower extent of copy number alterations than Immunity-L. (C) Six of the nine major DNA damage response (DDR) pathways shows significantly higher enrichment scores in Immunity-L than in Immunity-H. BER, base excision repair. DR, direct damage reversal/repair. NER, nucleotide excision repair. MMR, mismatch repair. FA, Fanconi Anemia HR, homologous recomination. NHEJ, non-homologous end joining. TLS, translesion synthesis. (D) The mutation profiles of the genes in HPV+ Cervical cancer, which have significantly higher mutation rates in Immunity-L than in Immunity-H. (E) Heatmap shows 25 proteins differentially expressed between Immunity-L and Immunity-H (two-tailed Student's t-test, FDR < 0.05). The one-tailed Mann–Whitney U-test P-values are shown in (A–C). *, P < 0.05; **, P < 0.01; ***, P < 0.001; ns, P ≥ 0.05.
We found 17 genes showing significantly higher mutation rates in Immunity-L than in Immunity-H (Fisher's exact test, P < 0.05; OR > 2), but no gene having significantly higher mutation rates in Immunity-H versus Immunity-L (Figure 3D). These genes included PCLO, TTN, IGSF10, MUC17, FAT3, APOB, MED1, ODZ2, UBR4, CREBBP, GOLGB1, USH2A, ABCA12, HSPG2, SPEN, CELSR3, MDC1, PLXNB2, MLL3, NAV3, BSN, and RYR3. CREBBP (CREB binding protein) encodes a transcription factor that is involved in the transcriptional coactivation of many transcription factors. This gene had a mutation rate of 10% in Immunity-L vs. 2.8% in Immunity-H (P = 0.01). Its mutations have been associated with antitumor immunosuppression (32), consistent with our result. MUC17 (mucin 17, cell surface associated) encodes a member of membrane-bound mucins, which are involved in tumor immunomodulation (33). This gene had a mutation rate of 11.6% in Immunity-L vs. 2.1% in Immunity-H (P = 0.002). PCLO (piccolo presynaptic cytomatrix protein) encodes a protein which is part of the presynaptic cytoskeletal matrix. This gene's mutations have been associated with alterations of tumor immunity (34). GOLGB1 (golgin B1) encodes a protein located in Golgi apparatus and endoplasmic reticulum-Golgi intermediate compartment. Previous studies have shown that GOLGB1 mutations were more prevalent in “cold” tumors (35), consistent with our analysis revealing its higher mutation rate in Immunity-L vs. Immunity-H (10.1 vs. 2.8%; P = 0.014).
We compared the expression levels of 226 proteins between both subtypes and found 25 differentially expressed proteins (DEPs) (two-tailed Student's t-test, P < 0.05) (Figure 3E). Among the DEPs, 16 were upregulated in Immunity-H, including Caspase-7, PREX1, Lck, C-Raf, PI3K-p85, Syk, 14-3-3_epsilon, STAT5-α, GATA3, Src_pY416, NDRG1_pT346, Notch1, PDK1_pS241, Bim, NF-kB-p65_pS536, and p53. Of note, many of these proteins play a role in the positive regulation of antitumor immune responses. For example, Caspase-7 is positively associated with antitumor immune responses by the regulation of apoptosis (36). PREX1 acts as a guanine nucleotide exchange factor for the RHO family of small GTP-binding proteins (RACs) to promote antitumor immune responses (37). Lck incites antitumor immune responses by regulating T cell development (38). Syk as a tumor suppressor has a role in driving antitumor immune responses (39). STAT5 has also been shown to promote antitumor immunity (40). Among the DEPs, nine were downregulated in Immunity-H, including IGFBP2, Rab25, AMPK-α, TAZ, XRCC1, AMPK_pT172, Src_pY527, PKC-alpha, and YB-1. It suggests that these proteins have a negative association with antitumor responses. Previous studies have shown that IGFBP2 upregulation can drive the growth of tumors and promote antitumor immunosuppression (41), supporting our result. TAZ is a component of the Hippo signaling pathway, which may promote tumor immune evasion (15, 42). It is in agreement with our finding that this protein is downregulated in Immunity-H. YB-1 is a cold shock domain protein implicated in numerous cellular processes, whose upregulation drives cancer proliferation and immune evasion (43). Again, it is consistent with our finding. XRCC1 is involved in DNA repair, whose upregulation indicates a stronger DNA repair capacity. It is in line with the previous finding that the DDR pathways are more enriched in Immunity-L than in Immunity-H. A previous study revealed that AMP-activated protein kinase (AMPK) plays a pivotal role in regulating the response to immune-checkpoint blockade, indicating that AMPK agonists may promote the efficacy of immunotherapy (44). This previous finding supports our result that AMPK is upregulated in “cold” tumors.
Identification of pathways enriched in the immune subtypes of HPV+ cervical cancer
GSEA (25) identified numerous KEGG pathways more enriched in Immunity-H vs. Immunity-L. As expected, many immune-relevant pathways were included in the list, including cytokine-cytokine receptor interaction, natural killer cell-mediated cytotoxicity, antigen processing and presentation, T cell receptor signaling, Toll-like receptor signaling, Fc gamma R-mediated phagocytosis, Jak-STAT signaling pathway, B cell receptor signaling, Fc epsilon RI signaling, NOD-like receptor signaling, cytosolic DNA-sensing, cell adhesion molecules (CAMs), chemokine signaling, RIG-I-like receptor signaling, and complement and coagulation cascades (Figure 4A). Besides, some stromal pathways were also included in the list, such as regulation of actin cytoskeleton and focal adhesion. It is consistent with the previous finding that Immunity-H has higher stromal content than Immunity-L. In addition, several cancer-associated pathways were more enriched in Immunity-H vs. Immunity-L, including apoptosis, VEGF signaling, calcium signaling, MAPK signaling, and Wnt signaling pathways. It indicates a positive association between the enrichment of these pathways and antitumor immune responses. Indeed, we found that the enrichment scores of most these pathways were positively correlated with immune scores in HPV+ cervical cancer (Spearman correlation, P < 0.05) (Figure 4B). These results conform to previous findings of the significant positive association between these pathways' enrichment and antitumor immunity (45). We also identified certain pathways more enriched in Immunity-L vs. Immunity-H, most of which were metabolism relevant, such as maturity onset diabetes of the young, metabolism of xenobiotics by cytochrome P450, drug metabolism - cytochrome P450, retinol metabolism, O-Glycan biosynthesis, starch and sucrose metabolism, and glycosphingolipid biosynthesis - lacto and neolacto series.
Figure 4. KEGG pathways more enriched in Immunity-H vs. Immunity-L. (A) The cancer, immune, and stroma-associated pathways showing significantly higher enrichment in Immunity-H (FDR <0.05). FDR, false discovery rate. (B) Spearman correlations between the enrichment scores of the cancer-associated pathways upregulated in Immunity-H and immune scores in HPV+ cervical cancer.
Unsupervised clustering identifies two immune subtypes of HPV+ cervical cancer single cells
Likewise, we performed a hierarchical clustering of HPV+ cervical cancer single cells in a scRNA-seq dataset (GSE171894), which was a gene expression profiling in 3,356 cancer cells from 2 HPV+ cervical cancer patients. Because this clustering analysis was performed in cancer single cells, we used four immune-related pathways which are expressed in cancer cells themselves. These pathways included antigen processing and presentation, JAK-STAT signaling, apoptosis, and PD-L1 expression pathway in cancer. Similarly, we identified two clusters of these cancer single cells, also termed Immunity-H and Immunity-L, respectively. Immunity-H and Immunity-L displayed high and low enrichment scores of these pathways (Figure 5A). As expected, Immunity-H had significantly higher PD-L1 expression levels than Immunity-L (P < 0.001); Immunity-H also showed significantly higher expression levels of many HLA genes than Immunity-L (P < 0.001); Immunity-L displayed significantly stronger proliferation and stemness signatures than Immunity-H (P < 0.001) (Figure 5B). We further performed the consensus clustering of Immunity-H and Immunity-L single cells by SC3 (27), respectively. The clustering identified 37 and 40 cell clusters in Immunity-H and Immunity-L single cells, respectively (Figure 5C). It suggests a higher heterogeneity of cancer cells in Immunity-L compared to Immunity-H, consistent with the result from bulk tumors. Immunity-L had significantly higher enrichment of the DDR pathways than Immunity-H (Figure 5D). These results are consistent with those in bulk tumors.
Figure 5. Validation of the immune-specific subtyping method in HPV+ cervical cancer single cells. (A) Hierarchical clustering of 3,356 cancer cells from 2 HPV+ cervical cancer patients based on the enrichment scores of four immune-related pathways. (B) The expression levels of PD-L1 and many HLA genes are significantly higher in Immunity-H than in Immunity-L cancer single cells, while tumor stemness and proliferation scores are significantly lower in Immunity-H cancer single cells. (C) Consensus clustering of single cells in Immunity-H and Immunity-L by SC3 (27) identifying 37 and 40 cell clusters, respectively. (D) Seven of the nine major DNA damage response pathways shows significantly higher enrichment scores in Immunity-L than in Immunity-H. BER, base excision repair. DR, direct damage reversal/repair. NER, nucleotide excision repair. MMR, mismatch repair. FA, Fanconi Anemia. HR, homologous recomination. NHEJ, non-homologous end joining. TLS, translesion synthesis.
Discussion
The tumor immune microenvironment (TIME) plays crucial roles in determining cancer prognosis and therapy responses (46). Meanwhile, the TIME heterogeneity is prevalent across tumors (47). Hence, a classification of HPV+ cervical cancer based on the TIME would have potential clincal values in risk stratification and effective treatment of this disease. To this end, we identified two immune subtypes of HPV+ cervical cancer based on the enrichment of 29 immune signatures by unsupervised clustering. We demonstrated the reproducibility and predictability of this subtyping method by analyzing four different datasets, including three bulk tumor datasets and one single cell dataset. Compared to Immunity-L, Immunity-H showed higher immunity, more stromal contents, lower tumor purity, lower tumor proliferation potential, stemness, and ITH, higher TMB, more neoantigens, lower levels of CNAs, lower DDR activity, as well as better overall survival prognosis (Figure 6). Our analysis confirmed that “hot” tumors have a stronger antitumor immunity and thus more favorable prognosis vs. “cold” tumors (15, 45, 48). Previous studies have shown that TMB and CNAs correlate positively and negatively with antitumor immune response, respectively (31). This is consistent with our results that Immunity-H had higher TMB and lower levels of CNAs than Immunity-L. In addition, the markedly lower ITH in Immunity-H vs. Immunity-L supports the notion that ITH can contribute to tumor immune evasion (23, 47).
Figure 6. A schematic illustration to summarize the molecular and clinical features of both immune subtypes of HPV+ cervical cancer. HPV+ cervical cancer can be classified into two subtypes with high and low immunity, respectively. The low immunity is associated with tumor progressive phenotypes, unfavorable prognosis, and genomic instability in HPV+ cervical cancer. The figure was created with BioRender.com.
In addition to immune pathways, Immunity-H was more enriched in several cancer-associated pathways, including apoptosis, VEGF signaling, calcium signaling, MAPK signaling, and Wnt signaling pathways. We demonstrated that most of these pathways has positive associations of their enrichment with immune infiltration levels in HPV+ cervical cancer. In fact, the positive association between these pathway' enrichment and immune infiltration have been implicated in many other cancers, such as head and neck (45), gastric (48), and breast cancers [50].
Using the uncentered correlation and centroid linkage method, the TCGA network performed mRNA clustering of cervical cancers (4). There were three cervical cancer clusters identified, termed C1, C2, and C3, respectively. We found that 39.74% of Immunity-L tumors belonged to C1, compared to 9.19% of Immunity-H tumors in C1 (P < 0.001). It is in line with that C1 has the worst prognosis among the three clusters. In contrast, 64.37 and 26.44% of Immunity-H tumors belonged to C2 and C3, respectively, compared to 46.15 and 14.10% of Immunity-L tumors in C2 and C3, respectively. These results suggest that our immune-based subtyping method is reasonable in terms of the prognostic relevance.
By analyzing multiple datasets using both unsupervised and supervised machine learning approaches, we demonstrated that this subtyping method for HPV+ cervical cancer was stable and predictable. It suggests that this method has the application potential in clinical practice. However, further experimental and clinical validation is necessary to translate our findings into clinical practice. It should be a priority in our future studies.
Conclusion
HPV+ cervical cancers are classified into two immune subtypes based on their immune signatures' enrichment levels. The immune subtypes have significantly different immunity, tumor proliferation potential, stemness, ITH, TMB, neoantigens, CNAs, DDR activity, and overall survival prognosis. Our data offer new understanding of tumor immunity for HPV+ cervical cancer and clinical association with its immunotherapy.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.
Author contributions
GS: software, validation, formal analysis, investigation, data curation, and writing—review and editing. JL: software, validation, formal analysis, investigation, data curation, and visualization. SZ and FL: investigation and data curation. TZ and XZ: investigation and formal analysis. JY: investigation and supervision. XW: conceptualization, methodology, resources, investigation, writing—original draft, writing—review and editing, supervision, and project administration. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the China Pharmaceutical University (Grant number 3150120001 to XW).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2022.979933/full#supplementary-material
References
1. Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer. (2015) 136:E359–86. doi: 10.1002/ijc.29210
2. Burd EM, Human papillomavirus and cervical cancer. Clin Microbiol Rev. (2003) 16:1-17. doi: 10.1128/CMR.16.1.1-17.2003
3. Xing B, Guo J, Sheng Y, Wu G, Zhao Y. Human papillomavirus-negative cervical cancer: a comprehensive review. Front Oncol. (2020) 10:606335. doi: 10.3389/fonc.2020.606335
4. Cancer Genome Atlas Research Network; Albert Einstein College of Medicine; Analytical Biological Services; Barretos Cancer Hospital; Baylor College of Medicine; Beckman Research Institute of City of Hope; Buck Institute for Research on Aging, et al., Integrated genomic and molecular characterization of cervical cancer. Nature. (2017) 543:378-84. doi: 10.1038/nature21386
5. Li B, Cui Y, Nambiar DK, Sunwoo JB, Li R. The immune subtypes and landscape of squamous cell carcinoma. Clin Cancer Res. (2019) 25:3528–37. doi: 10.1158/1078-0432.CCR-18-4085
6. Das BC, Hussain S, Nasare V, Bharadwaj M. Prospects and prejudices of human papillomavirus vaccines in India. Vaccine. (2008) 26:2669–79. doi: 10.1016/j.vaccine.2008.03.056
7. Stevanovic S, Draper LM, Langhan MM, Campbell TE, Kwong ML, Wunderlich JR, et al. Complete regression of metastatic cervical cancer after treatment with human papillomavirus-targeted tumor-infiltrating T cells. J Clin Oncol. (2015) 33:1543–50. doi: 10.1200/JCO.2014.58.9093
8. Burtness B, Harrington KJ, Greil R, Soulières D, Tahara M, de Castro G Jr, et al. Pembrolizumab alone or with chemotherapy versus cetuximab with chemotherapy for recurrent or metastatic squamous cell carcinoma of the head and neck (KEYNOTE-048): a randomized, open-label, phase 3 study. Lancet. (2019) 394:1915–28. doi: 10.1016/S0140-6736(19)32591-7
9. Yanik EL, Kaunitz GJ, Cottrell TR, Succaria F, McMiller TL, Ascierto ML, et al. Association of HIV status with local immune response to anal squamous cell carcinoma: implications for immunotherapy. JAMA Oncol. (2017) 3:974–8. doi: 10.1001/jamaoncol.2017.0115
10. Cristescu R, Mogg R, Ayers M, Albright A, Murphy E, Yearley J, et al., Pan-tumor genomic biomarkers for PD-1 checkpoint blockade-based immunotherapy. Science. (2018) 362:eaar3593. doi: 10.1126/science.aar3593
11. Patel SP, Kurzrock R. PD-L1 Expression as a Predictive Biomarker In Cancer Immunotherapy. Mol Cancer Ther. (2015) 14:847–56. doi: 10.1158/1535-7163.MCT-14-0983
12. Allgäuer M, Budczies J, Christopoulos P, Endris V, Lier A, Rempel E, et al. Implementing tumor mutational burden (TMB) analysis in routine diagnostics-a primer for molecular pathologists and clinicians. Transl Lung Cancer Res. (2018) 7:703–15. doi: 10.21037/tlcr.2018.08.14
13. Oliveira AF, Bretes L, Furtado I. Review of PD-1/PD-L1 inhibitors in metastatic dMMR/MSI-H colorectal cancer. Front Oncol. (2019) 9:396. doi: 10.3389/fonc.2019.00396
14. Haanen JBAG. Converting cold into hot tumors by combining immunotherapies. Cell. (2017) 170:1055–6. doi: 10.1016/j.cell.2017.08.031
15. Wu F, Wang ZL, Wang KY Li GZ, Chai RC, Liu YQ, et al. Classification of diffuse lower-grade glioma based on immunological profiling. Mol Oncol. (2020) 14:2081–95. doi: 10.1002/1878-0261.12707
16. He Y, Jiang Z, Chen C, Wang X. Classification of triple-negative breast cancers based on Immunogenomic profiling. J Exp Clin Cancer Res. (2018) 37:327. doi: 10.1186/s13046-018-1002-1
17. Liu Q, Nie R, Li M, Li L, Zhou H, Lu H, et al. Identification of subtypes correlated with tumor immunity and immunotherapy in cutaneous melanoma. Comput Struct Biotechnol J. (2021) 19:4472–85. doi: 10.1016/j.csbj.2021.08.005
18. Ojesina AI, Lichtenstein L, Freeman SS, Pedamallu CS, Imaz-Rosshandler I, Pugh TJ, et al. Landscape of genomic alterations in cervical carcinomas. Nature. (2014) 506:371–5. doi: 10.1038/nature12881
19. Lee HO, Hong Y, Etlioglu HE, Cho YB, Pomella V, Van den Bosch B, et al. Lineage-dependent gene expression programs influence the immune landscape of colorectal cancer. Nat Genet. (2020) 52:594–603. doi: 10.1038/s41588-020-0636-z
20. Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinform. (2013) 14:7. doi: 10.1186/1471-2105-14-7
21. Bland JM, Altman DG. Survival probabilities (the Kaplan-Meier method). BMJ. (1998) 317:1572. doi: 10.1136/bmj.317.7172.1572
22. Mermel CH, Schumacher SE, Hill B, Meyerson ML, Beroukhim R, Getz G, et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol. (2011) 12:R41. doi: 10.1186/gb-2011-12-4-r41
23. Li L, Chen C, Wang X. DITHER: an algorithm for defining intratumor heterogeneity based on entropy. Brief Bioinform. (2021) 22:bbab202. doi: 10.1093/bib/bbab202
24. Yoshihara K, Shahmoradgoli M, Martínez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumor purity and stromal and immune cell admixture from expression data. Nat Commun. (2013) 4:2612. doi: 10.1038/ncomms3612
25. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. (2005) 102:15545–50. doi: 10.1073/pnas.0506580102
26. Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. (2017) 45:D353–61. doi: 10.1093/nar/gkw1092
27. Kiselev VY, Kirschner K, Schaub MT, Andrews T, Yiu A, Chandra T, et al. SC3: consensus clustering of single-cell RNA-seq data. Nat Methods. (2017) 14:483–6. doi: 10.1038/nmeth.4236
28. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x
29. Rooney MS, Shukla SA, Wu CJ, Getz G, Hacohen N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell. (2015) 160:48–61. doi: 10.1016/j.cell.2014.12.033
30. Knijnenburg TA, Wang L, Zimmermann MT, Chambwe N, Gao GF, Cherniack AD, et al., Genomic and molecular landscape of dna damage repair deficiency across the cancer genome atlas. Cell Rep. (2018) 23:239-254 e6.
31. Davoli T, Uno H, Wooten EC, Elledge SJ, Tumor Tumor aneuploidy correlates with markers of immune evasion and with reduced response to immunotherapy. Science. (2017) 35. doi: 10.1126/science.aaf8399
32. Huang YH, Cai K, Xu PP, Wang L, Huang CX, Fang Y, et al. CREBBP/EP300 mutations promoted tumor progression in diffuse large B-cell lymphoma through altering tumor-associated macrophage polarization via FBXW7-NOTCH-CCL2/CSF1 axis. Signal Transduct Target Ther. (2021) 6:10. doi: 10.1038/s41392-020-00437-8
33. Bhatia R, Gautam SK, Cannon A, Thompson C, Hall BR, Aithal A, et al. Cancer-associated mucins: role in immune modulation and metastasis. Cancer Metastasis Rev. (2019) 38:223–36. doi: 10.1007/s10555-018-09775-0
34. Zhang X, Shi M, Chen T, Zhang B. Characterization of the immune cell infiltration landscape in head and neck squamous cell carcinoma to aid immunotherapy. Mol Ther Nucleic Acids. (2020) 22:298–309. doi: 10.1016/j.omtn.2020.08.030
35. Craven KE, Gökmen-Polar Y, Badve SS. CIBERSORT analysis of TCGA and METABRIC identifies subgroups with better outcomes in triple negative breast cancer. Sci Rep. (2021) 11:4691. doi: 10.1038/s41598-021-83913-7
36. Lamkanfi M, Kanneganti TD. Caspase-7: a protease involved in apoptosis and inflammation. Int J Biochem Cell Biol. (2010) 42:21–4. doi: 10.1016/j.biocel.2009.09.013
37. Li M, Ma Y, Zhong Y, Liu Q, Chen C, Qiang L, et al., KALRN mutations promote antitumor immunity and immunotherapy response in cancer. J Immunother Cancer. (2020) 8:e000293. doi: 10.1136/jitc-2019-000293
38. Duan H, Jing L, Jiang X, Ma Y, Wang D, Xiang J, et al., CD146 bound to LCK promotes T cell receptor signaling and antitumor immune responses in mice. J Clin Invest. (2021) 131:e148568. doi: 10.1172/JCI148568
39. Krisenko MO, Geahlen RL, Calling in SYK. SYK's dual role as a tumor promoter and tumor suppressor in cancer. Biochim Biophys Acta. (2015) 1853:254–63. doi: 10.1016/j.bbamcr.2014.10.022
40. Ding ZC, Shi H, Aboelella NS, Fesenkova K, Park EJ, Liu Z, et al., Persistent STAT5 activation reprograms the epigenetic landscape in CD4(+) T cells to drive polyfunctionality and antitumor immunity. Sci Immunol. (2020) 5:eaba5962. doi: 10.1126/sciimmunol.aba5962
41. Liu Y, Song C, Shen F, Zhang J, Song SW. IGFBP2 promotes immunosuppression associated with its mesenchymal induction and FcgammaRIIB phosphorylation in glioblastoma. PLoS ONE. (2019) 14:e0222999. doi: 10.1371/journal.pone.0222999
42. Janse van Rensburg HJ, Azad T, Ling M, Hao Y, Snetsinger B, Khanal P, et al., The Hippo Pathway Component TAZ Promotes Immune Evasion in Human Cancer through PD-L1. Cancer Res. (2018) 78:1457-1470. doi: 10.1158/0008-5472.CAN-17-3139
43. Johnson TG, Schelch K, Mehta S, Burgess A, Reid G. Why Be One Protein When You Can Affect Many? The multiple roles of YB-1 in lung cancer and mesothelioma. Front Cell Dev Biol. (2019) 7:221. doi: 10.3389/fcell.2019.00221
44. Dai X, Bu X, Gao Y, Guo J, Hu J, Jiang C, et al., Energy status dictates PD-L1 protein abundance and anti-tumor immunity to enable checkpoint blockade. Mol Cell. (2021) 81:2317-31 e6. doi: 10.1016/j.molcel.2021.03.037
45. Lyu H, Li M, Jiang Z, Liu Z, Wang X. Correlate the TP53 mutation and the HRAS mutation with immune signatures in head and neck squamous cell cancer. Comput Struct Biotechnol J. (2019) 17:1020–30. doi: 10.1016/j.csbj.2019.07.009
46. Jiang Z, Liu Z, Li M, Chen C, Wang X. Immunogenomics analysis reveals that tp53 mutations inhibit tumor immunity in gastric cancer. Transl Oncol. (2018) 11:1171–87. doi: 10.1016/j.tranon.2018.07.012
47. Li M, Zhang Z, Li L, Wang X. An algorithm to quantify intratumor heterogeneity based on alterations of gene expression profiles. Commun Biol. (2020) 3:505. doi: 10.1038/s42003-020-01230-7
Keywords: human papillomavirus-positive cervical cancer, immunological classification, machine learning, multi-omics analysis, immunotherapy
Citation: Song G, Luo J, Zou S, Lou F, Zhang T, Zhu X, Yang J and Wang X (2022) Molecular classification of human papillomavirus-positive cervical cancers based on immune signature enrichment. Front. Public Health 10:979933. doi: 10.3389/fpubh.2022.979933
Received: 28 June 2022; Accepted: 29 August 2022;
Published: 20 September 2022.
Edited by:
Lanlan Wei, Shenzhen Third People's Hospital, ChinaReviewed by:
Jie Wang, First Affiliated Hospital of Xinjiang Medical University, ChinaKun-Yan He, Shanghai Jiao Tong University, China
S. M. Zahid Hosen, University of New South Wales, Australia
Copyright © 2022 Song, Luo, Zou, Lou, Zhang, Zhu, Yang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jianhua Yang, eWpoMjAwNiYjeDAwMDQwO3pqdS5lZHUuY24=; Xiaosheng Wang, eGlhb3NoZW5nLndhbmcmI3gwMDA0MDtjcHUuZWR1LmNu
†These authors have contributed equally to this work