- 1Department of General Surgery, Peking University First Hospital, Beijing, China
- 2Department of Endoscopy Center, Peking University First Hospital, Beijing, China
Gastric cancer is one of the most heterogeneous tumors with multi-level molecular disturbances. Sustaining proliferative signaling and evading growth suppressors are two important hallmarks that enable the cancer cells to become tumorigenic and ultimately malignant, which enable tumor growth. Discovering and understanding the difference in tumor proliferation cycle phenotypes can be used to better classify tumors, and provide classification schemes for disease diagnosis and treatment options, which are more in line with the requirements of today’s precision medicine. We collected 691 eligible samples from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database, combined with transcriptome data, to explore different heterogeneous proliferation cycle phenotypes, and further study the potential genomic changes that may lead to these different phenotypes in this study. Interestingly, two subtypes with different clinical and biological characteristics were identified through cluster analysis of gastric cancer transcriptome data. The repeatability of the classification was confirmed in an independent Gene Expression Omnibus validation cohort, and consistent phenotypes were observed. These two phenotypes showed different clinical outcomes, and tumor mutation burden. This classification helped us to better classify gastric cancer patients and provide targeted treatment based on specific transcriptome data.
Introduction
From a global perspective, gastric cancer incidence is relatively high, ranking fourth (Sung et al., 2021). The incidence and mortality of gastric cancer ranks second among malignant tumors in China (Zheng et al., 2019). Surgery combined with radiotherapy and chemotherapy has contributed to the longer survival of gastric cancer, but at present it is only beneficial to some patients (Wöhrer et al., 2004; Liu et al., 2019). The long-term prognosis is an urgent clinical problem to be solved.
Maintaining proliferation signals and avoiding growth inhibitory factors are two important signs that make cancer cells tumorigenic and eventually malignant, thereby enabling tumors growth (Hanahan and Weinberg, 2011; Ateshian et al., 2012). Normal tissues gingerly regulate the generation and release of pro-growth signals, which indicates cells entry and progression through the cell growth and division cycle, thus making sure that the steady state of cell numbers and maintaining normal tissue structure (Sulić et al., 2005; Lemmon and Schlessinger, 2010; Ateshian et al., 2012). Cancer cells can acquire the ability to maintain proliferation signals through a variety of ways: they can produce growth factor ligands and respond through homologous receptors expression, leading to autocrine proliferation stimulation (Perona, 2006; Hynes and MacDonald, 2009; Lemmon and Schlessinger, 2010; Witsch et al., 2010). In addition, cancer cells can send signals to stimulate normal cells in the tumor-associated stroma, which can offer various growth factors to the cancer cells (Bhowmick et al., 2004; Cheng et al., 2008). The destruction of the negative feedback mechanism that inhibits proliferation signals is another mechanism for cancer development. The deficiencies of these feedback mechanisms can enhance proliferation signals (Cabrita and Christofori, 2008; Wertz and Dixit, 2010). In addition to inducing and maintaining positive growth stimulating signals, cancer cells must also bypass powerful procedures that negatively affect cell proliferation (Amit et al., 2007; Cabrita and Christofori, 2008; Hanahan and Weinberg, 2011). Insufficient understanding of the microenvironment of gastric cancer proliferation cycle may be the main reason for the disappointing results. The rapid development of transcriptomics has made it possible to systematically explore the temporal heterogeneity of genomics in gastric cancer.
Discovering and understanding the differences in tumor proliferation cycle phenotypes can better classify tumors, provide classification schemes for disease diagnosis and treatment, and better meet the requirements of precision medicine today. At the same time, it is conducive to find more sensitive and specific biomarkers, help predict the prognosis of tumors, and develop more effective anti-tumor drugs. The purpose of this study is to use open cohorts to identify the molecular subtypes of gastric cancer, identify the relationship between each cluster and clinical data, determine the unique molecular characteristics of each cluster, and establish a corresponding classifier. In the same time, class gene labels and classifiers can be obtained in this way to predict the classification of new samples, achieve the purpose of identifying cancer subtypes in new samples, establish targeted treatment plans for individuals, reduce the mortality of cancer patients, and improve the mortality of patient rate and improve living standards.
In this study, we collected qualified samples from TCGA and GEO cohorts, and combined with transcriptome data to explore different heterogenous proliferation cycle phenotypes, and further investigated the potential mechanism of each proliferation phenotype.
Patient Datasets
In this study 691 gastric cancer patients (334 from TCGA-STAD sequencing data and 357 from GEO chip data) were included. Transcriptome sequencing data and clinical information were downloaded from the TCGA database (https://portal.gdc.cancer.gov/) as a training cohort. The chip data and the corresponding clinical information were downloaded from the GEO database (https://www.ncbi.nlm.nih.gov/geo/) as the validation cohort. These samples had complete transcriptome data, clinical data and follow-up data and non-zero survival time at the same time in the TCGA-STAD cohort. Correspondingly, we also selected GEO Chip data (GSE84433) with a large enough data volume and relatively complete clinical data and follow-up data as the validation cohort.
Transcriptome-Based Subtypes Identification
The R “ConsensusClusterPlus” package was applied to cluster gene transcriptome data. Two main subtypes were identified by hierarchical clustering of 5,000 genes with highly variable expression [top 5,000 of Median Absolute Deviation (MAD) of gene value]. 80% of the items to resample, 50 resamplings and the maximum evaluation K of 10 were used for gene expression clustering. The cumulative distribution function (CDF) and consensus heat map were applied to evaluate the best K.
Selection of Gene Signatures
The differentially expressed genes were identified with the statistically difference of |log2FC| > 0.58496 and false discovery rate [FDR] <0.05 between subtype A (Sub A) and subtype B (Sub B) firstly. A univariate cox regression analysis was performed to determine the genes with prognostic significance. Subsequently, the method of lasso regression was used to determine the target genes that have the greatest weight on the prognosis by the R package “glmnet” (Friedman et al., 2010) (https://www.jstatsoft.org/v39/i05/). The influence of gene expression on prognosis was weighed by the enter method-based multivariate cox analysis. The prognostic risk signature was built by combining the values of identified gene expression and their corresponding regression coefficients (β value). The median of risk score was set as a cutoff value and divided gastric cancer into high and low risk groups. The differences of overall survival between high and low risk groups were assessed by the R “survival” package (version 3.2–7, https://CRAN.R-project.org/package=survival).
Bioinformatics Analysis
Principal component analysis (PCA) was used to detect expression differences between groups with the R package “princomp” (Zhou et al., 2018). Function annotation of difference genes between groups was analyzed by Gene Ontology (GO) analysis (Huang et al., 2009). Gene set enrichment analysis (GSEA) was performed to determine statistically different gene sets (Mootha et al., 2003; Subramanian, 2005). GO analysis and KEGG analysis conducted by R clusterProfiler R package (3.14.3 version) (Carlson, 2015). The R package “pROC” was used to analyze the receiver operating characteristic (ROC) curve to predict the overall survival (OS). The catalogs of genes related to each stage of the cell cycle were obtained from GO (http://geneontology.org/), KEGG (https://www.genome.jp/kegg/) database and references at the same time. The level of infiltration of different immune cells in the TCGA-STAD and GEO data was quantified by the “CIBERSORT” R package (Newman et al., 2015) with LM22 features and 1,000 permutations. ESTIMATE algorithms assessed the immune and matrix content (immune and stromal scores) of each TCGA-STAD and GEO sample (Yoshihara et al., 2013). The results were displayed in the form of heatmaps and histograms.
Statistical Analysis
SPSS 24.0 (IBM, Chicago, Illinois, United States) and R software (version 3.6.3; http://www.r-project.org/) were used to conduct all statistical analysis. All visualizations were implemented by R software. Kaplan-Meier analysis was used to evaluate survival differences between the groups with log-rank test. Student’s t-test was used to calculate differences in molecular expression, stromal score, immune score and ESTIMATE score between the two subtypes. p < 0.05 was considered as statistically significant.
Results
Gastric Cancer Differentiates Into Two Molecular Subgroups
In order to stratify patients with gastric cancer, we obtained 334 samples of sequencing data from the TCGA-STAD database, using unbiased methods and consistent clustering of gene expression profiles. Through cumulative distribution function (CDF) curve and consensus matrix evaluation (Supplementary Figure S1), we hierarchically clustered 5,000 highly variable expression genes (top 5000 MAD of gene value) and identified two main subtypes (Figure 1A). Principal component analysis (PCA) plot found significant differences in the expression profiles of the two subtypes (Figure 1B). Group members of these two subtypes were associated with different molecular and survival characteristics. Compared with Sub A, the clinical outcome of Sub B patients was significantly worse. In contrast, Sub A had a longer overall survival than Sub B group (Figure 1C).
FIGURE 1. Two subtypes with different prognosis in gastric cancer were identified by unsupervised cluster analysis. (A) Two subtypes defined by 5,000 genes with highly variable expression are plotted as a heatmap. Genes with MAD value in the top 5,000. (B) Principal component analysis (PCA) revealed differences between the two subtypes in TCGA cohort. (C) Kaplan-Meier survival plot for two subtypes in TCGA cohort. (D) The gene sequence of the TCGA training set was applied to the GEO validation cohort. (E) PCA analysis revealed differences between the two subtypes in GEO validation cohort. (F) Kaplan-Meier survival plot for two subtypes in the GEO validation cohort.
357 independent gastric cancer expression profiles were obtained from the GEO (GSE84433) cohort to evaluate the reproducibility of subtypes. Applying the similar genes rankings from the training set (5,000 available genes) to the validation set can clearly replicate the identified subgroups in the TCGA cohort (Figure 1D). PCA also confirmed the difference in gene expression profiles between these two subtypes (Figure 1E). Similarly, survival analysis showed that Sub B had a shorter overall survival time than Sub A (Figure 1F).
Differences in Biological Functions of the Two Subtypes
We also analyzed the functional background of the two gastric cancer subtypes. Figure 2A showed the results of cluster analysis by differential gene expression between the two groups. GO and KEGG analyses with TCGA cohort showed that the up-regulated genes in Sub A vs Sub B were enriched in cell cycle, cell cycle G2/M phase transition, cell cycle checkpoint and negative regulation of chromosome separation (Figure 2B). GSEA further showed that the up-regulated genes in Sub A vs Sub B were mainly enriched in cell cycle checkpoints, G2/M checkpoints, mitotic metaphase and anaphase, and mitotic spindle checkpoint (Figure 2C). Differential gene expression analysis showed cell cycle checkpoint (Figure 2D), cell cycle G2/M phase transition (Figure 2E), negative regulation of nuclear division (Figure 2F) and regulation of spindle checkpoint (Figure 2G) related genes were highly expressed in Sub A samples. The same enrichment trend also appeared in GEO cohort (Figures 3A–G).
FIGURE 2. Analyses of the biological functions of the two subtypes in the TCGA cohort. (A) The heatmap showed DEG between Sub A and Sub B. (B) GO and KEGG analyses on upregulated genes in Sub A vs Sub B in TCGA cohort. (C) GSEA analysis of the two subtypes in the TCGA cohort. (D–G) A box plot showed the difference in the expression of cell cycle checkpoint (D), cell cycle G2/M phase transition (E), negative regulation of nuclear division (F) and regulation of spindle checkpoint (G) between the two subtypes. *p < 0.05; **p < 0.01; ***p < 0.001.
FIGURE 3. Analyses of the biological functions of the two subtypes in the GEO cohort. (A) The heatmap showed the DEG between Sub A and Sub B. (B) GO and KEGG analyses on upregulated genes in Sub A vs Sub B in GEO cohort. (C) GSEA analysis of the two subtypes in the GEO cohort. (D–G) A box plot showed the difference in the expression of cell cycle checkpoint (D), cell cycle G2/M phase transition (E), negative regulation of nuclear division (F) and regulation of spindle checkpoint (G) between the two subtypes. *p < 0.05; **p < 0.01; ***p < 0.001.
GO and KEGG analyses on genes that were upregulated in Sub B vs Sub A in TCGA cohort showed that extracellular matrix structural constituent, collagen-containing extracellular matrix and extracellular matrix organization were enriched (Supplementary Figure S2A). The same enrichment trend also appeared in GEO cohort (Supplementary Figure S2B). Since the differential genes of Sub B vs Sub A were mostly clustered in extracellular matrix related pathways, we analyzed the infiltration of stromal cells in Sub A and Sub B. We implemented the CIBERSORT and ESTIMATE algorithms to quantify the activity or enrichment level of immune cells in gastric cancer tissues. The results showed that the stromal cell infiltration and score, the degree and score of immune cell infiltration, and the overall extracellular matrix infiltration score of Sub B subtypes were higher than Sub A in TCGA and GEO cohorts (Supplementary Figure S3 and S4).
In order to understand the significance of Sub A and Sub B classification, we put our findings in the context of well-recognized molecular subtypes of gastric cancer (EBV, MSI, GS, and CIN) (Sohn et al., 2017) and the results were shown in Supplementary Figure S4. We made the survival curves of Sub A and Sub B among the four recognized classification subtypes, and they were not statistically significant. However, Sub A tended to have a longer survival than Sub B in the MSI and CIN subtypes (Supplementary Figure S5C,D). EBV subtype had better prognosis and overall survival (Sohn et al., 2017) and there were more EBV subtypes in the Sub A sample (Supplementary Figure S5E). At the same time,Kaplan-Meier survival analysis was performed on Sub A and Sub B in different TCGA and GEO clinical characteristics (Supplementary Figure S6, S7). Sub A samples had a better survival trend than Sub B samples. A quantitative analysis of different characteristics enriched in Sub A and Sub B in TCGA and GEO cohorts was also conducted. The results showed that Sub B samples had a higher proportion of well-recognized clinical indicators with poor relative prognosis (Supplementary Figure S8). These illustrated the feasibility of the classification method to a certain extent.
Analysis of Differences in Cycle Control Molecules
Since the up-regulated molecular functions in Sub A vs Sub B were enriched in cell cycle-related pathways, the difference genes in cyclin expression between the two subtypes were analyzed. Cell cycle checkpoint, cell cycle G2/M phase transition, negative regulation of nuclear division and regulation of spindle checkpoint related gene expression were analyzed between the two subtypes. Sub A had a higher expression of negative cell cycle control proteins than Sub B in TCGA cohort (Figures 2D–G). Sub A highly expressed CHEK2, MAD2L1 and SFN genes, which were two typical cell cycle checkpoint genes, and their high expression can reduce cell proliferation (Stolz et al., 2011; Lewinska et al., 2017; Marima et al., 2021) (Figure 2D). Sub A highly expressed CDKN3 and CHEK1 which were cell cycle suppressor genes (Lee et al., 2000; Liu et al., 2000) belonging to the G2 and G2/M phase-related proteins (Figure 2F). Sub A samples expressed BUB1and PLK1 higher than sub B. BUB1and PLK1 are typical genes that negative regulate nuclear division (Fukagawa, 2008; de Cárcer et al., 2018). The same performance was also verified in GEO data (Figures 3D–G).
The Correlation Between Tumor Burden Mutation and the Two Subgroups
Next, we analyzed tumor burden mutation (TMB) from the TCGA cohort to explore the differences in genomic changes between the two subtypes. Figures 4A,B showed the TMB situation of Sub A and Sub B. Sub A displayed more deletion regions, such as TTN, TP53, MUC16, ARID1A, LRP1B, SYNE1 and FAT4 (Figures 4A,B). As shown in Figure 4C, the TMB of the Sub A group was significantly higher than that of Sub B group (p < 0.001). Kaplan-Meier survival analysis showed that patients with high tumor burden mutation (H-TMB) had better overall survival than the low tumor burden mutation (L-TMB, p = 0.003, Figure 4D). Considering the prognostic value of TMB and clusters, we next evaluated the synergy effect of these indicators in the prognostic stratification of TCGA-STAD. Stratified survival analysis revealed that the TMB status did not affect cluster-based predictions. Sub A and Sub B showed significant survival differences in both high and low TMB subgroups (p = 0.006; Figure 4E). Overall, these results indicated that the stratification may be an underlying predictor that is independent of TMB.
FIGURE 4. The correlation between the TMB and clusters in TCGA cohort. (A) TMB analysis in the Sub A groups. (B) TMB analysis in the Sub B groups. (C) Difference analyses of TMB between Sub A and Sub B groups. (D) Kaplan-Meier curves for low and high TMB groups of the TCGA cohort. (E) Kaplan-Meier curves for patients in the TCGA cohorts stratified by both TMB and clusters.
Identification of Prognostic Characteristics Related to Gastric Cancer Classification and Construction of Clinical Prognostic Risk Model
Screening of Differential Molecules With Survival Prognostic Value
We used the lasso regression method to screen 236 differential genes of the Sub A and Sub B with prognostic significance. The corresponding clinical information was shown in Table 1. The remaining three molecules in the final model that had a greater impact on the prognosis were GPC3, GPX3 and PRICKLE1. Survival analysis showed that high expression of these three molecules showed worse prognosis in TCGA and GEO cohorts (Figures 5A,B) which indicated that the three selected molecules may have the potential to predict prognosis. Therefore, we combined clinical indicators and the expression of the three molecules, and applied cox multivariate regression analysis to determine the weight of the three molecule’s influence on the prognosis, thereby constructing a prognostic risk prediction model. We combined the expression value of the identified genes and the weighting of the regression coefficient (β value) to construct the prognostic risk characteristics respectively. Risk score = GPC3*0.142 + GPX3 *0.262 + PRICKLE1*0.366. The median of risk score was set as a cutoff value and divided gastric cancer patients into high and low risk groups. Kaplan-Meier survival analysis illustrated that high-risk group had worse prognosis for overall survival (p < 0.05, Figures 6C,E) in TCGA and GEO cohorts. Enter method-based cox multivariate regression was performed to evaluate the prognostic influence of the risk signature combined with other clinic pathological features. The results showed that the risk score was one of the independent risk factors for overall survival among gastric cancer patients (p < 0.05, Table 2).
FIGURE 5. Cox proportional hazards model was used to identify the prognostic signature. (A and B) Kaplan-Meier curve plot for key prognostic genes of TCGA (A) and GEO (B) cohorts. (C and E) Kaplan-Meier curve plot for key prognostic genes-based risk model of TCGA (C) and GEO (E) cohorts. (D and F) Risk score of the two subtypes in TCGA (D) and GEO (F) cohorts. (G and H) Distribution and current status of gastric cancer risk scores in TCGA and GEO cohorts, which indicated that high-risk score was related to poor prognosis. (I and G) Heatmaps showed that the degree of key prognostic genes expression in different risk score and the two subtypes in TCGA (I) and GEO (G) cohorts. (K and M) ROC of risk score prediction model on TCGA (K) and GEO (M) cohorts. (L and N) Survival curves of gastric cancer patients with combinations of risk score and T stage in the TCGA (L) and GEO (N) cohorts.
FIGURE 6. Differential expression of three signature genes in cancer and adjacent tissue in 20 tumors at the ONCOMINE database. Student’s t test was used to compare the differences in mRNA expression. Red represented overexpression and blue represented low expression. The darker the color, the more obvious the difference in gene expression.
Clinical Correlation Analysis of the Prognostic Signature for Gastric Cancer
Risk model illustrated that high-risk score was interrelated to poor prognosis (Figures 5C,E) and histogram showed that Sub B got a higher risk score (Figures 5D,F) in TCGA and GEO cohort simultaneously. The risk curve showed that high risk score was positively correlated with high risk of death (Figure 5G,H). At the same time, the heat map showed that Sub B had a higher risk score, and the expression of the three genes GPC3, GPX3 and PRICKLE1 were higher compared with Sub A in TCGA and GEO cohorts (Figures 5G,I). The ROC curve showed that the risk prediction model had higher specificity and sensitivity when combining with TNM stage in predicting the death of patients (Figure 5K and M). The higher the patient’s T stage, the higher the corresponding risk score (Figure 5L) in TCGA cohort and no significant differences were found in the GEO data (Figure 5N).
Analysis of Key Genes Expression Difference in Pan-Cancer
GPC3, GPX3 and PRICKLE1 were the candidate sites selected in the risk prediction model for OS, which indicated that the genes might have potential roles on the malignant behaviors of gastric cancer. Therefore, we had performed pan-cancer analysis of these three genes to provide evidence for future basic research in this field. As shown in Figure 6A, ONCOMINE database was used to perform pan-cancer analysis on GPC3, GPX3 and PRICKLE1 expression (Figure 6A). The expression of these three genes in tumor tissues was higher than that in adjacent tissues in 20 tumors.
Discussion
Here, we had defined two subtypes with different clinical and biological characteristics through cluster analysis of gastric cancer transcriptome analysis. The repeatability of the classification was confirmed in an independent GEO validation cohort, and consistent phenotypes were observed. Gastric cancer patients with a better prognosis was characterized by higher expression of proliferation cycle suppression related genes. This classification helps us better classify gastric cancer patients and provided targeted treatment based on specific transcriptome data. Over the past decades, scientists had conducted in-depth research work to identify the underlying molecular mechanisms of gastric cancer, identify its prognostic indicators, and explore potential treatment strategies. Our results not only provide insight into the relationship between the proliferation cycle genome subtypes and postoperative survival rate, but also open up new opportunities for improving the management of gastric cancer.
According to the transcriptome data in TCGA cohort, gastric cancer was clustered into Sub A and Sub B by unsupervised cluster analysis. Principal component analysis can also distinguish Sub A and Sub B well. Kaplan-Meier survival analysis showed that Sub A survives longer than Sub B. There were more Sub B cases in TNM 3/4 compared with Sub A cases. Similar conclusions were also verified in GEO cohort. This showed that the clustered Sub A and Sub B had personalized transcriptome expression components and showed different survival times.
Further GO and KEGG analyses showed that the up-regulated genes of Sub A vs Sub B were mostly concentrated in cell cycle inhibition pathways. GSEA analysis also proved the above conclusions on the trend, although a larger sample sizes was required for verification. These indicated that genes related to the regulation of cell proliferation level cycle played an important part in the occurrence and development of gastric cancer. In addition, GO and KEGG analyses revealed that Sub A had high proliferation-related proteins involved in cell cycle checkpoint, cell cycle G2/M phase transition, negative regulation of nuclear division and regulation of spindle checkpoint. These proteins benefit the survival of gastric cancer patients. We found that Sub B cases with poor results showed an opposite proliferative phenotype in functional annotation. We found that tumor invasiveness and patient survival are more influenced by proliferative cycle nature than by other malignant features of gastric cancer. In the cell cycle-related expression analysis, Sub A expressed more cell cycle inhibitory proteins. The lower proliferation ability of Sub A gastric cancer patients may be important for their survival ability better than Sub B.
In addition, we have identified key different molecules between Sub A and Sub B subtypes that have a greater impact on the prognosis through lasso regression method and univariate and multivariate cox regression analysis. In the end, we screened out 3 molecules, GPC3, GPX3 and PRICKLE1, and constructed a risk scoring model for gastric cancer samples. Survival analysis showed that high expression of these three molecules showed worse prognosis in TCGA and GEO cohorts simultaneously.
Glypican 3, also known as GPC3, is a cancer fetal glycoprotein that is attached to cell membranes via glycophosphatidylinositol anchors. GPC3 can regulate cell proliferation in embryonic mesoderm tissue, as GPC3 gene deletion leads to giant/over growth syndrome, simpson-Golabi-Behmel syndrome (SGBS) (Simpson et al., 1975; Behmel et al., 1984; Ferlini et al., 1984; Pilia et al., 1996; Vuillaume et al., 2019). GPC3 is widely expressed in the placenta, liver, lung, and kidney of embryos. On the contrary, it is difficult to detect in most adult organs (Pellegrini et al., 1998). DNA methylation in the GPC3 promoter region may explain this biological downregulation in adult tissues (Hsu et al., 1997; Huber et al., 1999; Boily et al., 2004). A number of innovative treatments for GPC3 have emerged in recent years. The prognostic significance of serum GPC3 levels and tumor cell GPC3 immunoreactivity in patients with hepatocellular carcinoma has been elucidated. Thus, GPC3 has also attracted attention as a useful biomarker and a new therapeutic target molecule. The main mechanism of anti-GPC3 antibody (GPC3Ab) anticancer cells is antibody-dependent cytotoxicity and/or complement dependent cytotoxicity. Because GPC3Ab is associated with immune responses, regiments of combined immune checkpoint inhibitors have also been investigated. In terms of mechanism, GPC3 may be involved in the regulation of Wnt, hedgehog, bone morphogenetic protein, FGF and other signaling pathways, by which it controls the growth and apoptosis of certain types of cells during development (Paine-Saunders et al., 2000; Midorikawa et al., 2003; Capurro et al., 2008; Iglesias et al., 2008).
Glutathione peroxidase 3, also known as GPx3, is a major extracellular GPx isomer and a major scavenger of reactive oxygen species (ROS) in plasma. Some researches show high GPx3 expression predicted poor prognostic survival. Reduced GPx3 expression inhibited the survival of clonal and unanchored cells and prostate cancer (Yu et al., 2007). In addition, GPx3 is necessary to protect cells from exogenous oxidative damage, as demonstrated by high-dose ascorbic acid therapy. GPx3 is essential for the survival of ovarian cancer cells in the ascites tumor environment and protects against extracellular oxidative stressors, suggesting that GPx3 is an important adaptation for metastasis (Lou et al., 2020). In contrast, GPX3 inhibits tumor progression in some tumors. High GPx3 expression is a potential marker for the diagnosis and prognosis and can inhibit the progression in breast cancer, clear cell ovarian cancer hepatocellular carcinoma, cervical cancer and melanoma patients (Itamochi et al., 2002; Saga et al., 2008; Qi et al., 2014; Zhang et al., 2014; Chen et al., 2016; Lou et al., 2020). The study by Chang et al. provided the first in vivo molecular genetic evidence that GPx3 does indeed play a tumor suppressor role in the development of prostate cancer (Chang et al., 2016). In esophageal squamous cell carcinoma, GPx3 inhibits tumor migration and invasion through the FAK/AKT pathway (Zhu et al., 2018). High-grade bladder cancer is associated with low urinary GPx3 levels. GPx3 inhibits tumor invasion by inhibiting the JNK-Cjun-MMP2 pathway in liver cancer (Qi et al., 2016). Many studies have shown that methylation-mediated GPX3 inhibition may have important implications for the pathogenesis of cancer. The results of Chen et al. suggest that GPx3 methylation is associated with chemotherapy resistance in head and neck cancer and can be used as a potential prognostic indicator for head and neck cancer patients receiving cisplatin-based chemotherapy (Chen et al., 2011). GPx3 is also down-regulated in hepatocellular carcinoma and esophageal squamous cell carcinoma through promoter hypermethylation, which may lead to cancer development and progression (He et al., 2011; Cao et al., 2015). Silencing of GPx3 through DNA hypermethylation is associated with lymph node metastasis in gastric cancer and cervical cancer (Peng et al., 2012; Zhang et al., 2014). However, the functional research of GPx3 in gastric cancer needs to be further explored.
Prickle planar cell polarity protein 1, is also known as PRICKLE1. The results of Daulat et al. suggested that up-regulation of PRICKLE1 in basal breast cancer, a subtype characterized by high metastatic potential, is associated with poor metastases-free survival. PRICKLE1 promotes cancer cell transmission through interaction with mTORC2 (Daulat et al., 2016). According to a study, PRICKLE1 expression can be used as an independent prognostic factor, can be in the column chart combined with age and TNM staging, to predict the rate of gastric cancer patients with OS. PRICKLE1 expression is an independent prognostic factor in patients with gastric cancer (Zhao et al., 2016; Ding et al., 2020).
Because the conclusions and sample cluster analysis were based on only public sequencing data, and our own clinical sample sequencing data will be needed to verify the conclusion and classification standard in the future. At the same time, the conclusion of this topic needs to be verified by multiple centers. Our subjects were mostly white, so it is not known whether these results are suitable for other groups, such as the yellow race. Future studies with more varied samples are needed. The key prognostic molecules screened out were only used for survival analysis and clinicopathological link analysis in TCGA cohort. They are need to be tested and verified in our own samples and gene function also needs to be further studied through basic experiments in the future.
In this study, cluster analysis based on gastric cancer proliferation-related genes were performed for the first time. The major strength of this study was that the clustering results were verified and analyzed in two independent cohorts. Here, a high-risk Sub B gastric cancer subtype that displayed a lower expression of proliferation-related genes was identified. The biological processes of gastric cancer must be understood to facilitate the improvement of clinical treatments.
In summary, our data indicate that transcriptome analysis divides gastric cancer into two different subgroups with different clinical and biological phenotypes. We believe that this classification is meaningful for different treatment strategies and will lead to targeted treatment of patients with gastric cancer.
Conclusion
In conclusion, two subtypes with different clinical and biological characteristics was identified through cluster analysis of gastric cancer transcriptome data. The repeatability of the classification was confirmed in an independent GEO validation set, and consistent phenotypes were observed. Gastric cancer patients with a better prognosis was characterized by higher expression of proliferation cycle suppression related genes. This classification helps us to better classify gastric cancer patients and provide targeted treatment based on specific transcriptome data.
Data Availability Statement
The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.
Author Contributions
JH participated in the study design, carried out the data analysis, and completed the manuscript. YY participated in completing all figures. YM and YN performed the correlation analysis between the risk model and clinicopathological indicators. YL and GC were major contributors to the design of this study and revised the manuscript. All authors read and approved the final manuscript.
Funding
The National Natural Science Foundation of China (No. 81902384 and No. 81770522) and National Science and Technology Major Project of China (2018ZX10723204) supported the completion of this project.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
The authors acknowledge LM for his bioinformatics technical support.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcell.2021.770994/full#supplementary-material
Abbreviations
TCGA, The Cancer Genome Atlas; GEO, Gene Expression Omnibus; NCBI, National Center for Biotechnology Information; UCSC, University of California Santa Cruz; DEG, Differentially expressed gene; STAD, Stomach adenocarcinoma; TPM, transcripts per million reads; MAD, Median absolute deviation; PCA, Principal component analysis; FDR, False discovery rate; GO, Gene ontology; KEGG, Kyoto encyclopedia of genes and genomes; GSEA, Gene set enrichment analysis; ROC, Receiver operating characteristic; AUC, Area under curve; OS, overall survival; HR, Hazard ratio; TMB, Tumor burden mutation; H-TMB, High tumor burden mutation; L-TMB, Low tumor burden mutation.
References
Amit, I., Citri, A., Shay, T., Lu, Y., Katz, M., Zhang, F., et al. (2007). A Module of Negative Feedback Regulators Defines Growth Factor Signaling. Nat. Genet. 39, 503–512. doi:10.1038/ng1987
Ateshian, G. A., Morrison, B., Holmes, J. W., and Hung, C. T. (2012). Mechanics of Cell Growth. Mech. Res. Commun. 42, 118–125. doi:10.1016/j.mechrescom.2012.01.010
Behmel, A., Plchl, E., and Rosenkranz, W. (1984). A New X-Linked Dysplasia Gigantism Syndrome: Identical with the Simpson Dysplasia Syndrome? Hum. Genet. 67, 409–413. doi:10.1007/bf00291401
Bhowmick, N. A., Neilson, E. G., and Moses, H. L. (2004). Stromal Fibroblasts in Cancer Initiation and Progression. Nature 432, 332–337. doi:10.1038/nature03096
Boily, G., Saikali, Z., and Sinnett, D. (2004). Methylation Analysis of the Glypican 3 Gene in Embryonal Tumours. Br. J. Cancer 90, 1606–1611. doi:10.1038/sj.bjc.6601716
Cabrita, M. A., and Christofori, G. (2008). Sprouty Proteins, Masterminds of Receptor Tyrosine Kinase Signaling. Angiogenesis 11, 53–62. doi:10.1007/s10456-008-9089-1
Cao, S., Yan, B., Lu, Y., Zhang, G., Li, J., Zhai, W., et al. (2015). Methylation of Promoter and Expression Silencing of GPX3 Gene in Hepatocellular Carcinoma Tissue. Clin. Res. Hepatol. Gastroenterol. 39, 198–204. doi:10.1016/j.clinre.2014.09.003
Capurro, M. I., Xu, P., Shi, W., Li, F., Jia, A., and Filmus, J. (2008). Glypican-3 Inhibits Hedgehog Signaling during Development by Competing with Patched for Hedgehog Binding. Dev. Cell 14, 700–711. doi:10.1016/j.devcel.2008.03.006
Chang, S.-N., Lee, J. M., Oh, H., and Park, J.-H. (2016). Glutathione Peroxidase 3 Inhibits Prostate Tumorigenesis in TRAMP Mice. Prostate 76, 1387–1398. doi:10.1002/pros.23223
Chen, B., Rao, X., House, M. G., Nephew, K. P., Cullen, K. J., and Guo, Z. (2011). GPx3 Promoter Hypermethylation Is a Frequent Event in Human Cancer and Is Associated with Tumorigenesis and Chemotherapy Response. Cancer Lett. 309, 37–45. doi:10.1016/j.canlet.2011.05.013
Chen, H., Zheng, Z., Kim, K.-Y., Jin, X., Roh, M. R., and Jin, Z. (2016). Hypermethylation and Downregulation of Glutathione Peroxidase 3 Are Related to Pathogenesis of Melanoma. Oncol. Rep. 36, 2737–2744. doi:10.3892/or.2016.5071
Cheng, N., Chytil, A., Shyr, Y., Joly, A., and Moses, H. L. (2008). Transforming Growth Factor-β Signaling-Deficient Fibroblasts Enhance Hepatocyte Growth Factor Signaling in Mammary Carcinoma Cells to Promote Scattering and Invasion. Mol. Cancer Res. 6, 1521–1533. doi:10.1158/1541-7786.mcr-07-2203
Daulat, A. M., Bertucci, F., Audebert, S., Sergé, A., Finetti, P., Josselin, E., et al. (2016). PRICKLE1 Contributes to Cancer Cell Dissemination through its Interaction with mTORC2. Dev. cell 37, 311–325. doi:10.1016/j.devcel.2016.04.011
de Cárcer, G., Venkateswaran, S. V., Salgueiro, L., El Bakkali, A., Somogyi, K., Rowald, K., et al. (2018). Plk1 Overexpression Induces Chromosomal Instability and Suppresses Tumor Development. Nat. Commun. 9, 3012. doi:10.1038/s41467-018-05429-5
Ding, Y., Chen, Y., Wu, M., Li, L., Huang, Y., Wang, H., et al. (2020). Identification of Genes Associated with Gastric Cancer Survival and Construction of a Nomogram to Improve Risk Stratification for Patients with Gastric Cancer. Oncol. Lett. 20, 215–225. doi:10.3892/ol.2020.11543
Ferlini, A., Ragno, M., Gobbi, P., Marinucci, C., Rossi, R., Zanetti, A., et al. (1984). A New X-Linked Mental Retardation-Overgrowth Syndrome. Am. J. Med. Gene 17, 345–358.
Friedman, J. H., Hastie, T., and Robert, T. (2010). Regularization Paths for Generalized Linear Models via Coordinate Descent. J. Stat. Softw. 33, 1–22. doi:10.18637/jss.v033.i01
Fukagawa, T. (2008). The Kinetochore and Spindle Checkpoint in Vertebrate Cells. Front. Biosci. 13, 2705–2713. doi:10.2741/2877
Hanahan, D., and Weinberg, R. A. (2011). Hallmarks of Cancer: the Next Generation. Cell 144, 646–674. doi:10.1016/j.cell.2011.02.013
He, Y., Wang, Y., Li, P., Zhu, S., Wang, J., and Zhang, S. (2011). Identification of GPX3 Epigenetically Silenced by CpG Methylation in Human Esophageal Squamous Cell Carcinoma. Dig. Dis. Sci. 56, 681–688. doi:10.1007/s10620-010-1369-0
Hsu, H. C., Cheng, W., and Lai, P. L. (1997). Cloning and Expression of a Developmentally Regulated Transcript MXR7 in Hepatocellular Carcinoma: Biological Significance and Temporospatial Distribution. Cancer Res. 57, 5179–5184.
Huang, D. W., Sherman, B. T., and Lempicki, R. A. (2009). Systematic and Integrative Analysis of Large Gene Lists Using DAVID Bioinformatics Resources. Nat. Protoc. 4, 44–57. doi:10.1038/nprot.2008.211
Huber, R., Hansen, R. S., Strazzullo, M., Pengue, G., Mazzarella, R., D'Urso, M., et al. (1999). DNA Methylation in Transcriptional Repression of Two Differentially Expressed X-Linked Genes, GPC3 and SYBL1. Proc. Natl. Acad. Sci. 96, 616–621. doi:10.1073/pnas.96.2.616
Hynes, N. E., and MacDonald, G. (2009). ErbB Receptors and Signaling Pathways in Cancer. Curr. Opin. cell Biol. 21, 177–184. doi:10.1016/j.ceb.2008.12.010
Iglesias, B. V., Centeno, G., Pascuccelli, H., Ward, F., Peters, M. G., Filmus, J., et al. (2008). Expression Pattern of Glypican-3 (GPC3) during Human Embryonic and Fetal Development. Histol. Histopathol 23, 1333–1340. doi:10.14670/HH-23.1333
Itamochi, H., Kigawa, J., Sugiyama, T., Kikuchi, Y., Suzuki, M., and Terakawa, N. (2002). Low Proliferation Activity May Be Associated with Chemoresistance in clear Cell Carcinoma of the Ovary. Obstet. Gynecol. 100, 281–287. doi:10.1097/00006250-200208000-00014
Lee, S. W., Reimer, C. L., Fang, L., Iruela-Arispe, M. L., and Aaronson, S. A. (2000). Overexpression of Kinase-Associated Phosphatase (KAP) in Breast and Prostate Cancer and Inhibition of the Transformed Phenotype by Antisense KAP Expression. Mol. Cell Biol 20, 1723–1732. doi:10.1128/mcb.20.5.1723-1732.2000
Lemmon, M. A., and Schlessinger, J. (2010). Cell Signaling by Receptor Tyrosine Kinases. Cell 141, 1117–1134. doi:10.1016/j.cell.2010.06.011
Lewinska, A., Adamczyk-Grochala, J., Deregowska, A., and Wnuk, M. (2017). Sulforaphane-Induced Cell Cycle Arrest and Senescence Are Accompanied by DNA Hypomethylation and Changes in microRNA Profile in Breast Cancer Cells. Theranostics 7, 3461–3477. doi:10.7150/thno.20657
Liu, Q., Guntuku, S., Cui, X. S., Matsuoka, S., Cortez, D., Tamai, K., et al. (2000). Chk1 Is an Essential Kinase that Is Regulated by Atr and Required for the G(2)/M DNA Damage Checkpoint. Genes Dev. 14, 1448–1459. doi:10.1101/gad.14.12.1448
Liu, X., Jin, J., Cai, H., Huang, H., Zhao, G., Zhou, Y., et al. (2019). Study Protocol of a Randomized Phase III Trial of Comparing Preoperative Chemoradiation with Preoperative Chemotherapy in Patients with Locally Advanced Gastric Cancer or Esophagogastric junction Adenocarcinoma: PREACT. BMC cancer 19, 606. doi:10.1186/s12885-019-5728-8
Lou, W., Ding, B., Wang, S., and Fu, P. (2020). Overexpression of GPX3, a Potential Biomarker for Diagnosis and Prognosis of Breast Cancer, Inhibits Progression of Breast Cancer Cells In Vitro. Cancer Cell Int 20, 378. doi:10.1186/s12935-020-01466-7
Marima, R., Hull, R., Penny, C., and Dlamini, Z. (2021). Mitotic Syndicates Aurora Kinase B (AURKB) and Mitotic Arrest Deficient 2 like 2 (MAD2L2) in Cohorts of DNA Damage Response (DDR) and Tumorigenesis. Mutat. Research/Reviews Mutat. Res. 787, 108376. doi:10.1016/j.mrrev.2021.108376
Midorikawa, Y., Ishikawa, S., Iwanari, H., Imamura, T., Sakamoto, H., Miyazono, K., et al. (2003). Glypican-3, Overexpressed in Hepatocellular Carcinoma, Modulates FGF2 and BMP-7 Signaling. Int. J. Cancer 103, 455–465. doi:10.1002/ijc.10856
Mootha, V. K., Lindgren, C. M., Eriksson, K.-F., Subramanian, A., Sihag, S., Lehar, J., et al. (2003). PGC-1α-responsive Genes Involved in Oxidative Phosphorylation Are Coordinately Downregulated in Human Diabetes. Nat. Genet. 34, 267–273. doi:10.1038/ng1180
Newman, A. M., Liu, C. L., Green, M. R., Gentles, A. J., Feng, W., Xu, Y., et al. (2015). Robust Enumeration of Cell Subsets from Tissue Expression Profiles. Nat. Methods 12, 453–457. doi:10.1038/nmeth.3337
Paine-Saunders, S., Viviano, B. L., Zupicich, J., Skarnes, W. C., and Saunders, S. (2000). glypican-3 Controls Cellular Responses to Bmp4 in Limb Patterning and Skeletal Development. Dev. Biol. 225, 179–187. doi:10.1006/dbio.2000.9831
Pellegrini, M., Pilia, G., Pantano, S., Lucchini, F., Uda, M., Fumi, M., et al. (1998). Gpc3 Expression Correlates with the Phenotype of the Simpson-Golabi-Behmel Syndrome. 213, 431–439. doi:10.1002/(SICI)1097-0177(199812)213:4<431:AID-AJA8>3.0.CO;2-7
Peng, D.-F., Hu, T.-L., Schneider, B. G., Chen, Z., Xu, Z.-K., and El-Rifai, W. (2012). Silencing of Glutathione Peroxidase 3 through DNA Hypermethylation Is Associated with Lymph Node Metastasis in Gastric Carcinomas. PloS one 7, e46214. doi:10.1371/journal.pone.0046214
Perona, R. (2006). Cell Signalling: Growth Factors and Tyrosine Kinase Receptors. Clin. Transl Oncol. 8, 77–82. doi:10.1007/s12094-006-0162-1
Pilia, G., Hughes-Benzie, R. M., Mackenzie, A., Baybayan, P., Chen, E. Y., Huber, R., et al. (1996). Mutations in GPC3, a Glypican Gene, Cause the Simpson- Golabi-Behmel Overgrowth Syndrome. 12, 241–247. doi:10.1038/ng0396-241
Qi, X., Ng, K. T.-P., Shao, Y., Li, C. X., Geng, W., Ling, C. C., et al. (2016). The Clinical Significance and Potential Therapeutic Role of GPx3 in Tumor Recurrence after Liver Transplantation. Theranostics 6, 1934–1946. doi:10.7150/thno.16023
Qi, X., Ng, K. T. P., Lian, Q. Z., Liu, X. B., Li, C. X., Geng, W., et al. (2014). Clinical Significance and Therapeutic Value of Glutathione Peroxidase 3 (GPx3) in Hepatocellular Carcinoma. Oncotarget 5, 11103–11120. doi:10.18632/oncotarget.2549
Saga, Y., Ohwada, M., Suzuki, M., Konno, R., Kigawa, J., Ueno, S., et al. (2008). Glutathione Peroxidase 3 Is a Candidate Mechanism of Anticancer Drug Resistance of Ovarian clear Cell Adenocarcinoma. Oncol. Rep. 20, 1299–1303.
Simpson, J. L., Landey, S., New, M., and German, J.(1975). A Previously Unrecognized X Linked Syndrome of Dysmorphia %. J. Birth Defects Original Article Series. 11, 18–24.
Sohn, B. H., Hwang, J.-E., Jang, H.-J., Lee, H.-S., Oh, S. C., Shim, J.-J., et al. (2017). Clinical Significance of Four Molecular Subtypes of Gastric Cancer Identified by the Cancer Genome Atlas Project. Clin. Cancer Res. 23, 4441–4449. doi:10.1158/1078-0432.ccr-16-2211
Stolz, A., Ertych, N., and Bastians, H. (2011). Tumor Suppressor CHK2: Regulator of DNA Damage Response and Mediator of Chromosomal Stability: Figure 1. Clin. Cancer Res. 17, 401–405. doi:10.1158/1078-0432.ccr-10-1215
Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., et al. (2005). Gene Set Enrichment Analysis: a Knowledge-Based Approach for Interpreting Genome-wide Expression Profiles. Proc. Natl. Acad. Sci. U S A. 102, 15545-15550. doi:10.1073/pnas.0506580102
Sulić, S., Panić, L., Dikić, I., and Volarević, S. (2005). Deregulation of Cell Growth and Malignant Transformation. Croat. Med. J. 46, 622–638.
Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA A. Cancer J. Clin. 71, 209–249. doi:10.3322/caac.21660
Vuillaume, M. L., Moizard, M. P., Baumer, A., Cottereau, E., Brioude, F., Rauch, A. T., et al. (2019). CUGC for Simpson-Golabi-Behmel Syndrome (SGBS). Euro. J .Hum. Gene. 27, 663–668. doi:10.1038/s41431-019-0339-z
Wertz, I. E., and Dixit, V. M. (2010). Regulation of Death Receptor Signaling by the Ubiquitin System. Cell Death Differ 17, 14–24. doi:10.1038/cdd.2009.168
Witsch, E., Sela, M., and Yarden, Y. (2010). Roles for Growth Factors in Cancer Progression. Physiology 25, 85–101. doi:10.1152/physiol.00045.2009
Wöhrer, S. S., Raderer, M., and Hejna, M. (2004). Palliative Chemotherapy for Advanced Gastric Cancer. Ann. Oncol. 15, 1585–1595. doi:10.1093/annonc/mdh422
Yoshihara, K., Shahmoradgoli, M., Martínez, E., Vegesna, R., Kim, H., Torres-Garcia, W., et al. (2013). Inferring Tumour Purity and Stromal and Immune Cell Admixture from Expression Data. Nat. Commun. 4, 2612. doi:10.1038/ncomms3612
Yu, Y. P., Yu, G., Tseng, G., Cieply, K., Nelson, J., Defrances, M., et al. (2007). Glutathione Peroxidase 3, Deleted or Methylated in Prostate Cancer, Suppresses Prostate Cancer Growth and Metastasis. Cancer Res. 67, 8043–8050. doi:10.1158/0008-5472.can-07-0648
Zhang, X., Zheng, Z., Yingji, S., Kim, H., Jin, R., Renshu, L., et al. (2014). Downregulation of Glutathione Peroxidase 3 Is Associated with Lymph Node Metastasis and Prognosis in Cervical Cancer. Oncol. Rep. 31, 2587–2592. doi:10.3892/or.2014.3152
Zhao, W., Hoadley, K. A., Parker, J. S., and Perou, C. M. (2016). Identification of mRNA Isoform Switching in Breast Cancer. BMC genomics 17, 181. doi:10.1186/s12864-016-2521-9
Zheng, R. S., Sun, K. X., Zhang, S. W., Zeng, H. M., and He, J. (2019). Report of Cancer Epidemiology in China, 2015. Chin. J. Oncol. 41, 19–28. doi:10.3760/cma.j.issn.0253-3766.2019.01.005
Zhou, Z., Huang, R., Chai, X., Wang, W., Zhou, X., Hu, Z., et al. (2018). Identification of an Energy Metabolism-Related Signature Associated with Clinical Prognosis in Diffuse Glioma. Aging 10, 3185–3209. doi:10.18632/aging.101625
Keywords: the cancer genome atlas, gene expression omnibus, proliferation, cell cycle, cluster, gastric cancer
Citation: Hu J, Yang Y, Ma Y, Ning Y, Chen G and Liu Y (2021) Proliferation Cycle Transcriptomic Signatures are Strongly associated With Gastric Cancer Patient Survival. Front. Cell Dev. Biol. 9:770994. doi: 10.3389/fcell.2021.770994
Received: 05 September 2021; Accepted: 08 November 2021;
Published: 01 December 2021.
Edited by:
Kirk James McManus, University of Manitoba, CanadaReviewed by:
May Yin Lee, Genome Institute of Singapore (A∗STAR), SingaporeYue J. Wang, Florida State University, United States
Copyright © 2021 Hu, Yang, Ma, Ning, Chen and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Guowei Chen, Z3Vvd2VpY2hlbkAyNjMubmV0; Yucun Liu, eXVjdW5saXVAYmptdS5lZHUuY24=