- 1Cancer Research Institute of Zhongshan City, Zhongshan City People’s Hospital, Zhongshan, China
- 2Department of Pathology, Zhongshan City People’s Hospital, Zhongshan, China
- 3Department of Oncology, Southwest Hospital, Army Medical University, Chongqing, China
Background: Nasopharyngeal carcinoma (NPC) is a highly invasive malignant tumor. Recurrence and distant metastasis represent the primary causes of treatment failure. This study aimed to identify biomarkers highly associated with NPC and investigate its roles in tumor progression.
Methods: Transcriptome sequencing (RNA-seq) data of NPC and normal tissues were downloaded from the Gene Expression Omnibus (GEO) database. By analyzing the RNA-seq data, we found that G Protein Subunit Alpha 14 (GNA14) is closely associated with the diagnosis and prognosis of NPC. Immunohistochemistry (IHC) was used to detect the expression of GNA14 in tumor tissues of 165 NPC patients, and we analyzed the relationship between GNA14 expression and patient prognosis. The potential mechanisms by which GNA14 affects tumor prognosis were preliminarily analyzed using bioinformatics analysis.
Results: Analysis of RNA-seq data and IHC showed that GNA14 expression was downregulated in NPC (p < 0.001, p < 0.01, respectively), and low expression of GNA14 was closely associated with poor prognosis. IHC analysis showed that patients with low GNA14 expression had significantly shorter progression-free survival (PFS) and distant metastasis-free survival (DMFS) than those with high GNA14 expression (p = 0.023, p = 0.008, respectively). Multivariate analysis indicated that the GNA14 expression was an independent risk factor for DMFS (p = 0.030). The DMFS nomogram included GNA14 expression, EBV DNA, and N stage as prognostic factors and the concordance index (C-index) of the nomogram was 0.73. Bioinformatics analysis indicated that NPC patients with low GNA14 expression might represent lower levels of immune cell infiltration and poorer drug sensitivity.
Conclusion: Low GNA14 expression may be a risk factor for poor prognosis in NPC.
Introduction
Nasopharyngeal carcinoma (NPC), a type of head and neck squamous cell carcinoma (HNSCC), originates from nasopharyngeal epithelial tissues (1). NPC is characterized by an uneven geographical distribution, particularly prevalent in East and Southeast Asia (2) The etiology of the disease is related to racial susceptibility, genetic factors, environmental factors, and Epstein-Barr virus (EBV) infection (1). In terms of treatment, NPC is treated by a combined regimen of radiotherapy and chemotherapy. This regimen has achieved significant success, with a marked increase in the survival rate for patients (3). However, recurrence and distant metastases remain the leading causes of death for NPC patients (2). Therefore, identifying patients at high risk of recurrence and metastasis before treatment can help oncologists develop individualized treatment plans, which are essential for improving outcomes and prolonging the survival of NPC patients. A recent study proposes that NPC should be viewed as a multidimensional spatiotemporal “unity of ecology and evolution” pathological ecosystem (4). The ecological theory of NPC suggests that tumor cells behave like invasive species in a dynamic ecosystem, interacting with the tumor microenvironment (TME), the immune system, and various factors. It is emphasized that certain genes are not only markers of disease but also key players in ecological interactions in the tumor microenvironment. For example, genes associated with immune evasion, cell proliferation, and metastasis contribute to the enhanced invasiveness of NPC. Therefore, identifying such key biomarkers is crucial for elucidating the molecular mechanisms of recurrence and distant metastasis in NPC.
Identifying biomarkers closely associated with NPC and exploring their value in the diagnosis and treatment of the tumor is currently a major research focus (5, 6). RNA-seq technology has been extensively applied in NPC research to elucidate the association between biomarkers and disease mechanisms, tumor biology, and prognosis (7). Against this backdrop, machine learning algorithms and bioinformatics technologies have played a pivotal role. Advances in bioinformatics enable comprehensive analysis of RNA-seq data (8). The combination of bioinformatics analysis and machine learning algorithms has facilitated the identification of biomarkers highly associated with tumors (9). Identifying key biomarkers in tumors is crucial for developing personalized treatment strategies and understanding the molecular mechanisms of NPC. For instance, in the treatment of NPC, outcomes are often influenced by the unique immune evasion mechanisms of the tumor. Recent studies have identified that certain key biomarkers influence the number and proportion of immune cells in the TME, thereby affecting the therapeutic outcomes for patients (10, 11). By analyzing the characteristics of immune infiltration associated with these biomarkers, new avenues for targeted therapies can be identified (12).
The prognosis of NPC primarily depends on the extent of tumor infiltration. However, in addition to the American Joint Committee on Cancer (AJCC) tumor-node-metastasis (TNM) staging, various other prognostic factors, including age, gender, smoking history, EBV DNA load, and gene expression levels (13–15), have been observed. Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) provide a wealth of RNA-seq data and corresponding clinical information (16, 17), which play a crucial role in the discovery of new functional genes and understanding of the pathogenesis of tumors. In this study, NPC-related RNA-seq data were analyzed using bioinformatics methods, identifying that GNA14 was strongly associated with the prognosis of NPC. Immunohistochemistry was used to verify the prognostic value of GNA14 in clinical samples. The intrinsic association between GNA14 and NPC was investigated by functional enrichment analysis, immune infiltration analysis, drug sensitivity analysis, and other methods.
Materials and methods
Raw data acquisition and preprocessing
RNA-seq datasets related to NPC (GSE12452, GSE53819, GSE64634, GSE61218, GSE102349) were downloaded from the GEO database. Raw data were log-transformed and normalized using the “limma” software package. Expression data of the same genes in the GSE12452 and GSE53819 were merged to form a merged expression matrix, which was subjected to batch effect removal using the “sva” package to generate a training set for the screening of key biomarkers. Using the same method, we merged GSE64634 and GSE61218 as a validation set. The GSE102349 dataset contains clinical and survival information from 113 NPC patients, which was used for survival analysis and subsequent bioinformatics analysis. RNA-seq data and clinical information from 33 solid tumors and corresponding normal tissues were downloaded from the TCGA database for the subsequent pan-cancer analysis. The characteristics of the GEO datasets and the tumor types in the TCGA database are summarized in Supplementary Table 1.
Differential expression analysis and machine learning algorithms
Differential expression analysis was performed on the training set using the “limma” package, with |log2FC| > 1 and false discovery rate (FDR) < 0.05 as screening criteria for identifying differentially expressed genes (DEGs) (18). The DEGs were further screened utilizing Random Forest (RF), Least Absolute Shrinkage and Selection Operator (LASSO) logistic regression, and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) algorithms. RF was performed using the “randomForest” package, genes with a score > 1 were considered as key genes based on the “importance” function. The LASSO regression model was constructed utilizing the “glmnet” package (19) for variable selection. The cross-validation method was used to determine the optimal lambda value corresponding to the number of key genes. SVM-RFE was performed using the “e1071” package to conduct 10-fold cross-validation and feature elimination, identifying a set of key genes based on the principle of error minimization.
Selection of the key biomarker
The overlapping genes identified by the three algorithms were considered highly related to NPC. The diagnostic value of these genes was assessed by receiver operating characteristic curve (ROC) analysis and validated through the validation set. 88 patients from GSE102349 with complete progression-free survival (PFS) information were used for survival analysis. The RNA-seq data and PFS information from the TCGA were used to conduct a pan-cancer survival analysis. Grouping for survival analysis was based on median expression levels of the overlapping genes (GNA14 and LRRC34). Through the above process, we successfully identified a key biomarker, GNA14, highly related to the diagnosis and prognosis of NPC.
Study population in hospital and follow-up
Tissue specimens for immunohistochemical (IHC) examination of GNA14 were obtained from 165 diagnosed NPC patients treated at Zhongshan City People’s Hospital (Guangdong, China) from January 2015 to December 2017, along with 30 patients diagnosed with chronic rhinosinusitis during the same period. This study was conducted in compliance with the Declaration of Helsinki. The study received approval from the Clinical Research Ethics Committee of the Zhongshan City People’s Hospital. All NPC tissues were collected before anti-cancer treatment. All patients following these criteria were retrospectively enrolled: (a) histopathologically confirmed NPC; (b) clinical stages I-IVa according to the 8th edition AJCC/UICC staging system; (c) received either solely intensity-modulated radiation therapy, concurrent chemoradiotherapy, with or without induction chemotherapy or adjuvant chemotherapy; (d) had complete baseline data; (e) had no severe heart, lung, liver, kidney diseases, or other cancers at NPC diagnosis. NPC patients who completed treatment were followed monthly for the first 3 months, every 3 months for the next 3 years, every 6 months for the next 2 years, and annually thereafter. Follow-up ended in December 2023.
IHC and scoring strategies
All Tissues were fixed in 4% formaldehyde and embedded in paraffin. To assess the expression level of GNA14, IHC examinations were conducted using the GNA14 antibody (Polyclonal, rabbit, 13350-1-AP-50UL, 1:200 dilution, Wuhan Sanying). Tissues were cut into 4 μm sections, deparaffinized with xylene, and subsequently rehydrated with graded ethanol. The slides were then incubated with a 3% H2O2 solution for 10 minutes to quench endogenous peroxidase activity, and 0.01 mmol/L citrate buffer (pH 6.0) was used for antigen retrieval in a high-pressure cooker. The sections were incubated with the primary antibody for 3 hours at room temperature, followed by rinsing with TBS. Following incubation with the secondary antibody (Anti-Mouse/Rabbit Universal Immunohistochemical Test Kit, PK10006, Wuhan Sanying), the sections were stained with DAB. All slides were then re-stained with hematoxylin, examined under the microscope, and photographed. Positive control sections were provided by the antibody manufacturer. Cell staining intensity was scored based on a previous study (20, 21). The IHC results were assessed by calculating the total score (0-12) by multiplying the intensity of positive staining (negative, 0; weak, 1; moderate, 2; or strong, 3) by the proportion of target immunopositive cells (<25%, 1; 25-49%, 2; 50-75%, 3; or >75%, 4). IHC results were evaluated independently by two pathologists, and any discrepancies were resolved through consensus. Based on the median IHC score, the high GNA14 expression group was defined as samples with an IHC score > 4, and the low GNA14 expression group was defined as samples with an IHC score ≤ 4.
Expression profile of GNA14
RNA-seq data were extracted from the training and validation sets to compare the expression levels of GNA14 in NPC and normal tissue. IHC staining was employed to detect and compare the expression of GNA14 in nasopharyngeal tissues from patients with chronic sinusitis and NPC. Furthermore, we assessed the differential expression of GNA14 in multiple solid tumors by analyzing RNA-Seq data from the TCGA database, covering 33 different types of tumors and their corresponding normal tissues.
Functional enrichment analysis
Based on the median GNA14 expression level, NPC samples in GSE102349 were categorized into high and low GNA14 expression groups. Identification of DEGs between two groups using the “limma” package (|log2FC| >1, FDR < 0.05). Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways of the DEGs were performed using the “clusterProfiler” package. We selected “C5.go.Hs.symbols.gmt” from the Molecular Signatures Database (MSigDB) as the reference gene set, and gene set enrichment analysis (GSEA) was performed for genes in the GNA14 high and low expression groups.
Immune infiltration and drug sensitivity analysis
The “estimate” package in R was employed to predict the ImmuneScore, StromalScore, and their combined scores (ESTIMATE score) in tumor samples. Expression levels of GNA14 and 79 immune checkpoint genes were extracted, followed by Spearman correlation analysis to evaluate the association between GNA14 and each immune checkpoint gene. Genes with a p-value less than 0.001 were identified as immune checkpoint genes highly associated with GNA14. The list of immune checkpoint genes was derived from previously published literature (22). The single-sample gene set enrichment analysis (ssGSEA) algorithm was utilized to assess the relationship between the proportions of various immune cell types in NPC and GAN14 expression. The gene annotation file contains 28 tumor-infiltrating immune cells from TISIDB. Furthermore, the “oncoPredict” package was used to estimate the chemotherapeutic response of patients from high and low GNA14 groups. The chemotherapeutic response was determined by the half maximal inhibitory concentration (IC50) of each NPC patient and the IC50 data was sourced from the GDSC website (https://www.cancerrxgene.org/).
Statistical analysis
Descriptive statistical analysis was performed on the collected data, expressed as mean ± standard deviation (SD) or percentage (%). Continuous variables were compared using the independent samples t-test or Mann-Whitney U test. Correlations between variables were assessed using Pearson or Spearman correlation analysis. The division of high and low GNA14 expression groups was based on the median GNA14 expression level. Survival analysis was performed using the Kaplan-Meier method, and log-rank tests were used to compare differences. Correlations between GNA14 expression and clinicopathological features were analyzed using the chi-square test or Fisher exact test. The cutoff value for the high and low EBV DNA groups was chosen as 4000 copies per milliliter based on a previous study of the prognostic value of EBV DNA (23). Univariate and multivariate Cox regression models were used to identify the risk factors associated with the prognosis of NPC. A nomogram was constructed to predict the 3-year and 5-year DMFS of NPC patients based on the results of multivariate analysis, and the predictive ability of the models was assessed by calculating the consistency index (C-index). All tests were two-tailed, with significance levels set at p < 0.05. All statistical analyses were performed using R software (version 4.0.0) or SPSS software (version 20.0, IBM, New York, USA).
Results
Differential expression analysis and machine learning algorithms based on GEO datasets
A total of 795 DEGs were identified between NPC and normal tissues based on GEO datasets (Supplementary Table 2). Among these genes, 515 were up-regulated and 280 were down-regulated in NPC samples (Figure 1A). Using the RF algorithm, LASSO regression algorithm, and SVM-RFE algorithm, 4, 21, and 16 genes were identified as highly associated with NPC, respectively (Figures 1B–F, Supplementary Table 3).
Figure 1. Differential expression analysis and machine learning algorithms. (A) Volcano plot shows the differentially expressed genes (DEGs) in NPC and normal tissues. (B) The relationship between the number of decision trees and the error rate in the random forest algorithm. (C, D) LASSO regression analysis. (E, F) Accuracy and error curves for feature variables in the SVM-RFE.
Differential expression analysis and machine learning algorithms based on GEO datasets.
GNA14 may be a key biomarker in NPC
The Venn diagram showed two overlapping genes (GNA14, LRRC34) identified by three machine learning algorithms (Figure 2A, Supplementary Table 2). ROC curve analysis demonstrated that GNA14 had an AUC of 0.982 (95% CI: 0.96-0.99) in the training set (Figure 2B), and achieved an AUC of 0.941 (95% CI: 0.85-1.00) in the validation set (Figure 2C). The results of the survival analysis indicated that the group with low GNA14 expression exhibited significantly shorter PFS compared to the high-expression group (Figure 2D). Conversely, there were no significant differences in PFS between the high-expression and low-expression groups of LRRC34 (Figure 2E). In addition, we found that in head and neck squamous cell carcinoma (HNSCC), thyroid carcinoma (THCA), cholangiocarcinoma (CHOL), kidney renal clear cell carcinoma (KIRC), liver hepatocellular carcinoma (LIHC), and uterine corpus endometrial carcinoma (UCEC), the risk of tumor progression was higher in the GNA14 low-expression group than in the GNA14 high-expression group (Figure 2F). Therefore, GNA14 was selected as a key biomarker for further study.
Figure 2. GNA14 may be a key biomarker in NPC. (A) Venn diagram showing the overlapped genes from machine learning algorithms. (B, C) ROC curve of GNA14 in the training set and validation set, with the horizontal axis representing the false positive rate (FPR) and the vertical axis indicating sensitivity. (D) Kaplan-Meier curves of PFS for 88 patients between high GNA14 expression group and low GNA14 expression group. (E) Kaplan-Meier curves of PFS for patients between high LRRC34 expression group and low LRRC34 expression group. (F) Pan-cancer survival analysis between high and low GNA14 expression groups.
Expression profile of GNA14
Analysis of the RNA-seq data demonstrated a significant reduction in GNA14 expression in NPC compared to normal nasopharyngeal tissues in training set (p < 0.001) (Figure 3A), a result corroborated by the validation set (Figure 3B). Through IHC examination, we observed that the expression level of GNA14 in NPC samples was significantly lower than in non-cancerous nasopharyngeal tissues (p < 0.01), and GNA14 was primarily localized in the cell membrane (Figures 3C–E). Furthermore, it was discovered that in various tumor samples, such as head and neck squamous cell carcinoma (HNSCC), bladder urothelial carcinoma (BLCA), liver hepatocellular carcinoma (LHIC), lung squamous cell carcinoma (LUSC), and thyroid carcinoma (THCA), the expression level of GNA14 was significantly lower than that of the corresponding normal samples (p < 0.001) (Figure 3F).
Figure 3. GNA14 expression profile. (A, B) Box plot showing differential expression of GNA14 in normal nasopharyngeal mucosa and NPC tissues, with the left graph representing the training set and the right graph representing the validation set. (C) Box plot showing differential expression of GNA14 in chronic rhinosinusitis tissues (Non-NPC) and NPC tissues (Based on IHC results). (D, E) IHC examination of GNA14 expression in NPC tissues and in non-NPC tissues [magnification 40x (left) and 200x (right)]. (F) Differential expression analysis was conducted on GNA14 in various tumor samples and normal controls (*** p < 0.001, ** p < 0.01, * p < 0.05).
Patient characteristics and clinical sample analysis
Of the 165 patients enrolled, 110 (66.7%) were male and 55 (33.3%) were female, with a male-to-female ratio of 2:1, and a median age of 49 years (range: 22-76 years). 39 (23.6%) patients were diagnosed as stage I-II and 126 (76.4%) as stage III-IV. The majority of patients had a pathological diagnosis of WHO type III (97.5%). The median duration of follow-up was 69 months (range: 4-95). There were 92 cases in the GNA14 low expression group and 73 cases in the GNA14 high expression group (Figures 4E, F). We found that the low GNA14 expression was significantly associated with advanced clinical stage (p = 0.013) and increased risk of distant metastasis (p = 0.008) (Table 1, Figure 4G). The PFS (56.5% vs. 74.0%; p = 0.023) and DMFS (70.0% vs. 87.7%; p = 0.008) of patients in the GNA14-low expression group were significantly shorter than those in the GNA14-high expression group (Figures 4A, B), while there was no significant difference in overall survival (OS) and locoregional recurrence-free survival (LRFS) between the two groups (p = 0.088, p = 0.478, respectively) (Figures 4C, D). The results of the multivariate Cox regression analysis indicated that T stage and EBV DNA were significant factors for OS (p = 0.024 and p = 0.029, respectively). EBV DNA was identified as a significant factor for PFS (p = 0.012), while T stage was a significant factor for LRFS (p = 0.031). GNA14 expression was an independent risk factor affecting DMFS in NPC (p = 0.030) (Table 2). We constructed a nomogram that incorporates the GNA14 expression (Figures 4H, I), EBV DNA, and N stage to predict the 3-year and 5-year DMFS in NPC patients (Figure 4G). The C-index of the nomogram was 0.73 (Figure 4H).
Figure 4. Low expression of GNA14 predicts poor prognosis in NPC patients. Kaplan–Meier curves of OS (A), PFS (B), LFRS (C), and DMFS (D) for high and low GNA14 expression in NPC patients. (E, F) Representative images of high and low expression groups of GNA14 immunohistochemical staining in NPC tissue (100×). (G) Representative images of GNA14 expression in the clinical staging group and the distant metastasis group. (H) The nomogram was used to predict the 3-year and 5-year DMFS rates for patients with NPC. (I) The calibration curve for the nomogram.
Functional enrichment analysis
We identified 280 DEGs between high and low GNA14 expression groups in GSE102349, with 266 genes exhibiting upregulation and 14 showing downregulation (Supplementary Table 4). GO analysis indicated that these DEGs were predominantly enriched in pathways related to immune response and cell migration, including B-cell activation, proliferation of various immune cells, and ciliary movement (Figure 5A). KEGG enrichment analysis revealed that the DEGs were significantly concentrated in pathways such as chemokine signaling, NF-kB signaling, and cytochrome P450-mediated drug metabolism (Figure 5B). According to Gene Set Enrichment Analysis (GSEA), pathways related to cell division, DNA, and chromosome replication were up-regulated in the low GNA14 expression group, and pathways related to immune cell activity and adaptive immune response were down-regulated compared to patients in the high GNA14 expression group (Figures 5C, D).
Figure 5. Analysis of functional enrichment in GNA14 high and low expression groups in 113 NPC patients from GSE102349. Gene Ontology (A) and KEGG pathways (B). Results of differentially expressed genes (DEGs) in patients with high and low GNA14 expression. GSEA analysis revealed gene pathways that were significantly enriched in the high (C) and low (D) GNA14 expression groups.
Immune cell infiltration and drug sensitivity analysis
It was found tumor tissues with low GNA14 expression represented lower immune and stromal scores (Figure 6A). Furthermore, in NPC patients with low GNA14 expression, the majority of immune checkpoint genes exhibit lower expression levels (p < 0.001) (Figure 6B, Supplementary Table 5). Results from the immune infiltration analysis showed that a decrease in GNA14 expression level was correlated with a reduced proportion of most immune cells such as B cells, CD8 T cells, and NK cells (Figure 6C). Through drug sensitivity analysis, we observed that most chemotherapy drugs such as 5-fluorouracil, Gemcitabine, and Oxaliplatin exhibited higher IC50 values in patients with GNA14-low expression (Figure 6D).
Figure 6. Immune cell infiltration and drug sensitivity analysis in 113 NPC patients from GSE102349. (A) Box plots were used to compare the differences between GNA14 high and low expression groups in terms of immunity score, stroma score, and estimate score. (B) Correlation between GNA14 expression levels and immune checkpoint genes. (C) The levels of multiple immune cell counts were compared between high and low GNA14 expression groups (*** p < 0.001, ** p < 0.01, * p < 0.05). (D) Box plots were used to compare drug sensitivity between GNA14 high and low expression groups, the horizontal axis indicates the drug name and the vertical axis indicates drug sensitivity (IC50).
Discussion
Exploring the key biomarkers that affect the prognosis of NPC is crucial for identifying new therapeutic targets and prolonging the survival of patients. In this study, through a comprehensive analysis of RNA-seq data, we ultimately identified GNA14 as a potential key biomarker for NPC. Subsequently, we performed IHC to examine the expression of GNA14 in NPC and to explore its correlation with the clinical characteristics of patients. Analysis of RNA-seq data and IHC analysis consistently showed that GNA14 expression was downregulated in NPC tissues.
GNA14 is a gene encoding the G protein subunit alpha 14. G proteins (guanine nucleotide-binding proteins) are widespread intracellular signal transduction molecules. Some studies have shown that GNA14 can act as an oncogene to promote cancer development. For example, a study by Wang et al. found that GNA14 is highly expressed in endometrial carcinoma tissue and that GNA14 promotes the malignant growth of endometrial carcinoma by stimulating KLF7 to upregulate HAS2 expression (24). Interestingly, more research indicates that GNA14 may function as a tumor suppressor gene For example, Song et al. found that GNA14 was downregulated in hepatocellular carcinoma (HCC), and negatively associated with hepatitis B virus (HBV) infection, vascular invasion, and HCC prognosis (25). Huang et al. reported lower GNA14 expression in oral squamous cell carcinoma (OSCC) cell lines and tissues, indicating a poorer prognosis for OSCC (26). In a study conducted by Pan et al., GNA14 exhibited low expression in papillary thyroid carcinoma and patients in the GNA14 low expression group demonstrated a diminished disease-free survival (DFS) rate (27). The function of GNA14 in tumor development may depend on the specific tumor type, with the protein exhibiting either tumor-promoting or tumor-suppressing properties. It is noteworthy that thyroid carcinoma, oral squamous cell carcinoma, and nasopharyngeal carcinoma are all clinically common head and neck tumors. Similarly, our study also found that GNA14 expression was down-regulated in NPC, and patients with low GNA14 expression had significantly lower PFS and DMFS than those with high GNA14 expression. Considering that the processing of the specimens might negatively affect the GNA14 measurements, we employed an antigen retrieval step in the experiment to ensure antibody binding to the target antigen. For instance, antigen retrieval was performed under high-pressure conditions using a 0.01 mmol/L citrate buffer (pH 6.0) in a pressure cooker. This approach has been demonstrated to restore the antigenicity of many proteins, ensuring reliable antibody binding (28, 29). Additionally, tissue fixation time or the extent of paraffin infiltration may compromise the accuracy of GNA14 quantification due to antigen masking. Therefore, we included positive control sections provided by the manufacturer, which helped to confirm the validity of the staining protocol.
In this study, we preliminarily investigated the possible reasons for the poor prognosis of NPC patients due to low GNA14 expression, as well as to explore the potential therapeutic approach of GNA14 as a target. It was observed that the expression level of GNA14 was lower in patients with advanced clinical stages (III or IVa) compared to those with early stages (I or II). It was hypothesized that GNA14 may function as an oncogene, with low expression promoting the development and progression of NPC. Strategies to restore GNA14 expression in NPC cells may have the potential to inhibit tumor growth and metastasis. This restoration can be achieved through gene therapy techniques, such as viral or nanoparticle-mediated delivery of GNA14, with the aim of re-establishing its expression within tumor tissues. Furthermore, we focused on DEGs in patients with high and low GNA14 expression. The analysis of GO and KEGG indicates that these DEGs are primarily enriched in pathways associated with immune response and cell migration (30, 31). In addition, GSEA suggested that compared to patients with high GNA14 expression, pathways such as cell division, DNA, and chromosome replication were up-regulated in patients with low GNA14 expression, while pathways such as immune cell activity and adaptive immune response were down-regulated (32). Research has demonstrated that the extent of immune infiltration is linked to prognosis, with greater levels of immune infiltration generally indicating a more favorable prognosis (33). Our study discovered that NPC patients with low GNA14 expression had a significant decrease in the proportion of most immune cells, such as B cells, CD4+, and CD8+ T cells, and had lower stromal scores and immunity scores, indicating these patients had higher tumor purity and lower levels of immune infiltration (22). It is anticipated that the restoration of GNA14 expression or activity will enhance the immune infiltration of tumors and augment the capacity of the immune system to combat tumor growth. The combination of GNA14-targeted therapy with immune checkpoint inhibitors (e.g., PD-1/PD-L1) may result in a synergistic effect and an improvement in therapeutic efficacy. In addition, we found that patients with low GNA14 expression were less sensitive to chemotherapeutic agents such as 5-fluorouracil and gemcitabine, which are commonly used chemotherapeutic agents in nasopharyngeal carcinoma, suggesting that these patients may have poorer response to chemotherapy.
Numerous studies have demonstrated a close association between EBV infection and the development of NPC. EBV DNA can be used as a biomarker for early diagnosis and prognostic prediction of NPC (34, 35). In our study, multivariate Cox regression analysis showed that pre-treatment EBV DNA was an independent risk factor for PFS and DMFS in patients, which is consistent with the findings of Tang et al. (36). In addition to EBV DNA, many biomarkers have been reported to be associated with the prognosis of NPC, such as EBV serum antibodies, miR-BART2-5p, serum LDH, and C-reactive protein (CRP) (37–40). These factors have limitations in clinical application due to their susceptibility to patient conditions and the complexity of the inspection techniques. We found that GNA14 expression and N stage were equally independent risk factors for DMFS. We have developed a nomogram based on GNA14 expression that can more easily help clinicians evaluate patients with a high risk of distant metastasis. For instance, a more intense regimen can be used in individuals with low GNA14 expression.
Our study had some limitations. First, this study was a single-center study, and all patients were enrolled in a high-prevalence area, thus not able to generalize to all NPC patients. It is recommended that future studies integrate data from different sources and a broader patient population for multicenter validation. Second, due to practical constraints, we were unable to collect data from a larger number of participants. A small sample size may reduce the statistical power to detect differences. However, if the results demonstrated statistical significance under these circumstances, it emphasizes the robustness of the findings. Third, immune cell infiltration and drug sensitivity analysis with GNA14 expression has not been fully validated in cellular or other experimental studies. Future studies are needed to elucidate the precise role of GNA14 in nasopharyngeal carcinogenesis, invasion, and metastasis, along with its impact on immune cell infiltration and drug sensitivity, through experiments like in vitro cellular assays and animal models.
Conclusion
In conclusion, we found that GNA14 expression was down-regulated in NPC tissues, and its low expression may be closely associated with advanced tumor stage and distant metastasis. GNA14 expression combined with pre-treatment EBV DNA load and N stage shows potential for predicting NPC patients with a high risk of distant metastasis.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Clinical Research Ethics Committee of the Zhongshan City People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. The retrospective study was approved by the Clinical Research Ethics Committee of the Zhongshan City People's Hospital (Guangdong, China) (Project No: 2024-065).
Author contributions
JH: Writing – original draft, Writing – review & editing, Methodology, Validation, Visualization. XY: Supervision, Writing – review & editing. LX: Supervision, Validation, Writing – review & editing. LD: Conceptualization, Investigation, Writing – review & editing. SL: Supervision, Writing – review & editing. SS: Supervision, Validation, Writing – review & editing. XL: Supervision, Writing – original draft, Writing – review & editing. YD: Methodology, Supervision, Writing – review & editing. MJ: Conceptualization, Methodology, Supervision, Validation, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. 1. 2023 Zhongshan Social Welfare and Basic Research Project (2023B3008). 2. 2020 National Major Chronic Noncommunicable Disease Prevention and Control Research, “Research on the Development of New Technologies and Optimization of Nasopharyngeal Cancer Screening in high-incidence Areas” (2020YFC1316900), subproject “Building a Preliminary Screening Model for Individualized disease Risk” (2020YFC1316902).
Acknowledgments
We thank all participants, collaborators, and team members of this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1482038/full#supplementary-material
Supplementary Table 1 | The characteristics of the GEO datasets and the tumor types in the TCGA database.
Supplementary Table 2 | The results of differential expression analysis in the training set.
Supplementary Table 3 | Results of RF algorithm, LASSO regression algorithm, and SVM-RFE algorithm.
Supplementary Table 4 | The DEGs between high and low GNA14 expression groups in GSE102349.
Supplementary Table 5 | List of immune checkpoint genes highly correlated with GNA14.
Abbreviations
NPC, Nasopharyngeal carcinoma; RNA-seq, Transcriptome sequencing; GNA14, G Protein Subunit Alpha 14; IHC, Immunohistochemical; GEO, Gene Expression Omnibus; TCGA, The Cancer Genome Atlas; EBV, Epstein-Barr virus; C-index, Concordance index; CHOL, Cholangiocarcinoma; HNSCC, Head and Neck squamous cell carcinoma; KIRC, Kidney renal clear cell carcinoma; LIHC, Liver hepatocellular carcinoma; THCA, Thyroid carcinoma; UCEC, Uterine Corpus Endometrial Carcinoma; HCC, Hepatocellular carcinoma; OSCC, Oral squamous cell carcinoma; DEGs, Differentially expressed genes; FDR, False discovery rate; RF, Random Forest; LASSO, Least Absolute Shrinkage and Selection Operator; SVM-RFE, Support Vector Machine-Recursive Feature Elimination; ROC, Receiver operating characteristic curve; HGB, Hemoglobin; ALB, Albumin; LDH, Lactate dehydrogenase; WHO, World Health Organization; M, Male; F, Female; PFS, Progression-free survival; OS, Overall survival; DMFS, Distant metastasis-free survival; LRFS, Locoregional recurrence-free survival; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; GSEA, Gene set enrichment analysis; ssGSEA, Single-sample gene set enrichment analysis; IC50, Half maximal inhibitory concentration.
References
1. Chen YP, Chan ATC, Le QT, Blanchard P, Sun Y, Ma J. Nasopharyngeal carcinoma. Lancet. (2019) 394:64–80. doi: 10.1016/S0140-6736(19)30956-0
2. Tang LL, Chen WQ, Xue WQ, He YQ, Zheng RS, Zeng YX, et al. Global trends in incidence and mortality of nasopharyngeal carcinoma. Cancer Lett. (2016) 374:22–30. doi: 10.1016/j.canlet.2016.01.040
3. Blanchard P, Lee A, Marguet S, Leclercq J, Ng WT, Ma J, et al. Chemotherapy and radiotherapy in nasopharyngeal carcinoma: an update of the MAC-NPC meta-analysis. Lancet Oncol. (2015) 16:645–55. doi: 10.1016/S1470-2045(15)70126-9
4. Luo W. Nasopharyngeal carcinoma ecology theory: cancer as multidimensional spatiotemporal “unity of ecology and evolution” pathological ecosystem. Theranostics. (2023) 13:1607–31. doi: 10.7150/thno.82690
5. Li T, Li F, Guo X, Hong C, Yu X, Wu B, et al. Anti-epstein-barr virus BNLF2b for mass screening for nasopharyngeal cancer. N Engl J Med. (2023) 389:808–19. doi: 10.1056/NEJMoa2301496
6. Tang XR, Li YQ, Liang SB, Jiang W, Liu F, Ge WX, et al. Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study. Lancet Oncol. (2018) 19:382–93. doi: 10.1016/S1470-2045(18)30080-9
7. Lin M, Zhang XL, You R, Liu YP, Cai HM, Liu LZ, et al. Evolutionary route of nasopharyngeal carcinoma metastasis and its clinical significance. Nat Commun. (2023) 14:610. doi: 10.1038/s41467-023-35995-2
8. Jiang P, Sinha S, Aldape K, Hannenhalli S, Sahinalp C, Ruppin E. Big data in basic and translational cancer research. Nat Rev Cancer. (2022) 22:625–39. doi: 10.1038/s41568-022-00502-0
9. Swanson K, Wu E, Zhang A, Alizadeh AA, Zou J. From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment. Cell. (2023) 186:1772–91. doi: 10.1016/j.cell.2023.01.035
10. Wang M, Huang Y, Chen M, Wang W, Wu F, Zhong T, et al. Inhibition of tumor intrinsic BANF1 activates antitumor immune responses via cGAS-STING and enhances the efficacy of PD-1 blockade. J Immunother Cancer. (2023) 11:e007035. doi: 10.1136/jitc-2023-007035
11. van Montfoort N, Borst L, Korrer MJ, Sluijter M, Marijt KA, Santegoets SJ, et al. NKG2A blockade potentiates CD8 T cell immunity induced by cancer vaccines. Cell. (2018) 175:1744–1755.e15. doi: 10.1016/j.cell.2018.10.028
12. Chen YP, Yin JH, Li WF, Li HJ, Chen DP, Zhang CJ, et al. Single-cell transcriptomics reveals regulators underlying immune cell diversity and immune subtypes associated with prognosis in nasopharyngeal carcinoma. Cell Res. (2020) 30:1024–42. doi: 10.1038/s41422-020-0374-x
13. Zhu Q, Zhang Q, Gu M, Zhang K, Xia T, Zhang S, et al. MIR106A-5p upregulation suppresses autophagy and accelerates Malignant phenotype in nasopharyngeal carcinoma. Autophagy. (2021) 17:1667–83. doi: 10.1080/15548627.2020.1781368
14. Lechner M, Schartinger VH, Steele CD, Nei WL, Ooft ML, Schreiber LM, et al. Somatostatin receptor 2 expression in nasopharyngeal cancer is induced by Epstein Barr virus infection: impact on prognosis, imaging and therapy. Nat Commun. (2021) 12:117. doi: 10.1038/s41467-020-20308-8
15. Mazurek AM, Wygoda A, Rutkowski T, Olbryt M, Pietrowska M, Celejewska A, et al. Prognostic significance of Epstein-Barr virus viral load in patients with T1-T2 nasopharyngeal cancer. J Med Virol. (2020) 92:348–55. doi: 10.1002/jmv.25606
16. Blum A, Wang P, Zenklusen JC. SnapShot: TCGA-analyzed tumors. Cell. (2018) 173:530. doi: 10.1016/j.cell.2018.03.059
17. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, et al. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Res. (2013) 41:D991–995. doi: 10.1093/nar/gks1193
18. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. (2015) 43:e47. doi: 10.1093/nar/gkv007
19. Engebretsen S, Bohlin J. Statistical predictions with glmnet. Clin Epigenet. (2019) 11:123. doi: 10.1186/s13148-019-0730-1
20. Huang PY, Guo SS, Zhang Y, Lu JB, Chen QY, Tang LQ, et al. Tumor CTLA-4 overexpression predicts poor survival in patients with nasopharyngeal carcinoma. Oncotarget. (2016) 7:13060–8. doi: 10.18632/oncotarget.v7i11
21. Waisberg J, De Souza Viana L, Affonso Junior RJ, Silva SRM, Denadai MVA, Margeotto FB, et al. Overexpression of the ITGAV gene is associated with progression and spread of colorectal cancer. Anticancer Res. (2014) 34:5599–607.
22. Hu FF, Liu CJ, Liu LL, Zhang Q, Guo AY. Expression profile of immune checkpoint genes and their roles in predicting immunotherapy response. Brief Bioinform. (2021) 22:bbaa176. doi: 10.1093/bib/bbaa176
23. Leung S-f, Zee B, BB Ma, EP H, Mo F, Lai M, et al. Plasma Epstein-Barr viral deoxyribonucleic acid quantitation complements tumor-node-metastasis staging prognostication in nasopharyngeal carcinoma. J Clin Oncol. (2006) 24:5414–8. doi: 10.1200/JCO.2006.07.7982
24. Wang J, Lv X, Xu F, Wei M, Liu C, Yang Y. GNA14 silencing suppresses the proliferation of endometrial carcinoma cells through inducing apoptosis and G2/M cell cycle arrest. Biosci Rep. (2018) 38:BSR20180574. doi: 10.1042/BSR20180574
25. Song G, Zhu X, Xuan Z, Zhao L, Dong H, Chen J, et al. Hypermethylation of GNA14 and its tumor-suppressive role in hepatitis B virus-related hepatocellular carcinoma. Theranostics. (2021) 11:2318–33. doi: 10.7150/thno.48739
26. Huang GZ, Wu QQ, Zheng ZN, Shao TR, Lv XZ. Identification of candidate biomarkers and analysis of prognostic values in oral squamous cell carcinoma. Front Oncol. (2019) 9:1054. doi: 10.3389/fonc.2019.01054
27. Pan L, Zhang L, Fu J, Shen K, Zhang G. Integrated transcriptome sequencing and weighted gene co-expression network analysis reveals key genes of papillary thyroid carcinomas. Heliyon. (2024) 10:e27928. doi: 10.1016/j.heliyon.2024.e27928
28. Leong TYM, Leong ASY. How does antigen retrieval work? Adv Anat Pathol. (2007) 14:129–31. doi: 10.1097/PAP.0b013e31803250c7
29. Paulsen IMS, Dimke H, Frische S. A single simple procedure for dewaxing, hydration and heat-induced epitope retrieval (HIER) for immunohistochemistry in formalin fixed paraffin-embedded tissue. Eur J Histochem EJH. (2015) 59:2532. doi: 10.4081/ejh.2015.2532
30. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. (2000) 28:27–30. doi: 10.1093/nar/28.1.27
31. The Gene Ontology Consortium. The Gene Ontology Resource: 20 years and still GOing strong. Nucleic Acids Res. (2019) 47:D330–8. doi: 10.1093/nar/gky1055
32. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U.S.A. (2005) 102:15545–50. doi: 10.1073/pnas.0506580102
33. Jin S, Li R, Chen MY, Yu C, Tang LQ, Liu YM, et al. Single-cell transcriptomic analysis defines the interplay between tumor cells, viral infection, and the microenvironment in nasopharyngeal carcinoma. Cell Res. (2020) 30:950–65. doi: 10.1038/s41422-020-00402-8
34. Chan KCA, Lo YMD. Circulating EBV DNA as a tumor marker for nasopharyngeal carcinoma. Semin Cancer Biol. (2002) 12:489–96. doi: 10.1016/S1044579X02000913
35. Nilsson JS, Forslund O, Andersson FC, Lindstedt M, Greiff L. Intralesional EBV-DNA load as marker of prognosis for nasopharyngeal cancer. Sci Rep. (2019) 9:15432. doi: 10.1038/s41598-019-51767-9
36. Tang LQ, Chen QY, Fan W, Liu H, Zhang L, Guo L, et al. Prospective study of tailoring whole-body dual-modality [18F]fluorodeoxyglucose positron emission tomography/computed tomography with plasma Epstein-Barr virus DNA for detecting distant metastasis in endemic nasopharyngeal carcinoma at initial staging. J Clin Oncol. (2013) 31:2861–9. doi: 10.1200/JCO.2012.46.0816
37. Ye Q, Guo J, Chen Y, Cui Z, Chen Y. Performance of plasma HSP90α, serum EBV VCA igA antibody and plasma EBV DNA for the diagnosis and prognosis prediction of nasopharyngeal carcinoma. Cancer Manag Res. (2021) 13:5793–802. doi: 10.2147/CMAR.S320541
38. Wu Y, Zhang X, Liu C, Li Z, Wen Y, Zheng R, et al. Epstein-Barr virus microRNA miR-BART2-5p accelerates nasopharyngeal carcinoma metastasis by suppressing RNase III endonuclease DICER1. J Biol Chem. (2023) 299:105082. doi: 10.1016/j.jbc.2023.105082
39. Zeng X, Liu G, Pan Y, Li Y. Prognostic value of clinical biochemistry-based indexes in nasopharyngeal carcinoma. Front Oncol. (2020) 10:146. doi: 10.3389/fonc.2020.00146
Keywords: nasopharyngeal carcinoma, biomarker, GNA14, ebv, prognosis, immunohistochemistry, bioinformatics analysis
Citation: Hao J, Yu X, Xiong L, Deng L, Lian S, Sun S, Li X, Du Y and Ji M (2024) GNA14 may be a potential prognostic biomarker in nasopharyngeal carcinoma. Front. Oncol. 14:1482038. doi: 10.3389/fonc.2024.1482038
Received: 17 August 2024; Accepted: 29 October 2024;
Published: 26 November 2024.
Edited by:
Sharon R. Pine, University of Colorado Anschutz Medical Campus, United StatesReviewed by:
Weiren Luo, The Second Affiliated hospital of Southern University of Science and Technology, ChinaLukas S. Fiedler, SLK Kliniken Heilbronn GmbH, Germany
Copyright © 2024 Hao, Yu, Xiong, Deng, Lian, Sun, Li, Du and Ji. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mingfang Ji, am1mdGJoQHNpbmEuY29t; Yun Du, ZHV5dW4yMDIwQDEyNi5jb20=
†These authors have contributed equally to this work