- 1Department of Endoscopy, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Guangzhou, China
- 2Department of Musculoskeletal Oncology, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Guangzhou, China
- 3State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China
Background: This study aims to build a focal adhesion-related genes-based prognostic signature (FAS) to accurately predict gastric cancer (GC) prognosis and identify key prognostic genes related to gastric cancer.
Results: Gene expression and clinical data of gastric cancer patients were sourced from Gene Expression Omnibus and The Cancer Genome Atlas. Subsequently, the GEO dataset was randomly distributed into training and test cohorts. The TCGA dataset was used to validate the external cohort. Lasso Cox regression was used to detect OS-related genes in the GEO cohort. A risk score model was established according to the screened genes. A nomogram, based on the clinical characteristics and risk score, was generated to predict the prognosis of gastric cancer patients. Using time-dependent receiver operating characteristic (ROC) and calibration performances, we evaluated the models’ validity. The patients were grouped into a high- or low-risk group depending on the risk score. Low-risk patients exhibited higher OS than high-risk patients (entire cohort: p < 0.001; training cohort: p < 0.001, test cohort: p < 0.001). Furthermore, we found a correlation between high-risk gastric cancer and extracellular matrix (ECM) receptor interaction, high infiltration of macrophages, CD44, and HLA-DOA.
Conclusion: The generated model based on the genetic characteristics of the focal adhesion prognostic gene can aid in the prognosis of gastric cancer patients in the future.
Introduction
Gastric cancer (GC) is one of the most common solid tumors worldwide, responsible for 7.7% of all cancer-related deaths and ranking second only to lung and liver cancers (Sung et al., 2021). Unfortunately, early-stage GC is often asymptomatic, resulting in most patients being diagnosed at advanced stages (Sitarz et al., 2018).
Consequently, an accurate understanding of the heterogeneity of GC is crucial for predicting prognosis and tailoring clinical diagnosis and treatment. Focal adhesion is a complex of proteins that physically connects the extracellular matrix to the actin cytoskeleton and comprises several proteins, including integrins, cofilin proteins, and focal adhesion kinase (FAK) (Paluch et al., 2016). By regulating cell adhesion, migration, and differentiation, focal adhesion is essential for normal physiological functions, and its dysregulation can lead to tumorigenesis and metastasis (Damiano et al., 1999; Landowski et al., 2003). Focal adhesion has been found to upregulate B3 and FAK expressions in GC, which can facilitate cancer cells to resist fluorouracil, leading to treatment failure (Ngabire et al., 2020). In addition, focal adhesion proteins have been shown to promote GC cell invasion by enhancing cell proliferation (Shen et al., 2013). Therefore, inhibiting the focal adhesion signal pathway could potentially lead to effective treatment for GC.
Based on the univariate Cox regression analysis, significant genes associated with prognosis were identified, and a prognostic signature was constructed using Lasso analysis. ROC and Kaplan-Meier (KM) analyses were used to evaluate the performance of the signature.
Next, we constructed a nomogram based on the FAS and corresponding clinical characteristics. Finally, we validated the accuracy of our newly developed nomogram that predicts the prognosis of GC patients using an external validation cohort from TCGA. Overall, our study highlights the crucial role of focal adhesion-related genes in GC prognosis and presents a novel nomogram for predicting OS in GC patients.
Materials and methods
Data collection
The clinicopathological information and corresponding gene expression data of GC patients were obtained from the GEO database (http://www.ncbi.nlm.nih.gov/geo/). A total of 684 cases (GSE13861, GSE29272, GSE62254, and GSE26942) were examined in the entire cohort. 330 TCGA-STAD samples and their corresponding clinicopathological data were extracted from the TCGA (https://portal.gdc.cancer.gov/) to be used for the external validation cohort. A list of focal adhesion-related genes was retrieved from the MSigDB database (https://www.gsea-msigdb.org/gsea/msigdb) to aid this analysis.
Data processing
Gene symbols for each gene matrix file were extracted based on the corresponding platform file used by the Perl software. The batch effect was adjusted using the Empirical Bayes method (“sva” package) among the series. Finally, the entire cohort was randomly divided into a training and a test cohort in a 7:3 ratio.
FAS construction and validation
A total of 199 focal adhesion-related genes were selected from the MSigDB database. The relationship between these genes and gastric cancer prognosis was investigated through univariate Cox, LASSO regression, and multivariate Cox analyses using R packages “glmnet” and “survival.”
We calculated the risk score of every patient using the regression coefficient value and expression of each gene, as follows: = Σβi × Expi, where Expi refers to the gene expression level of the focal adhesion-related genes, and β is the LASSO Cox regression coefficient of the corresponding gene. Based on their risk scores, samples were categorized as high- or low-risk. The prediction accuracy of the signature was evaluated using time-dependent ROC curves and Kaplan-Meier survival analysis.
Establishment and evaluation of a FAS-based nomogram model to predict OS of GC patients
Based on the focal adhesion risk score, the patient’s gender, age, and American Joint Committee on Cancer AJCC stage, a predictive nomogram was developed to accurately predict the OS of GC patients. The precision of the nomogram was evaluated using calibration and receiver operating characteristic (ROC) curves.
Gene set enrichment analysis
The GSEA program was obtained from the GSEA website (http://www.gseamsigdb.org/gsea/index.jsp) to identify the functional enrichment pathways regulated by the focal adhesion -related genes signature. Additionally, the “c2.cp.kegg.v7.4.symbols.gmt” and “c5.go.v7.4.symbols.gmt” gene sets were extracted from the molecular signatures database and used as the target enrichment sets for GSEA analysis.
Calculations of the immune, stromal, and estimate scores
The Estimation R package was used to estimate the immune and stromal component scores in the tumor microenvironment (TME) of each GC sample. The immune and stromal scores were then calculated and displayed, along with an estimated score. A positive correlation between the risk score and the immune, stromal, and overall scores suggests a higher risk score and a greater proportion of corresponding TME components.
The R packages “survival” and “survminer” were used to analyze the TME scores. A total of 684 samples with available survival data were divided into high- and low-scoring groups based on the median values of the immune, stromal, and estimated scores for subsequent survival analysis.
Single-sample gene set enrichment analysis (ssGSEA)
The single sample gene set enrichment analysis (ssGSEA) in the “gsva” R package quantifies the infiltration statuses of 16 immune cells and the activities of 13 immune-related pathways in the high- and low-risk
Drug sensitivity analysis
The drug sensitivity analysis was conducted using cellminer (https://discover.nci.nih.gov/SclcCellMinerCDB/) Database data, screening FDA-approved and clinical trial data, and analyzing the relationship between focal adhesion-related genes expression level and drug sensitivity. Spearman’s correlation analysis was conducted to determine the correlation using R software, and the top 16 drugs were selected.
Validation of a FAS-Based prognostic model in a clinical sample
The accuracy of the results was further validated using the STAD data from the KM-plotter database (https://kmplot.com/).
Validating the expression of focal adhesion-related genes using scRNA-seq data
Download the GSE112301 dataset from the GEO website and use the CreateSeuratObject function from the SeuratR package to create a Seurat object that contains the basic information of the single-cell dataset. Next, quality control is performed on the data, including filtering out low-quality cells, followed by reducing the dimensionality of the data using principal component analysis (PCA). Visualize the PCA results to better distinguish the differences between cells in different tissue. Finally, use the FindConservedMarkers function of Seurat to analyze the gene expression of GC samples and normal samples.
Statistical analyses
The statistical analyses were conducted using R software version 4.0.0, and KM survival analysis assessed the differential OS durations between the high- and low-risk groups. p < 0.05 was set as the significance threshold.
Results
Patient characteristics and establishment of FAS
As illustrated in Figure 1, upon exclusion of cases with a survival time of fewer than 30 days and normal cases, 684 samples were collated in the four GEO datasets (GSE13861, GSE26942, GSE29272, and GSE62254). These cases were randomly divided into a training (478) or test cohort (206) in a 7:3 ratio. Table 1 summarizes the patient clinical characteristics. The “limma” package extracted genes associated with focal adhesion in the GEO database. In the training cohort, the univariate Cox and LASSO regression analyses were conducted to screen eight genes associated with GC patient OS, as depicted in Figure 2A. The risk score was calculated according to a linear combination of the expression levels of the eight focal adhesion-related genes and corresponding regression coefficients (Table 2).
FIGURE 1. The fow chart showing the scheme of our study on focal adhesion prognostic signatures in GC.
FIGURE 2. (A) Determination of the number of factors by the LASSO analysis. Risk score distribution (B), survival status (C), and eight focal adhesion-related genes expression profiles (D) for patients in high-risk and low-risk groups in training cohort.
Risk score = CMOP × (0.3294) + FLNC × (0.2545) + ITGB5 × (0.7737) + LAMC1 × (0.8969) + TBHS1 × (0.5076) + THBS4 × (0.0156) + VEGFB × (0.8368) + VWF × (0.2495).
Based on the optimal cut-off value of the risk score determined using the Survminer R package, the patients were categorized into either a high- or low-risk group. Further independent prognostic analysis of all the key genes revealed a significant association between high expression levels of these genes and poor prognosis in the training cohort (Supplementary Figure S1).
The distribution of the risk scores and the survival statuses of patients in the training cohort are displayed in Figures 2B–D. KM analysis determines the differences between the two groups in the training cohort (Figure 3A). Lastly, time-dependent ROC exhibits the prognostic values of our signature (Figure 3D).
FIGURE 3. (A) Differences in survival between high-risk and low-risk groups of the training cohort (p < 0.001). Differences in survival between high-risk and low-risk groups of the test cohort (p < 0.001) (B) and external cohort (p =0.007) (C). (D) Time-dependent ROC analysis of the FAS in the training cohort. (E) Time-dependent ROC analysis of the FAS in the test cohort. (F) Time-dependent ROC analysis of the FAS in the external cohort. (G) The t-SNE plot in the training cohort. (H) The t-SNE plot in the test cohort. (I) The t-SNE plot in the external cohort. (J) The PCA plot in the training cohort. (K) The PCA plot in the test cohort. (L) The PCA plot in the external cohort.
Validation and evaluation of the prognostic gene signature
Test and external validations were conducted. Consistent with the training cohort results, OS was lower in high-risk patients than in low-risk patients (Figure 3B, p < 0.001 in test cohort; Figure 3C, p = 0.007 in TCGA cohort). The area under the ROC curve demonstrated that the signature could precisely predict GC prognosis (Figures 3E, F). The AUCs for the test and external validation cohorts were 0.667 and 0.627 at 3 years, respectively, and 0.652 and 0.701 at 5 years, respectively. The accuracy of the signature was evaluated using PCA and t-SNE analyses. Both PCA and t-SNE plots revealed that the high- and low-risk groups had different directions in the training (Figures 3G, J), test (Figures 3H, K), and external cohorts (Figures 3I, L).
FAS is an independent predictor of GC
The Cox regression analysis demonstrates the relationship between the risk scores acquired from the prognostic model with other clinical parameters. Based on the uni- and multivariate regression analyses, stage (p < 0.001, HR = 3.369; p < 0.001, HR = 3.050, respectively) and risk score (p < 0.001, HR = 3.314; p < 0.001, HR = 2.980, respectively) were independent OS prognostic factors in the training cohort (Figures 4A, B). The test cohort was validated, whereby both stage (p < 0.001, HR = 3.248; p < 0.001, HR = 2.903, respectively) and risk score (p < 0.001, HR = 3.025; p = 0.002, HR = 2.676, respectively; Figures 4C, D) were demonstrated to be independent risk factors for OS in GC patients.
FIGURE 4. Univariate and multivariate Cox regression analysis showed the relationship between age, gender, stage, risk score, and overall survival, and indicated that risk score could be used as an independent prognostic factor for training cohort (A,B), test cohort (C,D).
Subgroup analysis of the prognostic value of FAS
To investigate the prognostic value of the developed model in different patient populations based on their clinical characteristics, the training cohort was further divided into subgroups, and OS was estimated between high- and low-risk groups in each subgroup. The KM analysis showed that the risk score could distinguish differences between various subgroups, such as age, gender, and stage (Figures 5A–F). Similarly, in the test cohort, the high- and low-risk groups demonstrated differences in age and gender, but no significant differences were observed in the stage I-II subgroup, which may be attributed to the relatively small sample size in our study (Supplementary Figures S2A–F).
FIGURE 5. The high-risk group in training cohort showed a poor prognosis than the low-risk group in different clinical stratification like age (A,B), gender (C,D), stage (E,F).
The relationship between risk score and corresponding clinical characteristics of GC patients (n = 684) in the GEO dataset was analyzed. The risk scores of stage I + II patients were significantly lower than those of stage III + IV patients (p = 1.7e-09, Figure 6A). Similar results were shown in the age subgroup (p = 0.0095, Figure 6B). No significant relationship between the risk score and gender (p = 0.72, Figure 6C).
FIGURE 6. Stratified analysis of the prognostic signature in the training cohort. The relationships between the FAS and age (A), gender (B), stage (C).
Generation of a prognostic nomogram that predicts OS in GC patients
To accurately predict the prognosis of GC patients, a nomogram was developed based on uni- and multivariate regression analyses to predict 1-, 3-, and 5-year OS rates (Figure 7). Additionally, 3 years time-dependent ROC analysis revealed that the sensitivity of the nomogram was higher than other clinicopathological features in training cohort (Figure 8A), test cohort (Figure 8B), entire cohort (Figure 8C) and external cohort (Figure 8D). 5 years time-dependent ROC analysis for nomogram in training cohort (Figure 8E), test cohort (Figure 8F), entire cohort (Figure 8G) and external cohort (Figure 8H). The calibration plots for the training, test, entire cohort and external cohort were in agreement between the actual OS and the predicted from the nomogram (Figures 8I–8L).
FIGURE 8. (A–D) 3 years time-dependent ROC analysis for nomogram in GC. (A) Time-dependent ROC analysis of the nomogram in the training cohort. (B) Time-dependent ROC analysis of nomogram in the test cohort. (C) Time-dependent ROC analysis of the nomogram in the entire cohort. (D) Time-dependent ROC analysis of the nomogram in the external cohort. (E–H) 5 years time-dependent ROC analysis for nomogram in GC. (E) Time-dependent ROC analysis of the nomogram in the training cohort. (F) Time-dependent ROC analysis of nomogram in the test cohort. (G) Time-dependent ROC analysis of the nomogram in the entire cohort. (H) Time-dependent ROC analysis of the nomogram in the external cohort. (I) The calibration plot for training cohort. (J) The calibration plot for test cohort. (K) The calibration plot for entire cohort. (L) The calibration plot for external cohort.
The nomogram generated to predict the 1-, 3-, and 5-year OS rates of GC patients was found to be accurate, as evidenced by the 3-year and 5-year time-dependent ROC analysis and the calibration plots. These results suggest that the nomogram is a reliable tool for predicting the prognosis of GC patients.
GSEA
To explore the functional and signaling pathway differences between the high- and low-risk score groups, GSEA was performed on the gene sets “c5.go.v7.4.symbols.gmt” and “c2.cp.kegg.v7.4.symbols.gmt.” The top five pathways and gene functions in the high- and low-risk groups are displayed in Figures 9A–D.
FIGURE 9. Significantly enriched GO pathways and KEGG pathways in the entire cohort by GSEA. (A) The GSEA analysis for GO pathway in the low-risk group. (B) The GSEA analysis for GO pathway in the high-risk group. (C) The GSEA analysis for KEGG pathway in the low-risk group. (D) The GSEA analysis for KEGG pathway in the high-risk group.
Immune cells infiltration and immune-related pathways
The tumor microenvironment (TME) plays a critical role in regulating tumor treatment resistance and is associated with tumor occurrence, development, and metastasis. It includes various components such as tumor cells, immune cells, stromal cells, and a variety of cytokines Changes in the TME, including alterations in the immune cell components, can promote tumor progression. To analyze the distribution of immune cells in the GC TME and investigate the interaction between GC tumors and immune cells, we utilized the ssGSEA tool to predict 16 common immune cells and 13 immune-related functional components based on GC gene expression profile data.
The low-risk group of patients showed higher ratios of B cells, T regulators, and follicular helper T cells than the high-risk group (Figure 10A). In addition, the low-risk group exhibited higher levels of antigen-presenting cell (APC) co-inhibition, inflammation promotion, and T cell inhibition compared to those with a high-risk score (Figure 10B). Additionally, we demonstrated that the expression levels of CD200, CD28, CD40, CD44, CD86, LAIR1, NRP1, TNFRSF4, TNFRSF8, TNFSF18, TNFSF4, and VTCN1 in the high-risk group were higher than those in the low-risk group (Figure 10C). The findings suggest that the immune microenvironment may be partly associated with the OS prognosis of GC patients with high expression of focal adhesion-related genes.
FIGURE 10. (A) The immune cell between high-risk and low-risk groups in entire cohort; ***p < 0.05, **p < 0.01, *p < 0.001; (B) The immune related function between high-risk and low-risk groups in entire cohort; ***p < 0.05, **p < 0.01, *p < 0.001; (C) The checkpoint differences between high-risk and low-risk groups in entire cohort; ***p < 0.05, **p < 0.01, *p < 0.001; (D) The m6A expression differences between high-risk and low-risk groups in entire cohort.***p < 0.05, **p < 0.01, *p < 0.001.
Correlation of m6A expression
N6-methyladenosine (m6A) is the most abundant RNA modification in eukaryotic cells (Yue et al., 2015). Extensive RNA processing and metabolism research revealed that m6A is a key contributor to cancer development. m6A is a potential prognostic marker involved in multiple aspects of cancer treatment (Ma et al., 2019). To assess the relationship between m6A expression and our GC prognostic signature, the levels of 13 m6A genes in different GC samples were estimated. It is found that an elevated expression of FTO, METTL3, YTHDC1, and YTHDF1 genes in the high-versus low-risk group (Figure 10D).
Correlation between TME subcomponents and the focal adhesion-related genes risk score and outcome of GC patients
TME consists of diverse immune and stromal cells linked to disease development, prognosis, and treatment outcome. Based on our ESTIMATE algorithm, TME was separated and scored into stromal, immune, and estimate subcomponents to investigate potential relationships between this study’s risk scores and TME. A high immune or matrix score indicates a high proportion of the immune or matrix components in the TME. The ESTIMATE score is the sum of the immune and stromal scores, indicating the combined proportion of these two components in TME. In our study, patients in the high-risk group in entire cohorts had higher stromal, immune, or ESTIMATE scores (Figure 11A) than those in the low-risk group.
FIGURE 11. Comparison of stromal score, immune score, ESTIMATE scores, between high- and low-risk groups in the entire cohort (A). (B) Kaplan–Meier curves for overall survival of 684 GC patients according to stromal score. Log-rank test, p < 0.001. (C) Kaplan–Meier curves for overall survival of 684 GC patients according to immune score. Log-rank test, p = 0.933. (D) Kaplan–Meier curves for overall survival of 684 GC patients according to ESTIMATE score. Log-rank test, p = 0.01.
To further investigate the impact of different components of the TME on GC patient survival, the entire cohort was divided into subgroups based on the median immune, stromal, and ESTIMATE scores as cutoff points. As shown in Figures 11B, D, patients with high stromal and ESTIMATE scores had worse overall survival than those with low stromal and ESTIMATE scores (p < 0.001, p = 0.01, respectively). However, the survival rate was similar between patients with high and low immune scores (p = 0.933) (Figure 11C). To better understand the association between the immune microenvironment and GC prognosis, a heatmap was generated to display the distribution of immune cell scores in the high- and low-risk groups, as shown in Figure 12.
Drug sensitivity prediction
The correlation between drug Z score and genes was analyzed, and the top 16 significant drug-gene pairs are displayed in Figure 13. A total of 246 drugs showed statistical differences, as shown in Supplementary Table S2. Among them, Dasatinib, XAV-939, and Staurosporine exhibited the most positive correlation with hub gene expression. In contrast, Palbocic, Oxaliplatin, and Ribavirin were negatively correlated with the expression of hub genes (Figure 13).
FIGURE 13. Correlations between focal adhesion-related genes expression and drug sensitivity. The figure shows the top 16 significant drug-gene pairs with significant correlation. X-axis: gene expression; y-axis: drug sensitivity Z scores.
Verification of a focal adhesion-based prognostic model in a clinical sample
To investigate the prognosis of patients with different hub genes expressions, the clinical data of STAD in the KM-plotter database were analyzed. Patients with high hub genes expression had better overall survival (OS) than those with low expression, except for THBS1 (Figure 14).
FIGURE 14. Univariate survival analysis of the focal adhesion-related genes using Kaplan-Meier curves.
Verification the expression of hub genes using scRNA-seq data
The Single-cell RNA sequencing dataset GSE112302 was used for further analysis in high resolution. A total of three GC and three normal samples were included in our study.
After quality control of the data, standardization and normalization were performed, followed by PCA and UMAP (Supplementary Figures S3A, B). A total of 305 normal cells and 401 tumor cells were included in the analysis. The UMAP plots of each gene in different tissues are displayed in Supplementary Figure S3C. Most genes exhibited high expression in tumor cells, except for ITGB5, which may be due to the limited sample size (Figure 15).
FIGURE 15. Verification results of four hub genes’ expression using scRNA-seq data. (A) COMP; (B) FLNC; (C) ITGB5, (D) LAMC1, (E) THBS1, (F) THBS4, (G) VEGFB, (H) VWF.
Discussion
Patients with GC often do not experience symptoms in the early stages of the disease and may miss the opportunity for surgery due to local or distant metastasis at the time of diagnosis.
The development of the disease is influenced by multiple factors, and relying on a single factor or gene may not be a dependable prognostic marker. In this study, we systematically analyzed a group of genes related to patient survival and identified a strong association between the expression of focal adhesion-related genes and gastric cancer prognosis. The formation and turnover of focal adhesion are critical to tumor cell migration and progression (Eke and Cordes, 2015; Maziveyi et al., 2018; Wu et al., 2021). Therefore, evaluating the prognostic value of focal adhesion-related genes in GC patients is essential.
This study employed a bioinformatics approach and publicly available TCGA and GEO databases to identify focal adhesion-related genes. A risk score was assigned to each case to predict their prognosis, and high-risk cases were found to have a worse prognosis than low-risk ones. Furthermore, a nomogram was developed by combining risk scores and relevant clinical characteristics to demonstrate the accuracy of the prognostic model in predicting 3- and 5-year survival rates in GC patients. The focal adhesion-related genes identified in this study were highly predictive of GC prognosis and accurately characterized individual patient informationPrevious studies have established the association of the focal adhesion-related genes with cancer, specifically GC. In colon cancer, COMP levels were found to be significantly elevated, and were strongly associated with cell adhesion and tumor progression (Nfonsam et al., 2020). IGF1R signaling regulates the biological process of GC by increasing β-Catenin activation, epithelial-mesenchymal transition, and cell proliferation (Xu D et al., 2017). Yang et al. (2021) demonstrated that reducing the expression of ITGB5 using CRISPRa and CRISPRi technologies led to inhibition of cell proliferation. One prior study found that ITGB5 promotes lymph node metastasis in colorectal cancer patient (Capriotti et al., 2020). Another study by Han et al. (2021) showed that knockdown of LAMC1 inhibits GC cell proliferation, migration, invasion, and the Warburg effect by suppressing AKT and MEK/ERK pathways. Additionally, extracellular matrix proteins THBS1 and THBS4 strongly regulate key tumor cell processes, such as proliferation, attachment, adhesion, and migration. Elevated expressions of THBS1 and THBS4 may be closely associated with higher tumor grading and poorer prognosis in GC patients (Chen et al., 2019; Zhang et al., 2021). Filamin C is an essential component of the actin cytoskeleton and is encoded by the FLNC gene. As a member of the filamin family, it forms dimers and plays a crucial role in regulating cell motility, adhesion, and migration (Eden and Frey, 2021). In recent years, studies have found that the expression of FLNC is dysregulated in several types of cancer, including gastric cancer, glioma, liver cancer, and prostate cancer, and it is involved in tumor invasion and metastasis (Kokate et al., 2018; Kamil et al., 2019).
GO analysis was performed to identify the biological functions that are more relevant to high-risk patients. The analysis revealed that high-risk patients exhibit increased ameboidal type cell migration, axon development, blood vessel morphogenesis, circulatory system process, and collagen fibril organization. Furthermore, high-risk patients also undergo cellular processes such as cell division, proliferation, and formation of new cells, indicating a high rate of tumor cell division and proliferation in this patient group. These findings are consistent with the previous clinical conclusion that patients in the high-risk group generally have a poor prognosis. Using our 8-gene signature risk model, we investigated the KEGG functional pathways in high- and low-risk GC patients. Our analysis revealed that the five major pathways identified are generally associated with cancer development. Namely, cell adhesion molecules, complement and coagulation cascades, ECM receptor interaction, focal adhesion and hypertrophic cardiomyopathy were high in the high-risk patients (Francavilla et al., 2009; Jurcak et al., 2019; Yang et al., 2020; Zhang et al., 2020).
Moreover, high-risk patients showed a higher population of macrophages and neutrophils. Tumor-associated macrophages and neutrophils are generally associated with a poor prognosis in GC patients (Li et al., 2019; Gambardella et al., 2020). Macrophages are present at all stages of tumor progression at the primary site and are closely associated with tumor cell invasion (Nielsen and Schmid, 2017).
In addition, the high-risk group exhibited lower APC co-inhibitory effect scores than the low-risk group, indicating weakened antitumor immunity, which may contribute to poor prognosis. Tumor m6A research has recently gained attention, and the levels of FTO, METTL3, and YTHDC1 were significantly higher in the high-risk group compared to the low-risk group. FTO plays a critical role in the progression and metastasis of GC and is associated with low differentiation, lymph node metastasis, TNM stage, and poor prognosis, making it an important molecular marker for monitoring GC (Xu L et al., 2017). Yue et al. (2019) found that overexpression of METTL3 is associated with a poor prognosis in GC patients and promotes epithelial-mesenchymal transition and metastasis in vivo. Another study analyzed various biological information from different human cancer databases and discovered that YTHDF1 mutations are present in approximately 7% of GC patients. Elevated YTHDF1 levels are linked with increased cancer proliferation, invasiveness, and poorer overall survival in patients (Pi et al., 2021).
The KM survival curve also confirmed that differences in gene expression levels have varying effects on patient prognosis, with most patients exhibiting poor survival times when the gene expression is high. n addition, we predicted drugs that are closely related to the gene expression of our hub genes in order to explore their therapeutic effects on tumors. Positive correlation indicates that high expression of hub genes in GC is directly proportional to drug sensitivity. Negative correlation indicates that high gene expression in GC may affect drug efficacy. This study is the first to develop a prognostic model involving focal adhesion in GC patients, which was validated as an excellent predictor of patient overall survival (OS). Moreover, our model provides additional insights into immune infiltration, immune checkpoint markers, and pathway enrichment in different subgroups.
Although the model was validated in various aspects, there are still some limitations that need to be addressed. Firstly, the data used in this study were obtained from the TCGA and GEO databases, and thus, the generalizability of the model to other patient cohorts needs further validation. Additionally, further investigation is required to determine if the genes in the model act synergistically to influence GC patient prognosis. This study investigated the prognostic relevance of focal adhesion genes in GC patients using retrospective analysis. However, its predictive ability needs to be tested in prospective studies to validate its clinical application. Unlike traditional biological research methods, this method was based on a large dataset and possessed the advantages of enhanced efficiency, flexibility, and pertinence. With the continuous development of sequencing technology, this model has the potential for clinical application.
Conclusion
In this study, we developed a prognostic model based on the focal adhesion genes COMP, FLNC, ITGB5, LAMC1, THBS1, THBS4, VEGFB, and VWF to differentiate clinical features and predict the prognosis of GC patients. Our results provide a valuable foundation and direction for future basic experimental and clinical research on GC.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GSE13861, GSE29272 ,GSE62254 and GSE26942 (http://www.ncbi.nlm.nih.gov/geo/); TCGA-STAD (https://portal.gdc.cancer.gov/).
Author contributions
Designation: JL and ZL; Data collection: GZ and TL; Drawing: GZ and TL; Original draft writing: GZ; JL supervised the overall workflow and revised the manuscript and guaranteed of the study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1122580/full#supplementary-material
SUPPLEMENTARY FIGURE S1 | Kaplan-Meier survival curves by the expression level of hub genes. (A) COMP; (B) FLNC; (C) ITGB5, (D) LAMC1, (E) THBS1, (F) THBS4, (G) VEGFB, (H) VWF. The patients were split into high and low expression groups according to the median value of the hub gene expression.
SUPPLEMENTARY FIGURE S2 | Stratified analysis of the prognostic signature in the test cohort. The relationships between the FAS and age (A), gender (B), stage (C).
SUPPLEMENTARY FIGURE S3 | UMAP representation of the pooled cells depicted in different colours by (A) tissue and (B) sample. (C) The expression level of hub genes in the cell clusters of different tissues (gray indicates low, red indicates high).
References
Capriotti, L., Baraldi, E., Mezzetti, B., Limera, C., and Sabbadini, S. (2020). Biotechnological approaches: Gene overexpression, gene silencing, and Genome editing to control fungal and oomycete diseases in grapevine. Int. J. Mol. Sci. 21 (16), 5701. doi:10.3390/ijms21165701
Chen, X., Huang, Y., Wang, Y., Wu, Q., Hong, S., and Huang, Z. (2019). THBS4 predicts poor outcomes and promotes proliferation and metastasis in gastric cancer. J. physiology Biochem. 75 (1), 117–123. doi:10.1007/s13105-019-00665-9
Damiano, J. S., Cress, A. E., Hazlehurst, L. A., Shtil, A. A., and Dalton, W. S. (1999). Cell adhesion mediated drug resistance (CAM-DR): Role of integrins and resistance to apoptosis in human myeloma cell lines. Blood 93 (5), 1658–1667. doi:10.1182/blood.v93.5.1658.405a19_1658_1667
Eden, M., and Frey, N. (2021). Cardiac filaminopathies: Illuminating the divergent role of filamin C mutations in human cardiomyopathy. J. Clin. Med. 10 (4), 577. doi:10.3390/jcm10040577
Eke, I., and Cordes, N. (2015). Focal adhesion signaling and therapy resistance in cancer. Semin. Cancer Biol. 31, 65–75. doi:10.1016/j.semcancer.2014.07.009
Francavilla, C., Maddaluno, L., and Cavallaro, U. (2009). The functional role of cell adhesion molecules in tumor angiogenesis. Semin. Cancer Biol. 19 (5), 298–309. doi:10.1016/j.semcancer.2009.05.004
Gambardella, V., Castillo, J., Tarazona, N., Gimeno-Valiente, F., Martínez-Ciarpaglini, C., Cabeza-Segura, M., et al. (2020). The role of tumor-associated macrophages in gastric cancer development and their potential as a therapeutic target. Cancer Treat. Rev. 86, 102015. doi:10.1016/j.ctrv.2020.102015
Han, Z-R., Jiang, X-L., and Fan, W-C. (2021). LAMC1 is related to the poor prognosis of patients with gastric cancer and facilitates cancer cell malignancies. Neoplasma 68 (4), 711–718. doi:10.4149/neo_2021_201117N1239
Jurcak, N. R., Rucki, A. A., Muth, S., Thompson, E., Sharma, R., Ding, D., et al. (2019). Axon guidance molecules promote perineural invasion and metastasis of orthotopic pancreatic tumors in mice. Gastroenterology 157 (3), 838–850. doi:10.1053/j.gastro.2019.05.065
Kamil, M., Shinsato, Y., Higa, N., Hirano, T., Idogawa, M., Takajo, T., et al. (2019). High filamin-C expression predicts enhanced invasiveness and poor outcome in glioblastoma multiforme. Br. J. Cancer 120 (8), 819–826. doi:10.1038/s41416-019-0413-x
Kokate, S. B., Dixit, P., Poirah, I., Roy, A. D., Chakraborty, D., Rout, N., et al. (2018). Testin and filamin-C downregulation by acetylated Siah2 increases invasiveness of Helicobacter pylori-infected gastric cancer cells. Int. J. Biochem. Cell Biol. 103, 14–24. doi:10.1016/j.biocel.2018.07.012
Landowski, T. H., Olashaw, N. E., Agrawal, D., and Dalton, W. S. (2003). Cell adhesion-mediated drug resistance (CAM-DR) is associated with activation of NF-kappa B (RelB/p50) in myeloma cells. Oncogene 22 (16), 2417–2421. doi:10.1038/sj.onc.1206315
Li, S., Cong, X., Gao, H., Lan, X., Li, Z., Wang, W., et al. (2019). Tumor-associated neutrophils induce EMT by IL-17a to promote migration and invasion in gastric cancer cells. J. Exp. Clin. cancer Res. CR 38 (1), 6. doi:10.1186/s13046-018-1003-0
Ma, S., Chen, C., Ji, X., Liu, J., Zhou, Q., Wang, G., et al. (2019). The interplay between m6A RNA methylation and noncoding RNA in cancer. J. Hematol. Oncol. 12 (1), 121. doi:10.1186/s13045-019-0805-7
Maziveyi, M., Dong, S., Baranwal, S., and SureshAlahari, K. (2018). Nischarin regulates focal adhesion and Invadopodia formation in breast cancer cells. Mol. cancer 17 (1), 21. doi:10.1186/s12943-018-0764-6
Nfonsam, V. N., Jecius, H. C., Janda, J., Omesiete, P. N., Elquza, E., Scott, A. J., et al. (2020). Cartilage oligomeric matrix protein (COMP) promotes cell proliferation in early-onset colon cancer tumorigenesis. Surg. Endosc. 34 (9), 3992–3998. doi:10.1007/s00464-019-07185-z
Ngabire, D., Niyonizigiye, I., Patil, M. P., Seong, Y. A., Seo, Y. B., and Kim, G. D. (2020). M2 macrophages mediate the resistance of gastric adenocarcinoma cells to 5-fluorouracil through the expression of integrin β3, focal adhesion kinase, and cofilin. J. Immunol. Res. 2020, 1731457. doi:10.1155/2020/1731457
Nielsen, S. R., and Schmid., M. C. (2017). "Macrophages as key drivers of cancer progression and metastasis." Mediat. Inflamm. 2017, 9624760. doi:10.1155/2017/9624760
Paluch, E. K., Aspalter, I. M., and Sixt, M. (2016). Focal adhesion-independent cell migration. Annu. Rev. Cell Dev. Biol. 32, 469–490. doi:10.1146/annurev-cellbio-111315-125341
Pi, J., Wang, W., Ji, M., Wang, X., Wei, X., Jin, J., et al. (2021). YTHDF1 promotes gastric carcinogenesis by controlling translation of FZD7. Cancer Res. 81 (10), 2651–2665. doi:10.1158/0008-5472.CAN-20-0066
Shen, Z., Ye, Y., Kauttu, T., Seppanen, H., Vainionpaa, S., Wang, S., et al. (2013). Novel focal adhesion protein kindlin-2 promotes the invasion of gastric cancer cells through phosphorylation of integrin β1 and β3. J. Surg. Oncol. 108 (2), 106–112. doi:10.1002/jso.23353
Sitarz, R., Skierucha, M., Mielko, J., Offerhaus, G. J. A., Maciejewski, R., and Polkowski, W. P. (2018). Gastric cancer: Epidemiology, prevention, classification, and treatment. Cancer Manag. Res. 10, 239–248. doi:10.2147/CMAR.S149619
Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71 (3), 209–249. doi:10.3322/caac.21660
Wu, S., Chen, M., Huang, J., Zhang, F., Lv, Z., Jia, Y., et al. (2021). ORAI2 promotes gastric cancer tumorigenicity and metastasis through PI3K/akt signaling and MAPK-dependent focal adhesion disassembly. Cancer Res. 81 (4), 986–1000. doi:10.1158/0008-5472.CAN-20-0049
Xu, D., Shao, W., Jiang, Y., Wang, X., Liu, Y., and Liu, X. (2017). FTO expression is associated with the occurrence of gastric cancer and prognosis. Oncol. Rep. 38 (4), 2285–2292. doi:10.3892/or.2017.5904
Xu, L., Zhou, R., Yuan, L., Wang, S., Li, X., Ma, H., et al. (2017). IGF1/IGF1R/STAT3 signaling-inducible IFITM2 promotes gastric cancer growth and metastasis. Cancer Lett. 393, 76–85. doi:10.1016/j.canlet.2017.02.014
Yang, X., Chen, L., Mao, Y., Hu, Z., and He, M. (2020). Progressive and prognostic performance of an extracellular matrix-receptor interaction signature in gastric cancer. Dis. Markers 2020, 8816070. doi:10.1155/2020/8816070
Yang, Y., Feng, Q., Hu, K., and Cheng, F. (2021). Using CRISPRa and CRISPRi technologies to study the biological functions of ITGB5, TIMP1, and TMEM176B in prostate cancer cells. Front. Mol. Biosci. 8, 676021. doi:10.3389/fmolb.2021.676021
Yue, B., Song, C., Yang, L., Cui, R., Cheng, X., Zhang, Z., et al. (2019). METTL3-mediated N6-methyladenosine modification is critical for epithelial-mesenchymal transition and metastasis of gastric cancer. Mol. cancer 18 (1), 142. doi:10.1186/s12943-019-1065-4
Yue, Y., Liu, J., and He, C. (2015). RNA N6-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev. 29 (13), 1343–1355. doi:10.1101/gad.262766.115
Zhang, J., Chen, M., Zhao, Y., Xiong, H., Sneh, T., Fan, Y., et al. (2020). Complement and coagulation cascades pathway correlates with chemosensitivity and overall survival in patients with soft tissue sarcoma. Eur. J. Pharmacol. 879, 173121. doi:10.1016/j.ejphar.2020.173121
Keywords: gastric cancer, TCGA, focal adhesion, prognosis, GEO
Citation: Zhao G, Luo T, Liu Z and Li J (2023) Development and validation of focal adhesion-related genes signature in gastric cancer. Front. Genet. 14:1122580. doi: 10.3389/fgene.2023.1122580
Received: 13 December 2022; Accepted: 23 February 2023;
Published: 08 March 2023.
Edited by:
Jianing Xi, Guangzhou Medical University, ChinaReviewed by:
Shudi Li, University of Texas Health Science Center at Houston, United StatesJiapeng Li, Wuhan University of Science and Technology, China
Copyright © 2023 Zhao, Luo, Liu and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jianjun Li, lijj@sysucc.org.cn
†These authors have contributed equally to this work and share first authorship