- 1Department of Oncology Medicine, Fujian Medical University Union Hospital, Fuzhou, China
- 2Department of Clinical Medicine, Shaanxi University of Chinese Medicine, Xianyang, China
- 3Department of Hematology, Fujian Medical University Union Hospital, Fuzhou, China
- 4Department of Surgical Oncology Medicine, Second Affiliated Hospital of Shaanxi University of Chinese Medicine, Xianyang, China
- 5Fujian Medical University Stem Cell Research Institute, Fuzhou, China
- 6Fujian Key Laboratory of Translational Cancer Medicine, Fuzhou, China
Objectives: Sm proteins (SNRPB/D1/D2/D3/E/F/G), involved in pre-mRNA splicing, were previously reported in the tumorigenesis of several cancers. However, their specific role in lung adenocarcinoma (LUAD) remains obscure. Our study aims to feature abnormal expressions and mutations of genes for Sm proteins and assess their potential as therapeutic targets via integrated bioinformatics analysis.
Methods: In this research, we explored the expression pattern and prognostic worth of genes for Sm proteins in LUAD across TCGA, GEO, UALCAN, Oncomine, Metascape, David 6.8, and Kaplan-Meier Plotter, and confirmed its independent prognostic value via univariate and multivariate cox regression analysis. Meanwhile, their expression patterns were validated by RT-qPCR. Gene mutations and co-expression of genes for Sm proteins were analyzed by the cBioPortal database. The PPI network for Sm proteins in LUAD was visualized by the STRING and Cytoscape. The correlations between genes for Sm proteins and immune infiltration were analyzed by using the “GSVA” R package.
Results: Sm proteins genes were found upregulated expression in both LUAD tissues and LUAD cell lines. Moreover, highly expressed mRNA levels for Sm proteins were strongly associated with short survival time in LUAD. Genes for Sm proteins were positively connected with the infiltration of Th2 cells, but negatively connected with the infiltration of mast cells, Th1 cells, and NK cells. Importantly, Cox regression analysis showed that high SNRPD1/E/F/G expression were independent risk factors for the overall survival of LUAD.
Conclusion: Our study showed that SNRPD1/E/F/G could independently predict the prognostic outcome of LUAD and was correlated with immune infiltration. Also, this report laid the foundation for additional exploration on the potential treatment target’s role of SNRPD1/E/F/G in LUAD.
Introduction
Lung cancer is the most well-known sort of malignant tumor worldwide and is the major cause of cancer mortality (Sung et al., 2021). Lung adenocarcinoma (LUAD) has been the most common subtype of Non-small cell cancer (NSCLC) (Anonymous, 2015). LUAD is characterized by a lack of early clinical symptoms, a high rate of distant metastasis and drug resistance, which pose serious challenges to clinical treatment. Currently, treatment methods for LUAD mainly include surgical resection, radiotherapy, chemotherapy, immunotherapy, and molecular targeted therapy (Reck and Rabe, 2017). The best treatment for lung cancer is surgical resection in the early stages of lung cancer. However, the early symptoms of the disease are not obvious, easy to be ignored, which leads to a late diagnosis. The treatment of advanced lung adenocarcinoma is limited, and molecular targeted therapy is a promising choice, as well as immunotherapy. However, because of the lack of effective molecular targets, most drugs remain ineffective in the treatment of LUAD patients, of whom the 5-year survival rate is just 15% (Chen et al., 2016). Hence, it is absolutely necessary to recognize effective and dependable biomarkers to determine poor prognoses and direct treatment strategies.
The spliceosome is a ribonucleoprotein (RNP) with a complex ring-shaped structure, which mainly consists of small nuclear ribonucleoproteins (snRNPs), which is involved in splicing the pre rna into mature mRNA. Seven Sm proteins (SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE, SNRPF, and SNRPG) and a small eponymous small nuclear RNA (snRNA) compose snRNPs (Chari et al., 2008). Accurate splicing is essential for normal cellular functions like cell proliferation, apoptosis, migration, and invasion. Sm proteins, involved in the formation of anti-Sm antibodies, were significant diagnostic biomarkers in autoimmune diseases, such as systemic lupus erythematosus (SLE). Furthermore, priors works have illustrated that the aberrant expressions of genes for Sm proteins are related to some human cancers, including cervical cancer (Zhu et al., 2020), glioblastoma (Correa et al., 2016), breast cancer (Dai et al., 2021), and hepatocellular carcinoma (Zhan et al., 2020).
Up to now, there have been limited studies investigating the connection between the abnormal expressions of genes for Sm proteins and LUAD. For instance, a report (Liu et al., 2019) showed that SNRPB down-regulation inhibited the growth and metastasis of NSCLC cells via RAB26 down-regulation. In the study directed by Valles et al. (2012), SNRPB and SNRPE were proved to be related to poor survival in LUAD patients. However, the precise functional roles of Sm proteins in LUAD are unclear yet. Our study aimed to present abnormal expressions and mutations of genes for Sm proteins and assess their potential as therapeutic targets via integrated bioinformatics analysis, which may be helpful to the treatment of LUAD patients.
Materials and Methods
Data Source and Flow Chart
The GSE40791 dataset data performed by GPL570 (Zhang et al., 2012) were downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/) (Clough and Barrett, 2016). GSE40791 included 100 non-tumor lung tissues and 94 lung adenocarcinoma tissues (N = 100, T = 94). The TGCA-LUAD data (N = 59, T = 535) were downloaded from the TGCA database (https://portal.gdc.cancer.gov) (Tomczak et al., 2015). A flow chart of this study procedure is shown in Figure 1. The characteristics of 535 patients, including their ages, genders, smoking history, TNM stages, pathologic stages, primary therapy outcome, residual tumor and anatomic neoplasm subdivision, are presented in Supplementary Table S1.
Comparison of Genes for Sm Proteins Expression Levels Between LUAD and Corresponding Normal Tissue
The Oncomine database (https://www.oncomine.org) (Rhodes et al., 2004) was used to obtain the data of mRNA expressions for Sm proteins in various cancers, and these data were analyzed via Student’s t-test. And we defined the following threshold: p-value<0.001, fold change >2 and 10% of most highly ranked genes. Furthermore, we compared expression levels of genes for Sm proteins between lung adenocarcinoma samples and normal samples in GSE40791. Moreover, the transcription levels of these genes were validated in the GEPIA 2 database (http://gepia2.cancer-pku.cn) (Tang et al., 2017), including tumor and normal samples from the TCGA and the Genotype-Tissue Expression Project (GTEx) database (https://gtexportal.org/home/) and the TCGA-LUAD data performed by R software.
Cell Culture and RT-qPCR
BEAS-2B cell line (human bronchial epithelial cell line) was purchased from the American Type Culture Collection (Manassas, VA, United States), A549 cell line was purchased from the Type Culture Collection of the Chinese Academy of Sciences, Shanghai, China. Two types of cells were cultured in DMEM with 10% FBS, penicillin (50 U/ml), and streptomycin (50 U/ml). The cells were incubated at 37°C with 5% CO2.
Total RNA of the above cells was extracted using TRIzol (Thermo, United States), and first-strand complementary DNA (cDNA) synthesis from total RNA was carried out using the GoScript Reverse Transcription System (Promega, United States). RT-qPCR was conducted using an AriaMx Real-Time PCR machine (Agilent Technologies, United States) with TB GreenPremix ExTaq II (Takara Bio, Japan). The primer sequences are shown in Supplementary Table S2. RT-qPCR cycle conditions: 3 min at 95°C, 40 cycles of 15 s for 95°C and 60 s for 60°C. Data was normalized to the house-keeping gene GAPDH. The relative gene expression was performed by the 2−ΔΔCt method (Livak and Schmittgen, 2001).
Correlations Between Genes for Sm Proteins and Clinicopathological Parameters
In our study, the transcription levels of genes for Sm proteins in TP53 mutation of LUAD patients were analyzed with the lung adenocarcinoma dataset using the UALCAN database (http://ualcan. path.uab.edu) (Chandrashekar et al., 2017). Moreover, we explored the correlation between genes for Sm proteins gene expressions and tumor stages via the “Expression DIY module” of GEPIA 2 and selected 50 neighboring genes related to genes for Sm proteins using the “Similar Genes Detection” module.
Survival Analysis
The Kaplan-Meier plotter (http://kmplot.com) (Gyorffy et al., 2013; Nagy et al., 2021), including mRNA expression data and patients’ clinical information from GEO and TCGA, is a powerful online tool to further verify the prognostic value of genes in several cancers. The overall survival (OS), free progression (FP), and post-progression survival (PPS) curves of genes for Sm proteins in LUAD were shown via the Kaplan-Meier plotter. Meanwhile, we also explored if the dyregulation of genes for Sm proteins had any impacts on the OS of LUAD patients with smoke history by using the Kaplan Meier plotter. In addition, univariate and multivariate cox regression analysis of the TGCA-LUAD data was performed using the “survival” and the “survminer” R package.
Gene Mutations and Co-Expression Analysis
In this study, the lung adenocarcinoma dataset (TCGA, Firehose Legacy), including data from 230 complete samples of 586 patients, was selected and visualized as the map of gene mutations, expression heatmap, and co-expression map of genes for Sm proteins using the cBioPortal database (http://www. cbioportal.org) (Cerami et al., 2012; Gao et al., 2013). The z-Score threshold was set to ±1.8.
Constructed Protein-Protein Interaction Network and Selected Hub Genes
The tool of STRING (https://string-db.org/) (Szklarczyk et al., 2019) was used to construct the protein-protein interaction (PPI) network between the seven genes for Sm proteins and their 50 frequently neighboring genes. All PPI pairs with a combined score of >0.4 were extracted. Then, we used the “CytoHubba” plugin (v0.1) (Chin et al., 2014) of Cytoscape (v3.8.2) (https://cytoscape.org) (Otasek et al., 2019) to identify hub genes in that PPI network. Furthermore, the online tool of Metascape (http://metascape.org) (Zhou et al., 2019) was used to analyze the MCODE components of that PPI network.
GO and KEGG Enrichment Analyses
Functional enrichment analyses, including gene ontology (GO) analysis comprising cellular component (CC), molecular function (MF), and biological process (BP), and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis, were performed via the tool of DAVID 6.8 (https://david.ncifcrf.gov) (Huang da et al., 2009). Subsequently, the specific enriched terms for GO and KEGG enrichment analysis were visualized using the “clusterProfiler” package (Yu et al., 2012) in R software. Moreover, we presented the network of enriched terms colored by cluster ID using the online tool of Metascape.
Associations Between Genes for Sm Proteins and Immune Infiltration
Finally, we explore the associations between genes for Sm proteins and 24 immune cell types in TGCA-LUAD. We investigated dendritic cell (DC), activated dendritic cell (aDC), immature dendritic cell (iDC),plasmacytoid dendritic cell (pDC), B cells, CD8+ T cells, Cytotoxic T cells, T cells, T helper cells, T help 1 (Th1) cells, Th17 cells, Th2 cells, T central memory (Tcm), T effector memory (Tem), T follicular helper (Tfh), T gamma delta (Tgd), regulatory T Cell (Treg), eosinophils, macrophages, mast cells, neutrophils, natural killer (NK) cells, NK CD56bright cells and NK CD56dim cells (Bindea et al., 2013), and the above associations was performed using single-sample Gene Sets Enrichment Analysis (ssGSEA) algorithm of R package “GSVA” (Hanzelmann et al., 2013) and lollipop charts were produced using R package “ggplot2.”
Statistical Analysis
One-way Analysis of Variance (ANOVA) in the GEPIA 2 database, the log-rank test in Kaplan-Meier survival analysis, and the Cox proportional risk models in univariate and multivariate analyses, were used in statistical analysis. R software (v4.0.2) (http://www.r-project.org) was used for analysis in this study. RT-qPCR data was performed in GraphPad Prism (v9.0.2) (San Diego, CA, United States) and presented as the mean ± S.D. Student’s T-test was used for statistical analyses between the data pairs where appropriate. *p < 0.05, **p < 0.01, ***p < 0.001, and ****p < 0.0001 were considered to represent statistical significance.
Results
The mRNA Levels of Sm Proteins Were Significantly Increased in Lung Cancer Tissues and LUAD Cell Lines
From Figure 2A and Table 1, we could see that the mRNA levels of SNRPB/D1/D2/E/G were higher in lung cancer than in non-cancer tissues. The results were also consistent with the mRNA levels of Sm proteins were upregulated in LUAD compared to normal lung tissues (Figure 2B). Moreover, the relative mRNA level of all Sm proteins in A549 cell line is higher than the relative mRNA level of them in normal lung cell lines, BEAS-2B (Figure 2C). The transcription levels of genes for Sm proteins considerably increased in data of GEPIA 2 (TCGA-LUAD & GTEx), TCGA-LUAD data and pair LUAD data (Figures 3A–C) match the above results.
FIGURE 2. The transcription levels of genes for Sm proteins in lung cancer tissues and lung cell lines. (A) The genes for Sm proteins in different human cancers (ONCOMINE). (B) Box plots of genes for Sm proteins expression between lung cancer and normal tissues in GSE40791. (C) Bar charts of mRNA level of genes for Sm proteins between BEAS-2B and A549 cell line. Students T-test was performed to assess the statistical significance. A p value of <0.05 was regarded as statistically significant.
TABLE 1. The mRNA levels of Sm proteins were significantly higher in lung cancer than in normal lung tissues (ONCOMINE).
FIGURE 3. The transcription levels of genes for Sm proteins in LUAD samples and normal lung samples. (A) The transcription levels of genes for Sm proteins in data of GEPIA 2 (TCGA-LUAD & GTEx). (B) Violin charts of mRNA level for Sm proteins in data of TCGA-LUAD. (C) The mRNA expressions of genes for Sm proteins in pair data of TCGA-LUAD. Students T-test was performed to assess the statistical significance. A p value of < 0.05 was regarded as statistically significant.
The Expressions of Genes for Sm Proteins Were Related to Clinicopathological Parameters of LUAD Patients
We research the correlations between the expressions of genes for Sm proteins, TP53 mutation and cancer stages via the UALCAN database and the GEPIA2 database. From Figure 4A, we could find that the mRNA expressions of genes for Sm proteins in LUAD with TP53 mutation were higher than those without TP53 mutation (Figure 4A). Moreover, as can be seen in Figure 4B, the mRNA levels of SNRPE varied significantly across different LUAD tumor stages (F value = 3.68, Pr (>F)<0.05), whereas SNRPB/D1/D2/D3/F/G had no significant association with LUAD stages (all Pr (>F) > 0.05).
FIGURE 4. The associations between genes for Sm proteins expression levels and clinicopathological parameters of LUAD patients. (A) Correlations between genes for Sm proteins expression levels and TP53 mutation in LUAD (UALCAN). (B) Association between mRNA expression of genes for Sm proteins and different tumor stages (GEPIA 2). ANOVA was performed to assess the statistical significance of the variations. F-value indicates the statistical value of the F test; Pr (>F) indicates p value. A p value of <0.05 was regarded as statistically significant.
The Upregulation of Genes for Sm Proteins is Related to Poor Survival Outcomes in LUAD Patients
Using Kaplan-Meier Plotter, we assessed the prognostic values of genes for Sm proteins in LUAD patients. From Figure 5A, we could find out high expressions of genes for Sm proteins were associated with short OS (all p < 0.05). In addition, high mRNA level of SNRPB/D1/D2/F (HR > 1, p < 0.05) were correlated with poor FP and PPS (all p < 0.05). However, highly expressed SNRPD3 was not obviously related to the FP of LUAD patients (p > 0.05).
FIGURE 5. The survival curves of LUAD patients with genes for Sm proteins and impacts of smoking on the survival of these patients (Kaplan-Meier Plotter). (A) The Kaplan-Meier survival curves were used to evaluate the survival of LUAD patients with Sm proteins. (B) The effects of smoking in the OS of LUAD patients with Sm proteins. The log-rank test was performed in Kaplan-Meier survival analysis. A p value of <0.05 was regarded as statistically significant. HR: hazard ratio.
As is well known, smoking is a common risk factor leading to lung cancer (Jassem et al., 2009). Hence, we explored if the upregulation of genes for Sm proteins had any impacts on the overall survival of LUAD patients. From Figure 5B, in the smoking group, the high expression of SNRPB significantly reduced the OS of LUAD patients. However, in the non-smoking group, the expression of SNRPB had no statistical significance for the overall survival of patients with lung adenocarcinoma. This indicates that SNRPB and smoking factors together could promote the development of lung adenocarcinoma. However, the upregulation of SNRPD1 (HR: 2.36, p < 0.05 vs HR: 1.63, p < 0.05), SNRPD2 (HR: 3.29, p < 0.05 vs HR: 2.52, p < 0.05), and SNRPF (HR: 3.27, p < 0.05 vs HR: 2.18, p < 0.05) had worsening impacts on the OS of LUAD patients without smoking history. Noteworthy, only high expression of SNRPE (HR: 1.28, p < 0.05) was correlated to poor OS of lung squamous cell carcinoma (LUSC) (Supplementary Figure S1). More research is still needed to interpret potential mechanisms of the above-described events.
Moreover, we attempted to evaluate the independent prognostic values of genes for Sm proteins for OS of LUAD patients. The microarray data and clinical information (Supplementary Table S1) of 535 LUAD patients were obtained from TCGA for Cox regression analysis. Univariate Cox regression analysis demonstrated that T, N, M stage, pathologic stage, SNRPD2, SNRPE, and SNRPF were highly correlated with the short OS of LUAD patients (all p < 0.05; Supplementary Table S2). In multivariate Cox regression analysis, SNRPD1, SNRPE, SNRPF, and SNRPG were observed to have significant associations with poor OS of LUAD patients (all p < 0.05; Supplementary Figure S2). Taken together, SNRPD1/E/F/G could be considered as independent prognostic factors for poor OS in LUAD.
Gene Mutations and Co-Expression Analysis of Genes for Sm Proteins in LUAD Patients
Using the cBioPortal database, we explored the mutation rate and co-expression of Sm proteins. As shown in Figures 6A–C, gene amplification was the most common type of genes for Sm proteins mutations (Figure 6A). 22% (51/230) of LUAD patients were found to have more than one gene mutation. SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE, SNRPF, and SNRPG were altered in 2.6%, 4%, 3%, 4%, 8%, 4%, and 3% of the 230 LUAD patients (Figure 6B). Likewise, the Expression Heatmap also displayed the degree of genes for Sm proteins mutations (Figure 6C). SNRPE had the highest mutation rate among genes for Sm proteins in LUAD. Meantime, we performed co-expression analysis for Sm proteins. Significant and positive correlations were observed among genes for Sm proteins in Figure 6D, including SNRPB with SNRPD1/D2/E/F/G, SNRPD1 with SNRPB/D2/D3/E/F/G, SNRPD2 with SNRPB/D1/E/F/G, SNRPD3 with SNRPD1/E/F/G, SNRPE with SNRPB/D1/D2/D3/F/G, SNRPF with SNRPB/D1/D2/D3/E/G, and SNRPG with SNRPB/D1/D2/E/F (Figure 6D).
FIGURE 6. Gene mutations, co-expression, and PPI network for genes for Sm proteins in LUAD (cBioPortal, STRING, Cytoscape, Metascape). (A–C) Gene mutations and expression heatmap of genes for Sm proteins in LUAD patients (cBioPortal). (D) The co-expression network for genes for Sm proteins (cBioPortal). (E) PPI network based on the seven genes for Sm proteins (STRING). (F) PPI network based on genes for Sm proteins and their 50 neighbouring proteins (STRING). (G) Eight hub genes, including PCNA, NUF2, UBE2T, CCDC59, ERCC1, CDCA3, QPCTC, and TROAP, are shown in the PPI network (Cytoscape). (H) Protein-protein interaction network and MCODE components identified in different expressed genes for Sm proteins and 50 most frequently altered neighboring genes (Metascape).
Constructed PPI Network and Selected Hub Genes
To explore the possible genes for Sm proteins protein-mediated biological pathways in lung adenocarcinoma, we constructed a protein interaction network based on the seven Sm proteins (Figure 6E) and their 50 frequently neighboring proteins (Figure 6F) via the STRING database. Meantime, we used Cytoscape software to find out these hub genes related to genes for Sm proteins-mediated biological pathways. As was shown in Figure 6G, eight hub genes, including PCNA, NUF2, UBE2T, CCDC59, ERCC1, CDCA3, QPCTC, and TROAP, were tightly correlated with the alterations of genes for Sm proteins. Furthermore, we extracted the three most meaningful MCODE components from the PPI network between Sm proteins and their 50 frequently neighboring proteins. As was shown in Figure 6H, Sm core complex, spliceosome, U2-SnRNP, chromosome segregation, chromosome, centromeric region, cell division, and nucleotide excision repair were related to biological function.
GO and KEGG Enrichment Analyses in LUAD Patients
Afterward, genes for Sm proteins and their 50 neighboring genes were analyzed via the tool of DAVID 6.8 for GO and KEGG functional enrichment analysis. The results were shown in Figure 7 and Supplementary Figure S3. Biological processes (BP) included spliceosomal snRNP assembly, histone mRNA metabolic process, negative regulation of nuclear division, nuclear division, and mitotic spindle assembly checkpoint (Figure 7A). Cellular components (CC) suggested that SNRP genes existed mainly in methylosome, U4 snRNP, U2-type catalytic step 2 spliceosome, SMN-Sm protein complex, U1 snRNP, U2 snRNP, and U12-type spliceosomal complex (Figure 7A). Molecular function (MF) indicated that SNRP genes were related to the structural constituents of catalytic activity, acting on DNA, ribonucleoprotein complex binding, DNA N-glycosylase activity, histone kinase activity, and ubiquitin-protein transferase regulator activity (Figure 7A). KEGG pathway enrichment analysis suggested that spliceosome, base excision repair, nucleotide excision repair, cell cycle, dna replication, and mismatch repair were significantly associated with the alterations of sm proteins. The complex process of mRNA/RNA splicing was shown in Supplementary Figure S4. In addition, the KEGG chord plot showed that PCNA was related to 5 vital KEGG pathways. Thus, PCNA may be a very critical gene (Figure 7B). To better understand the association among different enriched terms, the KEGG network of enriched terms, colored by cluster ID, was constructed via the online tool of Metascape (Figure 7C).
FIGURE 7. Functional and enrichment analysis for Sm proteins genes and their 50 neighboring genes in LUAD (DAVID 6.8, Metascape). (A) GO enrichment analysis of genes for Sm proteins and their 50 neighboring genes (DAVID 6.8). (B) Chord diagram of KEGG analyses results. (C) Network of seven enriched terms, colored by cluster ID (Metascape).
Associations Between Genes for Sm Proteins and Immune Infiltration
High genes for Sm proteins expression was considerably related to abundance infiltration of immune cell in the tumor immune microenvironment (Figure 8). Thus, further research of genes for Sm proteins in 24 immune cell populations manifested that genes for Sm proteins had positive correlations to Th2 cells, Tgd cells, but an adverse correlation to Th1 cells, mast cells, and NK cells.
Discussion
Lung cancer has been the major cause of tumor occurrence and mortality around the world (Sung et al., 2021). LUAD accounts for half part of lung cancer with an extremely low survival rate (Chen et al., 2016). Although the research on the biological characteristics of lung adenocarcinoma has made some progress in the past few decades, the tumor progression mechanism of lung adenocarcinoma is still unclear. Through literature review and online database bioinformatics analysis and experimental verification, this study verified that genes for Sm proteins are highly expressed in lung adenocarcinoma and have a significant impact on the prognosis of lung cancer. SnRNPs, mainly responsible for splicing the pre-RNA into mRNA, are composed of seven genes for Sm proteins and a small eponymous snRNA (Chari et al., 2008). Accurate splicing is essential to ensure normal cellular function like cell proliferation, apoptosis, migration, and invasion.
This study covered many aspects, including differential expression analysis, association with clinicopathological parameters, survival analysis, gene mutations, co-expression, functional enrichment analysis, and correlation with immune infiltration, revealing the roles of SNRPB/D1/D2/D3/E/F/G in LUAD.
Previous study reported that SNRPB promotes the tumor formation of NSCLC by regulating RAB26 and SNRPB may predict response to cisplatin-based chemotherapy for NSCLC patients (Liu et al., 2019; Liu et al., 2021). Here, the mRNA expression of SNRPB was found to be upregulated in LUAD compared to non-cancer tissues. Also, SNRPB is highly expressed in cervical cancer (Zhu et al., 2020), glioma (Correa et al., 2016), and hepatocellular cancer (Zhan et al., 2020), which is a prognostic factor. The mechanistic study also reported that overexpressed SNRPB played a carcinogenic role in the progression of LUAD and was mediated by c-Myc (Peng et al., 2020). Moreover, SNRPB was highly correlated with TP53 mutation, tumor grades, and cancer stages. Interestingly, our study revealed that SNRPB up-regulation was associated with poor OS in LUAD patients, especially in patients with smoking history. Taken together, our study revealed that SNRPB could promoted tumor progression in LUAD.
It has been generally accepted that SNRPD1 is related to SLE (Riemekasten et al., 1998). However, only a few studies have reported the role of SNRPD1 in human cancers. SNRPD1 down-regulation was related to poorer survival in patients with ovarian cancer (Bao et al., 2020). SNRPD1 up-regulation contributed to breast cancer cell proliferation (Dai et al., 2021). Here, we found that SNRPD1 expression was significantly up-regulated in LUAD compared to normal tissues or normal cell lines. Furthermore, SNRPD1 expression was correlated with TP53 mutation, tumor grades, and cancer stages. Overexpression of SNRPD1 could be relevant to the poor OS in LUAD patients. Multivariate Cox regression analysis indicated that SNRPD1 was an independent prognostic factor for poorer OS of LUAD patients. Taken together, all of the above findings support the opinion that SNRPD1 plays an essential role in the carcinogenic effect of LUAD.
SNRPD2 is found closely associated with several cancers, including triple-negative breast cancer (TNBC) (Koedoot et al., 2021) and hepatocellular carcinoma. Interestingly, SNRPD2 and SNRPG were not only major pathogenic genes of Alzheimer’s disease but also bridge genes (Tao et al., 2020). However, it is not yet clear to us the specific role that SNRPD2 plays in LUAD. Here, we report SNRPD2 was found overexpressed in LUAD. Additionally, high SNRPD2 expression was correlated with TP53 mutation, tumor grades, and cancer stages.
SNRPD3 (smD3), like SNRPD2, also plays a vital role in TNBC. Moreover, several studies have shown that SNRPD3 expression is relevant to breast cancer (Koedoot et al., 2021) and NSCLC (Blijlevens et al., 2020). For instance, it was investigated that silencing SNRPD3 was able to promote TP53 expression and kill NSCLC cells effectively (Olst et al., 2017). In this study, higher SNRPD3 expression was found in LUAD. Moreover, SNRPD3 expression was associated with TP53 mutation, tumor grades, and cancer stages. Multivariate Cox regression analysis suggested a strong association between overexpressed SNRPD3 with the poor OS of LUAD patients, which seemed congruous with the oncogenic role of SNRPD3.
SNRPE protein is a core component of Sm proteins, which has been reported in some malignancies, including bladder cancer (Tapak et al., 2015), prostate cancer (Anchi et al., 2012), hepatocellular carcinoma (Jia et al., 2011), and non-small cell lung cancer (Valles et al., 2012). As an example, SNRPE was a deregulated RNA metabolism-related genes of LUAD by bioinformatics analysis (Valles et al., 2012). Similarly, in our report, higher SNRPE expression was found in LUAD than in normal lung tissues. Furthermore, SNRPE expression was relevant to TP53 mutation and tumor grades. Overexpressed SNRPE was an independent prognostic factor for poorer OS in LUAD patients by cox analysis. Multivariate Cox regression analysis indicated that SNRPE was an independent prognostic factor for poorer OS of LUAD patients, suggesting that SNRPG played a carcinogenic role in LUAD. Taken together, our study revealed that SNRPE could promoted oncogenesis in LUAD.
SNRPF was a novel biomarker of colorectal cancer via complex gene interaction networks (Sun et al., 2016). However, the role of SNRPF was rarely reported in LUAD. In our study, highly expressed SNRPF was found in LUAD compared to non-cancer tissues. Moreover, abnormal SNRPF expression could be relevant to TP53 mutation, tumor grades, and cancer stages. Multivariate Cox regression analysis indicated that SNRPDF was an independent prognostic factor for poorer OS of LUAD patients. Given the above results, our study showed that SNRPF could play a carcinogenic role in LUAD.
Downregulated SNRPG could inhabit glioblastoma cells proliferation by p53 signaling pathway (Lan et al., 2020). In this paper, higher mRNA expression of SNRPG was explored in LUAD. Furthermore, SNRPD1 expression could also be relevant to TP53 mutation, tumor grades, and cancer stages. Multivariate Cox regression analysis indicated that SNRPG was an independent prognostic factor for poorer OS of LUAD patients, suggesting that SNRPG played a carcinogenic role in LUAD.
Notably, in this paper, we assessed the relation between genes for Sm proteins and 22 immune infiltration cells in LUAD. genes for Sm proteins gene expressions were positively related to the infiltration of Th2 cells and negatively related to the infiltration of Th1 cells, mast cells, and NK cells. Furthermore, prior studies also suggested that spliceosome was tightly associated with the immune microenvironment. Nevertheless, further studies are still needed to develop immunosuppressants of individual genes for Sm proteins members and apply them to the diagnostic of LUAD.
In this work, there are a few limitations that need to be recognized. Firstly, the data used to assess the prognostic worth of genes for Sm proteins in LUAD patients were mainly from the TGCA database and GEO database. Although the TGCA and GEO sequencing data were experimentally confirmed, additional large sample studies on LUAD patients from other databases are necessary to validate our results. Secondly, Cox analysis indicated that the expressions of SNRPD1//E/F/G were correlation to shorter OS in LUAD. Consequently, SNRPD1/E/F/G were considered as independent prognostic factors and potential prognostic biomarkers for LUAD. Finally, further experiments in cells and animal models are needed to elucidate the underlying mechanisms of how genes for Sm proteins play roles in LUAD.
Taken together, our work illustrates that SNRPD1/E/F/G could independently predict the prognostic outcome of LUAD and was correlated with immune infiltration. Our findings laid the foundation for further exploration on the promising therapy target’s role of Sm proteins in LUAD.
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.
Author Contributions
GL, FL, and MC contributed to the conception of the study; GL and YL performed cell culture and RT-qPCR experiment; GL, FL, and MC contributed significantly to analysis and manuscript preparation; GL, FL, and MC performed the data analyses and wrote the manuscript; GL, FL, MC, YD, and PH helped perform the analysis with constructive discussions, YD and PH revised the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We thank all the members for participating in this study. Also we thank the Fujian Medical University Union Hospital for article fee reimbursement.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.813285/full#supplementary-material.
Abbreviations
BP, biological processes; CC, cellular components; FP, free progression; GEO, gene expression omnibus; GO, gene ontology; GTEx, genotype-tissue expression project; KEGG, kyoto encyclopedia of genes and genomes; LUAD, lung adenocarcinoma; MF, molecular function; OS, overall survival; PPS, post-progression survival; SLE, systemic lupus erythematosus; ssGSEA, single-sample gene sets enrichment analysis; snRNA, small nuclear RNA; TCGA, the cancer genome atlas.
References
Anchi, T., Tamura, K., Furihata, M., Satake, H., Sakoda, H., Kawada, C., et al. (2012). SNRPE Is Involved in Cell Proliferation and Progression of High-Grade Prostate Cancer through the Regulation of Androgen Receptor Expression. Oncol. Lett. 3, 264–268. doi:10.3892/ol.2011.505
Anonymous (2015). Non-small-cell Lung Cancer. Nat. Rev. Dis. Primers. 1, 15048. doi:10.1038/nrdp.2015.48
Bao, M., Zhang, L., and Hu, Y. (2020). Novel Gene Signatures for Prognosis Prediction in Ovarian Cancer. J. Cel. Mol. Med. 24, 9972–9984. doi:10.1111/jcmm.15601
Bhattacharjee, A., Richards, W. G., Staunton, J., Li, C., Monti, S., Vasa, P., et al. (2001). Classification of Human Lung Carcinomas by mRNA Expression Profiling Reveals Distinct Adenocarcinoma Subclasses. Proc. Natl. Acad. Sci. 98, 13790–13795. doi:10.1073/pnas.191502998
Bindea, G., Mlecnik, B., Tosolini, M., Kirilovsky, A., Waldner, M., Obenauf, A. C., et al. (2013). Spatiotemporal Dynamics of Intratumoral Immune Cells Reveal the Immune Landscape in Human Cancer. Immunity 39, 782–795. doi:10.1016/j.immuni.2013.10.003
Blijlevens, M., Komor, M. A., Sciarrillo, R., Smit, E. F., Fijneman, R. J. A., and Van Beusechem, V. W. (2020). Silencing Core Spliceosome Sm Gene Expression Induces a Cytotoxic Splicing Switch in the Proteasome Subunit Beta 3 mRNA in Non-small Cell Lung Cancer Cells. Int. J. Mol. Sci. 21, 4192. doi:10.3390/ijms21124192
Cerami, E., Gao, J., Dogrusoz, U., Gross, B. E., Sumer, S. O., Aksoy, B. A., et al. (2012). The cBio Cancer Genomics Portal: An Open Platform for Exploring Multidimensional Cancer Genomics Data: Figure 1. Cancer Discov. 2, 401–404. doi:10.1158/2159-8290.cd-12-0095
Chandrashekar, D. S., Bashel, B., Balasubramanya, S. A. H., Creighton, C. J., Ponce-Rodriguez, I., Chakravarthi, B. V. S. K., et al. (2017). UALCAN: A Portal for Facilitating Tumor Subgroup Gene Expression and Survival Analyses. Neoplasia 19, 649–658. doi:10.1016/j.neo.2017.05.002
Chari, A., Golas, M. M., Klingenhäger, M., Neuenkirchen, N., Sander, B., Englbrecht, C., et al. (2008). An Assembly Chaperone Collaborates with the SMN Complex to Generate Spliceosomal SnRNPs. Cell 135, 497–509. doi:10.1016/j.cell.2008.09.020
Chen, W., Zheng, R., Baade, P. D., Zhang, S., Zeng, H., Bray, F., et al. (2016). Cancer Statistics in China, 2015. CA: A Cancer J. Clinicians 66, 115–132. doi:10.3322/caac.21338
Chin, C.-H., Chen, S.-H., Wu, H.-H., Ho, C.-W., Ko, M.-T., and Lin, C.-Y. (2014). cytoHubba: Identifying Hub Objects and Sub-networks from Complex Interactome. BMC Syst. Biol. 88 (Suppl. l), S11. doi:10.1186/1752-0509-8-s4-s11
Clough, E., and Barrett, T. (2016). The Gene Expression Omnibus Database. Methods Mol. Biol. 1418, 93–110. doi:10.1007/978-1-4939-3578-9_5
Correa, B. R., De Araujo, P. R., Qiao, M., Burns, S. C., Chen, C., Schlegel, R., et al. (2016). Functional Genomics Analyses of RNA-Binding Proteins Reveal the Splicing Regulator SNRPB as an Oncogenic Candidate in Glioblastoma. Genome Biol. 17, 125. doi:10.1186/s13059-016-0990-4
Dai, X., Yu, L., Chen, X., and Zhang, J. (2021). SNRPD1 Confers Diagnostic and Therapeutic Values on Breast Cancers through Cell Cycle Regulation. Cancer Cel Int. 21, 229. doi:10.1186/s12935-021-01932-w
Gao, J., Aksoy, B. A., Dogrusoz, U., Dresdner, G., Gross, B., Sumer, S. O., et al. (2013). Integrative Analysis of Complex Cancer Genomics and Clinical Profiles Using the cBioPortal. Sci. Signal. 6, l1. doi:10.1126/scisignal.2004088
Garber, M. E., Troyanskaya, O. G., Schluens, K., Petersen, S., Thaesler, Z., Pacyna-Gengelbach, M., et al. (2001). Diversity of Gene Expression in Adenocarcinoma of the Lung. Proc. Natl. Acad. Sci. 98, 13784–13789. doi:10.1073/pnas.241500798
Győrffy, B., Surowiak, P., Budczies, J., and Lánczky, A. (2013). Online Survival Analysis Software to Assess the Prognostic Value of Biomarkers Using Transcriptomic Data in Non-small-cell Lung Cancer. PLoS ONE 8, e82241. doi:10.1371/journal.pone.0082241
Hänzelmann, S., Castelo, R., and Guinney, J. (2013). GSVA: Gene Set Variation Analysis for Microarray and RNA-Seq Data. BMC Bioinformatics 14, 7. doi:10.1186/1471-2105-14-7
Hou, J., Aerts, J., Den Hamer, B., Van Ijcken, W., Den Bakker, M., Riegman, P., et al. (2010). Gene Expression-Based Classification of Non-small Cell Lung Carcinomas and Survival Prediction. PLoS ONE 5, e10312. doi:10.1371/journal.pone.0010312
Huang, D. W., Sherman, B. T., and Lempicki, R. A. (2009). Systematic and Integrative Analysis of Large Gene Lists Using DAVID Bioinformatics Resources. Nat. Protoc. 4, 44–57. doi:10.1038/nprot.2008.211
Jassem, E., Szymanowska, A., Siemińska, A., and Jassem, J. (2009). Smoking and Lung Cancer. Pneumonol. Alergol. Pol. 77, 469–473.
Jia, D., Wei, L., Guo, W., Zha, R., Bao, M., Chen, Z., et al. (2011). Genome-wide Copy Number Analyses Identified Novel Cancer Genes in Hepatocellular Carcinoma. Hepatology 54, 1227–1236. doi:10.1002/hep.24495
Koedoot, E., Van Steijn, E., Vermeer, M., González-Prieto, R., Vertegaal, A. C. O., Martens, J. W. M., et al. (2021). Splicing Factors Control Triple-Negative Breast Cancer Cell Mitosis through SUN2 Interaction and Sororin Intron Retention. J. Exp. Clin. Cancer Res. 40, 82. doi:10.1186/s13046-021-01863-4
Lan, Y., Lou, J., Hu, J., Yu, Z., Lyu, W., and Zhang, B. (2020). Downregulation of SNRPG Induces Cell Cycle Arrest and Sensitizes Human Glioblastoma Cells to Temozolomide by Targeting Myc through a P53-dependent Signaling Pathway. Cancer Biol. Med. 17, 112–131. doi:10.20892/j.issn.2095-3941.2019.0164
Liu, N., Chen, A., Feng, N., Liu, X., and Zhang, L. (2021). SNRPB Is a Mediator for Cellular Response to Cisplatin in Non-small-cell Lung Cancer. Med. Oncol. 38, 57. doi:10.1007/s12032-021-01502-0
Liu, N., Wu, Z., Chen, A., Wang, Y., Cai, D., Zheng, J., et al. (2019). SNRPB Promotes the Tumorigenic Potential of NSCLC in Part by Regulating RAB26. Cell Death Dis. 10, 667. doi:10.1038/s41419-019-1929-y
Livak, K. J., and Schmittgen, T. D. (2001). Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method. Methods 25, 402–408. doi:10.1006/meth.2001.1262
Nagy, Á., Munkácsy, G., and Győrffy, B. (2021). Pancancer Survival Analysis of Cancer Hallmark Genes. Sci. Rep. 11, 6047. doi:10.1038/s41598-021-84787-5
Olst, E. S.-V., Blijlevens, M., De Menezes, R. X., Van Der Meulen-Muileman, I. H., Smit, E. F., and Van Beusechem, V. W. (2017). A Genome-wide siRNA Screen for Regulators of Tumor Suppressor P53 Activity in Human Non-small Cell Lung Cancer Cells Identifies Components of the RNA Splicing Machinery as Targets for Anticancer Treatment. Mol. Oncol. 11, 534–551. doi:10.1002/1878-0261.12052
Otasek, D., Morris, J. H., Bouças, J., Pico, A. R., and Demchak, B. (2019). Cytoscape Automation: Empowering Workflow-Based Network Analysis. Genome Biol. 20, 185. doi:10.1186/s13059-019-1758-4
Peng, N., Li, J., He, J., Shi, X., Huang, H., Mo, Y., et al. (2020). c‐Myc‐mediated SNRPB Upregulation Functions as an Oncogene in Hepatocellular Carcinoma. Cell Biol. Int. 44, 1103–1111. doi:10.1002/cbin.11307
Reck, M., and Rabe, K. F. (2017). Precision Diagnosis and Treatment for Advanced Non-small-cell Lung Cancer. N. Engl. J. Med. 377, 849–861. doi:10.1056/nejmra1703413
Rhodes, D. R., Yu, J., Shanker, K., Deshpande, N., Varambally, R., Ghosh, D., et al. (2004). ONCOMINE: a Cancer Microarray Database and Integrated Data-Mining Platform. Neoplasia 6, 1–6. doi:10.1016/s1476-5586(04)80047-2
Riemekasten, G., Marell, J., Trebeljahr, G., Klein, R., Hausdorf, G., Häupl, T., et al. (1998). A Novel Epitope on the C-Terminus of SmD1 Is Recognized by the Majority of Sera from Patients with Systemic Lupus Erythematosus. J. Clin. Invest. 102, 754–763. doi:10.1172/jci2749
Sun, X., Zhang, H., Luo, L., Zhong, K., Ma, Y., Fan, L., et al. (2016). Comparative Proteomic Profiling Identifies Potential Prognostic Factors for Human clear Cell Renal Cell Carcinoma. Oncol. Rep. 36, 3131–3138. doi:10.3892/or.2016.5159
Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA A. Cancer J. Clin. 71, 209–249. doi:10.3322/caac.21660
Szklarczyk, D., Gable, A. L., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., et al. (2019). STRING V11: Protein-Protein Association Networks with Increased Coverage, Supporting Functional Discovery in Genome-wide Experimental Datasets. Nucleic Acids Res. 47, D607–D613. doi:10.1093/nar/gky1131
Tang, Z., Li, C., Kang, B., Gao, G., Li, C., and Zhang, Z. (2017). GEPIA: a Web Server for Cancer and normal Gene Expression Profiling and Interactive Analyses. Nucleic Acids Res. 45, W98–W102. doi:10.1093/nar/gkx247
Tao, Y., Han, Y., Yu, L., Wang, Q., Leng, S. X., and Zhang, H. (2020). The Predicted Key Molecules, Functions, and Pathways that Bridge Mild Cognitive Impairment (MCI) and Alzheimer's Disease (AD). Front. Neurol. 11, 233. doi:10.3389/fneur.2020.00233
Tapak, L., Saidijam, M., Sadeghifar, M., Poorolajal, J., and Mahjub, H. (2015). Competing Risks Data Analysis with High-Dimensional Covariates: an Application in Bladder Cancer. Genomics, Proteomics & Bioinformatics 13, 169–176. doi:10.1016/j.gpb.2015.04.001
Tomczak, K., Czerwińska, P., and Wiznerowicz, M. (2015). The Cancer Genome Atlas (TCGA): an Immeasurable Source of Knowledge. Contemp. Oncol. (Pozn) 19, A68–A77. doi:10.5114/wo.2014.47136
Valles, I., Pajares, M. J., Segura, V., Guruceaga, E., Gomez-Roman, J., Blanco, D., et al. (2012). Identification of Novel Deregulated RNA Metabolism-Related Genes in Non-small Cell Lung Cancer. PLoS ONE 7, e42086. doi:10.1371/journal.pone.0042086
Wachi, S., Yoneda, K., and Wu, R. (2005). Interactome-transcriptome Analysis Reveals the High Centrality of Genes Differentially Expressed in Lung Cancer Tissues. Bioinformatics 21, 4205–4208. doi:10.1093/bioinformatics/bti688
Yamagata, N., Shyr, Y., Yanagisawa, K., Edgerton, M., Dang, T. P., Gonzalez, A., et al. (2003). A Training-Testing Approach to the Molecular Classification of Resected Non-small Cell Lung Cancer. Clin. Cancer Res. 9, 4695–4704.
Yu, G., Wang, L.-G., Han, Y., and He, Q.-Y. (2012). clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters. OMICS: A J. Integr. Biol. 16, 284–287. doi:10.1089/omi.2011.0118
Zhan, Y.-T., Li, L., Zeng, T.-T., Zhou, N.-N., Guan, X.-Y., and Li, Y. (2020). SNRPB-mediated RNA Splicing Drives Tumor Cell Proliferation and Stemness in Hepatocellular Carcinoma. Aging 13, 537–554. doi:10.18632/aging.202164
Zhang, Y., Foreman, O., Wigle, D. A., Kosari, F., Vasmatzis, G., Salisbury, J. L., et al. (2012). USP44 Regulates Centrosome Positioning to Prevent Aneuploidy and Suppress Tumorigenesis. J. Clin. Invest. 122, 4362–4374. doi:10.1172/jci63084
Zhou, Y., Zhou, B., Pache, L., Chang, M., Khodabakhshi, A. H., Tanaseichuk, O., et al. (2019). Metascape Provides a Biologist-Oriented Resource for the Analysis of Systems-Level Datasets. Nat. Commun. 10, 1523. doi:10.1038/s41467-019-09234-6
Keywords: SM proteins, prognostic biomarkers, immune infiltration, target therapy, lung adenocarcinoma
Citation: Liu G, Li F, Chen M, Luo Y, Dai Y and Hou P (2022) SNRPD1/E/F/G Serve as Potential Prognostic Biomarkers in Lung Adenocarcinoma. Front. Genet. 13:813285. doi: 10.3389/fgene.2022.813285
Received: 11 November 2021; Accepted: 14 February 2022;
Published: 03 March 2022.
Edited by:
Changwei Li, Shanghai Jiao Tong University, ChinaCopyright © 2022 Liu, Li, Chen, Luo, Dai and Hou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yinhai Dai, jx2686@163.com; Peifeng Hou, houpeifeng123@fjmu.edu.cn
†These authors have contributed equally to this work and share first authorship