Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 21 October 2021
Sec. Gastrointestinal Cancers
This article is part of the Research Topic Advanced Molecular Targets in the Diagnosis and Treatment of Gastrointestinal Cancers View all 55 articles

Prognostic and Predictive Value of Transcription Factors Panel for Digestive System Carcinoma

Guoxu Fang,&#x;Guoxu Fang1,2†Jianhui Fan&#x;Jianhui Fan3†Zongren Ding,&#x;Zongren Ding1,2†Rong LiRong Li4Kongying Lin,Kongying Lin1,2Jun Fu,Jun Fu1,2Qizhen Huang,Qizhen Huang2,5Yongyi Zeng*Yongyi Zeng1*Jingfeng Liu,,*Jingfeng Liu1,2,6*
  • 1Department of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical University, Fuzhou, China
  • 2The Big Data Institute of Southeast Hepatobiliary Health Information, Mengchao Hepatobiliary Hospital of Fujian Medical University, Fuzhou, China
  • 3Department of Hepatology for Pregnancy, Mengchao Hepatobiliary Hospital of Fujian Medical University, Fuzhou, China
  • 4Department of Pathology, Eastern Hepatobiliary Surgery Hospital of Naval Medical University, Shanghai, China
  • 5Department of Radiation Oncology, Mengchao Hepatobiliary Hospital of Fujian Medical University, Fuzhou, China
  • 6Department of Hepatopancreatobiliary Surgery, Fujian Medical University Cancer Hospital, Fujian Cancer Hospital, Fuzhou, China

Purpose: Digestive system carcinoma is one of the most devastating diseases worldwide. Lack of valid clinicopathological parameters as prognostic factors needs more accurate and effective biomarkers for high-confidence prognosis that guide decision-making for optimal treatment of digestive system carcinoma. The aim of the present study was to establish a novel model to improve prognosis prediction of digestive system carcinoma, with a particular interest in transcription factors (TFs).

Materials and Methods: A TF-related prognosis model of digestive system carcinoma with data from TCGA database successively were processed by univariate and multivariate Cox regression analyses. Then, for evaluating the prognostic prediction value of the model, ROC curve and survival analysis were performed by external data from GEO database. Furthermore, we verified the expression of TFs expression by qPCR in digestive system carcinoma tissue. Finally, we constructed a TF clinical characteristics nomogram to furtherly predict digestive system carcinoma patient survival probability with TCGA database.

Results: By Cox regression analysis, a panel of 17 TFs (NFIC, YBX2, ZBTB47, ZNF367, CREB3L3, HEYL, FOXD1, TIGD1, SNAI1, HSF4, CENPA, ETS2, FOXM1, ETV4, MYBL2, FOXQ1, ZNF589) was identified to present with powerful predictive performance for overall survival of digestive system carcinoma patients based on TCGA database. A nomogram that integrates TFs was established, allowing efficient prediction of survival probabilities and displaying higher clinical utility.

Conclusion: The 17-TF panel is an independent prognostic factor for digestive system carcinoma, and 17 TFs based nomogram might provide implication an effective approach for digestive system carcinoma patient management and treatment.

Introduction

Digestive system carcinoma, including esophageal, gastric, colon, liver, and pancreatic cancers, is one of the most common malignancies in the world. It is the third leading cause of cancer-related death worldwide and is notorious for its poor prognosis, especially in colon cancer. Despite surgical interventions and other treatments, such as targeted therapy and immunotherapy, 5-year survival rates are not good (1). Prognostic assessment has important clinical significance in guiding the selection of the best treatment plan for patients with digestive system carcinoma. Although clinical outcomes in patients with digestive tract tumors are primarily related to risk factors, including TNM stage, metastasis, and tumor size, these currently used evaluation measures do not provide accurate and individualized prognostic information to facilitate effective treatment selection. Therefore, it is important to study effective prognostic predictors that influence overall survival (OS), and we should pay more attention to digestive system carcinoma and it is urgent to develop new methods to treat gastrointestinal digestive system carcinoma.

Transcription factor (TF) is an important component of transcription regulation. TF is a kind of DNA binding protein, which can bind to specific DNA sequence and then affect the genetic information from DNA transcription to RNA. In recent years, with the development of chip and high-throughput sequencing technology and the rapid development of bioinformatics, a large number of transcription factor databases have been generated, which are very important for gene transcriptional regulation and TF-related molecular biology research. With the in-depth study of the gene expression mechanism, studies have confirmed that the imbalance of TF is an important pathological basis for the occurrence of cancer (2). In this paper, TCGA databases (training cohorts) were used to search for transcription factors related to tumor prognosis and establish a prediction model. Then, GEO databases (validation cohorts) were used for validation. Our results suggest that transcription factors are important factors affecting the prognosis of cancer patients and may be potential targets for the treatment of digestive system cancer.

Materials and Methods

Data Download and Preprocessing

The TCGA-esophagus cancer dataset consisted of the RNA-seq data of 160 esophageal cancer tissue and 11 adjacent normal samples, and related clinical characteristics were downloaded from the TCGA database. The TCGA-gastric cancer dataset consisted of the RNA-seq data of 381gastric cancer tissue, and 32 adjacent normal samples and related clinical characteristics were downloaded from the TCGA database. The TCGA-colon cancer dataset consisted of the RNA-seq data of 482 colon cancer tissue, and 42 adjacent normal samples and related clinical characteristics were downloaded from the TCGA database. The TCGA-liver cancer dataset consisted of the RNA-seq data of 374 liver cancer tissue, and 50 adjacent normal samples and related clinical characteristics were downloaded from the TCGA database. The TCGA-pancreatic cancer dataset consisted of the RNA-seq data of 178 pancreatic cancer tissue, and four adjacent normal samples and related clinical characteristics were downloaded from the TCGA database. GSE53624 (esophagus cancer dataset), GSE84433 (gastric cancer dataset), GSE40967(colon cancer dataset), GSE10143 (liver cancer dataset), and GSE57495 (pancreatic cancer dataset), which contained the gene expression data with clinical characteristics, were downloaded from the GEO database (http://ncbi.nlm.nih.gov/geo/), respectively. The raw data were preprocessed with the following criteria: (1) the genes were excluded if the FPKM value (Fragments per Kilobase Million) was zero in more than half of the samples; (2) genes with missing expression values in more than 30% of samples were removed; (3) the invariant genes (i.e., same expression value across all samples) and low-variation genes were filtered; (4) samples without related clinical data or overall survival (OS) <30 days were removed. The TCGA datasets were enrolled as a training group, and the GEO datasets were regarded as the external validation cohorts. As the data were open-access, therefore, the ethical approval by an ethics committee was not required.

Differentially Expressed TFs Identification

DETFs identification: The R package of “DESeq2” was used to calculate the DEGs of the cancer and normal tissue [FDR (false discovery rate) <0.05 and |log2FC (fold change)| >1].

Statistical Analysis

R software (version 4.0.2) and SPSS software (version 22.0) were used to complete all the statistic work. OS was calculated by the KM method, and the differences between the groups were compared by using the log-rank test. Cox proportional hazard model was used to analyze the significant transcription factors affecting OS. P < 0.05 was considered statistically significant.

Risk Model Construction and Validation

The expression data of the transcription factors associated with prognosis in the training cohorts (TGCA datasets) were used in constructing a risk score model. External validation cohorts (GEO datasets) were then used to verify the reliability of the risk score model. The risk score of each sample was calculated based on formula. Then, the samples were divided into high-risk and low-risk groups by the risk score median. The risk score distribution was plotted by the R package of “time ROC (receiver operating characteristic)”. A log-rank test was used in comparing the survival difference between the two groups. The overall survival (OS) of each group was performed using the Kaplan–Meier (KM) survival curve.

Gene Set Enrichment Analysis

GSEA enrichment in the TCGA datasets were conducted for the analysis of the significantly enriched pathways in the transcription factors associated with prognosis. c2.cp.kegg.v7.2 symbols were selected for our analysis, which included the KEGG pathways database.

Sample Collection

Cancer tissues and adjacent tissues were collected from 150 patients. The study was approved by the Ethics Committee of Eastern Hepatobiliary Surgery Hospital. Informed consent was obtained from all patients before surgery for using their data in the research.

Quantitative Reverse Transcription Polymerase Chain Reaction

Cell total RNA was extracted using Trizol reagent (Invitrogen, USA) following the manufacturer’s instructions. The quantity and quality of extracted RNA were assessed by the spectrophotometric (Dojindo Laboratories, Kumamoto, Japan) determination of absorbance ratio (A260/A280). Then, the prepared RNA was reversely transcribed into cDNA using reverse transcriptase (Invitrogen, USA) and random primers. One microliter of synthesized cDNA was used in each qPCR reaction. SYBR Green-based qRT-PCR was subsequently executed on ABI PRISM 7300HT Sequence Detection System (Applied Biosystems, USA). β-Actin was used as a control for normalization. Primers used in RT-PCR were as Table 1.

TABLE 1
www.frontiersin.org

Table 1 Primer sequences of 17 TFs associated with digestive system tumor prognosis.

Bioinformatics Analysis

GO and KEGG functional enrichment analyses were conducted based on the target genes.

Results

Identified Differentially Expressed TFs Between Digestive System Carcinoma Tissues and Adjacent Normal Tissues

This study was conducted according to the flow chart shown in Figure 1. We found 99, 167, 146, 120, and 9 differential TFs in normal paracancer samples and tumor samples of esophageal cancer, gastric cancer, colon cancer, liver cancer, and pancreatic cancer, respectively, by analyzing the TCGA database. After intersection with the GEO database, we found 94, 167, 141, 52, and 9 common differential TFs in the normal paracancer samples and tumor samples of esophageal cancer, gastric cancer, colon cancer, liver cancer, and pancreatic cancer, respectively (Figures 2A, B and Supporting Figures S1A, B).

FIGURE 1
www.frontiersin.org

Figure 1 Whole procedure for analyzing TFs in digestive system carcinoma.

FIGURE 2
www.frontiersin.org

Figure 2 Differential TFs between cancer tissue and paracancer tissue. (A, B) Heatmap (A) and Volcano plot (B) of the differential TFs in the cancer tissue and paracancer tissue of TCGA database.

Identified OS-Related TFs in DETFs

In esophageal cancer, OS-related TFs NFIC, YBX2, and ZBTB47 were obtained by univariate and multivariate Cox regression analyses (Figure 3A). In gastric cancer, OS-related TFs ZNF367, CREB3L3, HEYL, and MYB were obtained by univariate Cox regression analysis. ZNF367, CREB3L3, and HEYL were obtained by multivariate Cox regression analysis (Figure 3B). In colon cancer, OS-related TFs FOXD1, TIGD1, SNAI1, and HSF4 were obtained by univariate and multivariate Cox regression analyses (Figure 3C). In liver cancer, OS-related TFs CENPA, HMGA1, ETS2, FOXO1 KLF9, AR, FOXM1, ETV4, MYBL2, ZIC2 were obtained by univariate Cox regression analysis. CENPA, ETS2, FOXM1, ETV4, MYBL2 were obtained by multivariate Cox regression analysis (Supporting Figure S2A). In pancreatic cancer, OS-related TFs SPDEF, FOXQ1, ZNF589 were obtained by univariate Cox regression analysis. FOXQ1 and ZNF589 were obtained by multivariate Cox regression analysis (Supporting Figure S2B).

FIGURE 3
www.frontiersin.org

Figure 3 The construction of TFs signature and the evaluation of its independent prognostic value. (A–C) Forest plot of the univariate Cox regression analysis with TFs in esophageal cancer, gastric cancer, and colon cancer. (D–I) Forest plots of the univariate (D–F) and multivariate (G–I) Cox regression analysis with clinical features and risk score in TCGA cancer cohorts.

A prognostic risk score for each patient was calculated based on the mRNA expression levels of the OS-related TFs and the coefficients from univariate and multivariate Cox regression analyses. We performed univariate and multivariate Cox regression analyses to evaluate the prognostic value of the risk score. The forest plot was utilized to show the clinical features such as age, gender, and the tumor TNM stage in the nomogram, as shown in Figures 3D–I and Supporting Figures S2C–E. As shown in Figures 3D–F and Supporting Figures S2C, D, the hazard ratio of risk score, which was performed by univariate Cox regression analysis, is around 1.687, 1.902, 1.809, 1.977, 1.921. As shown in Figures 3G–I, the risk score, performed by multivariate Cox regression analysis, is around 1.852, 2.050, 1.588. Our data analysis shows that risk score is an independent risk factor for the prognosis of esophageal cancer, gastric cancer, and colon cancer.

Construct TF-Related Prognostic Model of Digestive System Carcinoma

The patients were divided into a low-risk group and a high-risk group, according to the median value of the risk scores in the TCGA training cohorts. The gene-expression profiles of the prognostic risk genes between the high-risk group and low-risk group are displayed in the heatmap in Figures 4A–C and Supporting Figures S3A, B. The distribution of the risk scores and the correlation between the risk scores and survival data are illustrated in scatterplots (Figures 4D–F and Supporting Figures S3C, D). K-M survival analysis revealed a significantly higher survival probability in the low-risk group (p < 0.05) (Figures 4G–I and Supporting Figures S3E, F). Patients with high-risk scores were associated with significantly worse OS, thereby suggesting that the high-risk score was an adverse prognostic factor.

FIGURE 4
www.frontiersin.org

Figure 4 Heatmap, characteristics of the risk score, and Kaplan-Meier analysis of the OS-related TFs in training cohorts (TCGA database). (A–C) Heatmap of the gene-expression profiles of OS-related TFs in training cohorts: (A) esophageal cancer, (B) gastric cancer, (C) colon cancer. (D–F) The distributions of the risk score, survival time, and status of patients in training cohorts: (D) esophageal cancer, (E) gastric cancer, (F) colon cancer. (G–I) Kaplan-Meier curves of OS-related TFs in training cohorts: (G) esophageal cancer, (H) gastric cancer, (I) colon cancer.

Validate TF-Related Prognostic Model of Digestive System Carcinoma

The robustness of the risk model was further assessed with external datasets using the GEO datasets of GSE53624 (esophagus cancer data), GSE84433 (gastric cancer data), GSE40967 (colon cancer date), GSE10143 (liver cancer date), and GSE57495 (pancreatic cancer date), respectively. The gene-expression profiles of the validation cohorts are visualized in Figures 5A–C and Supporting Figures S4A, B. The distribution of the risk scores and the correlation between the risk scores and survival data of validation cohorts are illustrated in Figures 5D–F and Supporting Figures S4C, D. The K-M survival curves revealed a higher survival probability of the low-risk group (p <0.05) in validation cohorts (Figures 5G–I and Supporting Figures S4E, F). Similar results that high-risk score was related to worse OS were obtained in validation cohorts. The model had a relatively high distinguishing ability of prognosis and could identify the high-risk group patients with worse survival results in validation cohorts.

FIGURE 5
www.frontiersin.org

Figure 5 Heatmap, characteristics of the risk score, and Kaplan-Meier analysis of the OS-related TFs in validation cohorts (GEO database). (A–C) Heatmap of the gene-expression profiles of OS-related TFs in validation cohorts: (A) esophageal cancer, (B) gastric cancer, (C) colon cancer. (D–F) The distributions of the risk score, survival time, and status of patients in validation cohorts: (D) esophageal cancer, (E) gastric cancer, (F) colon cancer. (G–I) Kaplan-Meier curves of OS-related TFs in validation cohorts: (G) esophageal cancer, (H) gastric cancer, (I) colon cancer.

Exploration of Signaling Pathways

Gene set enrichment analysis (GSEA) has an advantage in exploring the involved signaling pathways. GSEA revealed that the genes in the high-risk group of TCGA cohorts were significantly enriched in selenoamino acid metabolism, peroxisome, histidine metabolism (esophageal cancer); neuroactive ligand receptor interaction, vascular smooth muscle contraction, melanogenesis (gastric cancer); glycosaminoglycan biosynthesis chondroitin sulfate, notch signaling pathway, ABC transporters (colon cancer). In contrast, the low-risk group genes were significantly enriched in pathways such as neuroactive ligand receptor interaction, regulation of actin cytoskeleton, focal adhesion (esophageal cancer); cell cycle, nucleotide excision repair, pyrimidine metabolism (gastric cancer); valine leucine and isoleucine degradation, peroxisome, fatty acid metabolism (colon cancer) (Figures 6A–C).The results of OS-related TFs enrichment analysis for liver cancer and pancreatic cancer are shown in Supporting Figures S5A, B and Table 2.

TABLE 2
www.frontiersin.org

Table 2 The result of KEGG enrichment analysis of high-risk group and low-risk group in liver cancer and pancreatic cancer.

FIGURE 6
www.frontiersin.org

Figure 6 Enrichment plot of the OS-related TFs between the high-risk and low-risk groups using GSEA. (A–C) The enriched gene sets in KEGG collection in esophagus cancer sample (A), gastric cancer sample (B), and colon cancer sample (C) with high risk score and low risk score.

Nomogram Construction

TCGA training cohort was used to screen the prognostic risk factors. The risk score, existing the inducement to digestive system carcinoma, age, gender, T stage, N stage, M stage were found to be risk factors for OS after the univariable and Cox regression analyses. Then, a nomogram model with a C-index value of 0.763 (95% CI=0.693–0.831) in esophageal cancer, 0.658 (95% CI=0.607–0.708) in gastric cancer, 0.785 (95% CI=0.729–0.843) in colon cancer, 0.738 (95% CI=0.683–0.793) in liver cancer, 0.644 (95% CI=0.545–0.742) in pancreatic cancer, containing the above clinical features and risk score was constructed, as shown in Figures 7A–C and Supporting Figures S6A, B.

FIGURE 7
www.frontiersin.org

Figure 7 Prognostic capacity evaluation and nomogram analysis of panel of OS-TFs of digestive system carcinoma patients. (A–C) Nomogram predicting the OS in digestive system carcinoma patients containing the risk score: (A) esophageal cancer, (B) gastric cancer, (C) colon cancer. (D–F) ROC curves and AUC for 1-, 3-, and 5-year survival of the nomogram: (D) esophageal cancer, (E) gastric cancer, (F) colon cancer. (G–I) Calibration curve of 3-year survival in the nomogram and ideal model: (G) esophageal cancer, (H) gastric cancer, (I) colon cancer.

Furthermore, the score for each patient was calculated according to the nomogram, and the prediction accuracy of the nomogram was assessed using the ROC curve. The AUCs of the nomogram model are shown in Figures 7D–F and Supporting Figures S6C, D. Figures 7G–I and Supporting Figures S6E, F show the calibration curve between the nomogram and the ideal model. Results show that the model was consistent with the ideal model, indicating that the accuracy of our model was relatively high.

Validated OS-Related TFs Between Digestive System Carcinoma Tissues and Adjacent Normal Tissues

To validate whether NFIC, ZBTB47, SNAI1 are lowly expressed and YBX2, ZNF367, CREB3L3, HEYL, FOXD1, TIGD1, HSF4 are highly expressed in cancer tissues, we experimentally validated OS-related TFs expression in the tissues of 30 esophageal cancer patients, 30 gastric cancer patients, and 30 colon cancer patients. The results of qRT-PCR suggested that NFIC, ZBTB47, SNAI1 are lowly expressed and YBX2, ZNF367, CREB3L3, HEYL, FOXD1, TIGD1, HSF4 are highly expressed in cancer tissues (Figures 8A–J). The expressions of liver cancer and pancreatic cancer OS-related TFs in cancer tissues and adjacent normal tissues are shown in Supporting Figures S7A–G.

FIGURE 8
www.frontiersin.org

Figure 8 The results of qRT-PCR in OS-related TFs. (A–C) YBX2 was highly expressed and NFIC and ZBTB47 were lowly expressed in esophageal cancer tissues. (D–F) ZNF367, CREB3L3, and HEYL were highly expressed in gastric cancer tissues. (G–J) FOXD1, TIGD1, and HSF4 were highly expressed and SNAI1 was lowly expressed in colon cancer tissues. ***p < 0.001.

Functional Enrichment Analysis of the Target Genes of OS-Related TFs

The target genes of OS-related TFs were predicted using the CHEA, Encode, Jaspar Motifmap, Transfac, Trusting transcription factor databases. The results are shown in Supplementary Table 1. The GO and KEGG enrichment analysis of the target genes was performed. The results showed that the target genes of OS-related TFs were mainly enriched in rRNA transcription, transcription by RNA polymerase III, ncRNA transcription (esophageal cancer); prostatic bud formation, regulation of epithelial cell proliferation involved in prostate gland development, cellular response to testosterone stimulus (gastric cancer); positive regulation of cell adhesion, regulation of cell−cell adhesion, positive regulation of establishment of protein localization (colon cancer) (Figures 9A–C) and pathways related to the herpes simplex virus 1 infection, amyotrophic lateral sclerosis, salmonella infection (esophageal cancer); oocyte meiosis (gastric cancer); PI3K−Akt signaling pathway, human papillomavirus infection, cytokine−cytokine receptor interaction (colon cancer) (Figures 9D–F). The results of target genes of OS-related TFs enrichment analysis for liver cancer and pancreatic cancer are shown in Supporting Figures S8A–D.

FIGURE 9
www.frontiersin.org

Figure 9 Functional enrichment analysis of the target genes of OS-related TFs. (A–C) GO enrichment analysis results, showing only the first 20 terms in esophagus cancer (A), gastric cancer (B), and colon cancer (C). (D–F) KEGG pathway enrichment analysis results, showing only the first 20 pathways in esophagus cancer (D), gastric cancer, (E), and colon cancer (F). Gene Ratio refers to the ratio of the number of genes enriched in the term/pathway to the total number of genes in the term/pathway.

Discussion

The prognosis of digestive system carcinoma patients usually is not good, because the majority of digestive system carcinoma patients have been in an advanced stage when they are diagnosed. Even if advanced digestive system carcinoma patients receive chemotherapy or molecular-targeted drug therapy, the effect is usually poor. With the development of tumor molecular biology, prognostic markers that reflect tumor progression at the molecular level may help to achieve more accurate individualized survival prediction and treatment (3). Recently, molecular prognostic markers have drawn more and more attention of researchers in the survival prediction of digestive system carcinoma (4). We can use them to dynamically detect disease progression or changes in the prognosis of tumor patients. Moreover, compared with a single one marker, a panel of molecular markers will significantly increase the accuracy in reflecting the prognosis of digestive system carcinoma (5).

In the human genome, there are more than 35,000 genes expressed differently in different tissues and at different times. TF plays an important role in the regulation of gene expression and is also a target of signal transduction pathway. More and more researchers begin to study TFs. They think that TFs may be the target of diagnosis and treatment of human diseases. The combination of bioinformatics and molecular biology, that is, the combination of “dry” and “wet” experiments, has become the necessary means to solve various biological problems and has made good progress. TF-related database plays an important role in life science research, which is also the fundamental reason for the frequent development and use of TF-related database in recent years (6).

The results of our study indicate that NFIC, YBX2, ZBTB47 were closely related to the prognosis of esophageal cancer. H. Wang illustrated that the importance of miR-550a-3/NFIC in the regulation of esophageal squamous cell cancer cells growth and metastasis, which could contribute to developing novel targets for early diagnosis or neoteric therapeutic target for esophageal squamous cell cancer (7). G. Xu found that LBX2-AS1 upregulated by NFIC promoted gastric cancer progression via targeting miR-491-5p/ZNF703 (8). F. Chen’s research indicated that LINC00958 regulated miR-627-5p/YBX2 axis to facilitate cell proliferation and migration in oral squamous cell carcinoma (9). X. Niu found that LncRNA HOXA11-AS promotes oral squamous cell carcinoma progression by sponging miR-98-5p to upregulate YBX2 expression (10). M. Tan found that the expression of ZBTB47 was different in familial pancreatic cancer predisposed individuals, sporadic pancreatic cancer patients, and normal donor pancreatic tissue (11).

The results of our study indicate that ZNF367, CREB3L3, HEYL were closely related to the prognosis of gastric cancer. Several members of the zinc finger protein family have been recently shown to have a role in cancer initiation and progression. Jain showed that ZNF367 is overexpressed in adrenocortical carcinoma, malignant pheochromocytoma, paraganglioma, and thyroid cancer as compared to normal tissue and benign tumors. H. Zeng’s study indicated that ZNF367-induced transcriptional activation of KIF15 accelerates the progression of breast cancer (12), and X. Wu found that ZNF 367 promotes metastasis by inhibiting the Hippo pathway in breast cancer (13). However, some studies have shown that ZNF367 plays a tumor-suppressive role in tumors. The research results show that ZNF367 inhibits cellular proliferation, invasion, migration, and adhesion to extracellular proteins in vitro and in vivo (14). Dewaele illustrated that EWSR1-CREB3L3 gene fusion is related to a mesenteric sclerosing epithelioid fibrosarcoma (15). H. Liu et al. not only identified the close relationship between HEYL and tumor microenvironment phenotype but also emphasized the crucial importance of HEYL, which could be identified as a candidate biomarker to evaluate prognostic risk and therapeutic effect in gastric cancer (16).

The results of our study indicate that FOXD1, TIGD1, SNAI1, HSF4 were closely related to the prognosis of colon cancer. P. Quintero-Ronderos showed that molecular, structural, and functional aspects of FOXD1 are presented in light of physiological and pathogenic conditions, including its role in human disease etiology, such as cancer and recurrent pregnancy loss (17). L. Yin illustrated that TIGD1, a gene of unknown function, involves cell-cycle progression and correlates with poor prognosis in human cancer (18). D. Li’s analysis revealed that SNAI1 mRNA expression may potentially be a negative prognostic factor in breast cancer (19). J. Qi’s research indicated SNAI1 promotes the development of hepatocellular carcinoma (HCC) through the enhancement of proliferation and inhibition of apoptosis (20). Y. Yang found that high HSF4 expression is an independent indicator of poor overall survival and recurrence-free survival in patients with primary colorectal cancer (21).

The results of our study indicate that CENPA, ETS2, MYBL2, FOXM1, ETV4 were closely related to the prognosis of HCC. Y. Zhang’s study revealed that CENPA mRNA were upregulated in HCC patients with alpha-fetoprotein (AFP) elevation, advanced TNM stage, larger tumor size, advanced AJCC stage, advanced pathology grade, and vascular invasion. A Cox regression model including CENPA could predict OS in HCC patients effectively. CENPA might be an oncogenic factor in the development of HCC patients (22). According to literature reports, ETS2 can not only promote the development of tumors but also inhibit the development of tumors. M. Kabbout’s findings pointed to a tumor suppressor role for ETS2 in human non-small-cell lung cancer (NSCLC) pathogenesis through inhibition of the MET proto-oncogene (23). Y. L. Liao’s data indicated that ETS2 plays a key role in controlling the expression of miR-196b, and miR-196b may mediate the tumor suppressor effects of ETS2 (24). But X. Liu opined that ETS2 functions as an oncogene and plays a key role in the progression of hypopharyngeal cancer (25). L. Y. found that ETS2 knockdown inhibits tumorigenesis in esophageal squamous cell carcinoma in vivo and in vitro (26). G. W. Zhang revealed that downregulation of ETS2 inhibits the invasion and metastasis of renal cell carcinoma cells by inducing EMT via the PI3K/Akt signaling pathway (27). J. Zhu illustrated that MicroRNA-146b overexpression promotes human bladder cancer invasion via enhancing ETS2-mediated mmp2 mRNA transcription (28). J. Munera revealed that ETS2 could regulate colonic stem cells and sensitivity to tumorigenesis (29). Z. Guan found that high MYBL2 expression and transcription regulatory activity is associated with poor overall survival in patients with HCC (30). J. Dai’s research indicated that overexpression of FOXM1 was to the disadvantage of the prognosis for majority of solid tumor and therefore can be used as an evaluation index of prognosis (31). E. Kim revealed that capicua inhibited the progression of HCC by controlling the ETV4-MMP1 axis (32). Q. X.’s study indicated that PBK overexpression promotes metastasis of HCC via activating ETV4-uPAR signaling pathway (33).

The results of our study indicate that FOXQ1 and ZNF589 were closely related to the prognosis of pancreatic cancer. Forkhead Box Q1 (FOXQ1) is a member of the Forkhead Box protein family, which is a transcription factor with the function of regulating cell differentiation. In recent years, more and more studies have shown that FOXQ1 is significantly correlated with the pathogenesis of tumors. In many studies, upregulation of FOXQ1 expression has been reported in breast, colorectal, pancreatic, bladder, and ovarian cancers, as well as glioma, among other tumor types (34). ZNF589 is rarely reported in tumors. Oleksiewicz revealed that TRIM28 employs KRAB-ZNFs to evoke epigenetic silencing of its target differentiation genes via H3K9me3 and DNA methylation (35).

In summary, we analyzed the prognostic TFs in digestive system carcinoma based on the TCGA database, and then established a prognostic prediction model, which was verified successfully using the GEO database. TFs may affect the clinical prognosis of patients through different regulatory mechanisms. The above-mentioned literature suggests that proper regulation of TFs can benefit patients with digestive system carcinoma. Prognostic TFs in risk score models may provide new ideas for exploring therapeutic targets for digestive system carcinoma. Identification of appropriate TFs as therapeutic targets with further verification to ensure the clinical efficacy and safety on digestive system carcinoma patients would be a promising strategy in future studies. However, the molecular mechanism of TFs and the role of related signal transduction pathways in digestive system carcinoma remain unclear, which requires further research and exploration (6).

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: TCGA database (https://portal.gdc.cancer.gov/), GEO database (http://ncbi.nlm.nih.gov/geo/).

Author Contributions

Study concept and design: GF, JF, ZD, YZ, JL. Acquisition, analysis, or interpretation of data: All authors. Statistical analysis: GF. Establishment of the model: GF. Drafting of the manuscript: GF. Critical revision of the manuscript for important intellectual content: All authors. Administrative, technical, or material support: YZ, JL. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by Jieping Wu Medical Foundation (LDWJPMF-102-17007), Medical Innovation Project of Fujian Province (2018-CX-49), and Specific Foundation of Development and Reform Commission in Fujian Province (31010308).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors acknowledge the members of the Department of Hepatopancreatobiliary Surgery, Mengchao Hepatobiliary Hospital of Fujian Medical University, especially Jun Fu, Kongying Lin, Qizhen Huang for excellent technical assistance.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2021.670129/full#supplementary-material

Supplementary Figure 1 | Differential TFs between cancer tissue and paracancer tissue. (A, B) Heatmap (A) and Volcano plot (B) of the differential TFs in cancer tissue and paracancer tissue of TCGA database.

Supplementary Figure 2 | The construction of TFs signature and the evaluation of its independent prognostic value. (A, B) Forest plot of the univariate Cox regression analysis with TFs in liver cancer and pancreatic cancer. (C, D) Forest plots of the univariate Cox regression analysis with clinical features and risk score in TCGA cancer cohorts.

Supplementary Figure 3 | Heatmap, characteristics of the risk score, and Kaplan-Meier analysis of the OS-related TFs in training cohorts (TCGA database). (A, B) Heatmap of the gene-expression profiles of OS-related TFs in training cohorts: (A) liver cancer, (B) pancreatic cancer. (C, D) The distributions of the risk score, survival time, and status of patients in training cohorts: (C) liver cancer, (D) pancreatic cancer. (E, F) Kaplan-Meier curves of OS-related TFs in training cohorts: (E) liver cancer, (F) pancreatic cancer.

Supplementary Figure 4 | Heatmap, characteristics of the risk score, and Kaplan-Meier analysis of the OS-related TFs in validation cohorts (GEO database). (A, B) Heatmap of the gene-expression profiles of OS-related TFs in validation cohorts: (A) liver cancer, (B) pancreatic cancer. (C, D) The distributions of the risk score, survival time, and status of patients in validation cohorts: (C) liver cancer, (D) pancreatic cancer. (E, F) Kaplan-Meier curves of OS-related TFs in validation cohorts: (E) liver cancer, (F) pancreatic cancer.

Supplementary Figure 5 | Enrichment plot of the OS-related TFs between the high-risk and low-risk groups using GSEA. (A, B) The enriched gene sets in KEGG collection in liver cancer sample (A) and pancreatic cancer sample (B) with high risk score and low risk score.

Supplementary Figure 6 | Prognostic capacity evaluation and nomogram analysis of panel of OS-TFs of digestive system carcinoma patients. (A, B) Nomogram predicting the OS in digestive system carcinoma patients containing the risk score: (A) liver cancer, (B) pancreatic cancer. (C) ROC curves and AUC for 1-, 3, and 5-year survival of the nomogram in liver cancer. (D) ROC curves and AUC for 1-, 2, and 3-year survival of the nomogram in pancreatic cancer. (E, F) Calibration curve of 3-year survival in the nomogram and ideal model: (E) liver cancer, (F) pancreatic cancer.

Supplementary Figure 7 | The results of qRT-PCR in OS-related TFs. (A–E) CENPA, MYBL2, FOXM1, and ETV4 were highly expressed and ETS2 were lowly expressed in liver cancer tissues. (F, G) FOXQ1 and ZNF589 were highly expressed in pancreatic cancer tissues.

Supplementary Figure 8 | Functional enrichment analysis of the target genes of OS-related TFs. (A, B) GO enrichment analysis results, showing only the first 20 terms in liver cancer (A) and pancreatic cancer (B). (C, D) KEGG pathway enrichment analysis results, showing only the first 20 pathways in liver cancer (C) and pancreatic cancer (D). Gene Ratio refers to the ratio of the number of genes enriched in the term/pathway to the total number of genes in the term/pathway.

Abbreviations

TFs, transcription factors; TCGA, the cancer genome atlas; GEO, gene expression omnibus; FPKM, fragments per kilobase million; DEG, differential expression gene; FDR, false discovery rate; OS, overall survival; ROC, receiver operating characteristic; KM, Kaplan–Meier; GSEA, gene set enrichment analysis; RT-PCR, reverse transcription polymerase chain reaction; GO, gene ontology; KEGG, Kyoto encyclopedia of genes and genomes; HR, hazard ratio; 95% CI, 95% confidence interval; AUC, area under the curve; HCC, hepatocellular carcinoma.

References

1. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA Cancer J Clin (2021) 71:7–33. doi: 10.3322/caac.21654

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Zhou TH, Su JZ, Qin R, Chen X, Ju GD, Miao S. Prognostic and Predictive Value of a 15 Transcription Factors (TFs) Panel for Hepatocellular Carcinoma. Cancer Manag Res (2020) 12:12349–61. doi: 10.2147/CMAR.S279194

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Minguez B, Lachenmayer A. Diagnostic and Prognostic Molecular Markers in Hepatocellular Carcinoma. Dis Markers (2011) 31:181–90. doi: 10.1155/2011/310675

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Chen K, Liu XH, Wang FR, Liu HP, Huang ZP, Chen X. The Prognostic Value of Decreased NDRG1 Expression in Patients With Digestive System Cancers: A Meta-Analysis. Med (Baltimore) (2018) 97:e12455. doi: 10.1097/MD.0000000000012455

CrossRef Full Text | Google Scholar

5. Chen Q, Li F, Gao Y, Xu G, Liang L, Xu J. Identification of Energy Metabolism Genes for the Prediction of Survival in Hepatocellular Carcinoma. Front Oncol (2020) 10:1210. doi: 10.3389/fonc.2020.01210

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Xu H, Liu L, Li W, Zou D, Yu J, Wang L, et al. Transcription Factors in Colorectal Cancer: Molecular Mechanism and Therapeutic Implications. Oncogene (2020) 40:1555–69. doi: 10.1038/s41388-020-01587-3

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Wang H, Shi X, Wu S. miR-550a-3/NFIC Plays a Driving Role in Esophageal Squamous Cell Cancer Cells Proliferation and Metastasis Partly Through EMT Process. Mol Cell Biochem (2020) 472:115–23. doi: 10.1007/s11010-020-03790-y

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Xu G, Zhang Y, Li N, Wu Y, Zhang J, Xu R, et al. LBX2-AS1 Up-Regulated by NFIC Boosts Cell Proliferation, Migration and Invasion in Gastric Cancer Through Targeting miR-491-5p/ZNF703. Cancer Cell Int (2020) 20:136. doi: 10.1186/s12935-020-01207-w

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Chen F, Liu M, Yu Y, Sun Y, Li J, Hu W, et al. LINC00958 Regulated miR-627-5p/YBX2 Axis to Facilitate Cell Proliferation and Migration in Oral Squamous Cell Carcinoma. Cancer Biol Ther (2019) 20:1270–80. doi: 10.1080/15384047.2019.1617571

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Niu X, Yang B, Liu F, Fang Q. LncRNA HOXA11-AS Promotes OSCC Progression by Sponging miR-98-5p to Upregulate YBX2 Expression. BioMed Pharmacother (2020) 121:109623. doi: 10.1016/j.biopha.2019.109623

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Tan M, Schaffalitzky de Muckadell OB, Joergensen MT. Gene Expression Network Analysis of Precursor Lesions in Familial Pancreatic Cancer. J Pancreat Cancer (2020) 6:73–84. doi: 10.1089/pancan.2020.0007

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zeng H, Li T, Zhai D, Bi J, Kuang X, Lu S, et al. ZNF367-Induced Transcriptional Activation of KIF15 Accelerates the Progression of Breast Cancer. Int J Biol Sci (2020) 16:2084–93. doi: 10.7150/ijbs.44204

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wu X, Zhang X, Yu L, Zhang C, Ye L, Ren D, et al. Zinc Finger Protein 367 Promotes Metastasis by Inhibiting the Hippo Pathway in Breast Cancer. Oncogene (2020) 39:2568–82. doi: 10.1038/s41388-020-1166-y

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Jain M, Zhang L, Boufraqech M, Liu-Chittenden Y, Bussey K, Demeure MJ, et al. ZNF367 Inhibits Cancer Progression and Is Targeted by miR-195. PLoS One (2014) 9:e101423. doi: 10.1371/journal.pone.0101423

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Dong F, Quade BJ, Dal Cin P, Jo VY. Expanding the Spectrum of Translocations in Sclerosing Epitheloid Fibrosarcoma: A New Case With EWSR1-CREB3L3 Fusion. Genes Chromosomes Cancer (2018) 57:675–7. doi: 10.1002/gcc.22677

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Liu H, Ni S, Wang H, Zhang Q, Weng W. Charactering Tumor Microenvironment Reveals Stromal-Related Transcription Factors Promote Tumor Carcinogenesis in Gastric Cancer. Cancer Med (2020) 9:5247–57. doi: 10.1002/cam4.3133

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Quintero-Ronderos P, Laissue P. The Multisystemic Functions of FOXD1 in Development and Disease. J Mol Med (Berl) (2018) 96:725–39. doi: 10.1007/s00109-018-1665-2

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Yin L, Yan J, Wang Y, Sun Q. TIGD1, a Gene of Unknown Function, Involves Cell-Cycle Progression and Correlates With Poor Prognosis in Human Cancer. J Cell Biochem (2019) 120:9758–67. doi: 10.1002/jcb.28256

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Li D, Li L, Yang W, Chen L, Chen X, Wang Q, et al. Prognostic Values of SNAI Family Members in Breast Cancer Patients. Ann Transl Med (2020) 8:922. doi: 10.21037/atm-20-681

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Qi J, Li T, Bian H, Li F, Ju Y, Gao S, et al. SNAI1 Promotes the Development of HCC Through the Enhancement of Proliferation and Inhibition of Apoptosis. FEBS Open Bio (2016) 6:326–37. doi: 10.1002/2211-5463.12043

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Yang Y, Jin L, Zhang J, Wang J, Zhao X, Wu G, et al. High HSF4 Expression Is an Independent Indicator of Poor Overall Survival and Recurrence Free Survival in Patients With Primary Colorectal Cancer. IUBMB Life (2017) 69:956–61. doi: 10.1002/iub.1692

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Zhang Y, Yang L, Shi J, Lu Y, Chen X, Yang Z. The Oncogenic Role of CENPA in Hepatocellular Carcinoma Development: Evidence From Bioinformatic Analysis. BioMed Res Int (2020) 2020:3040839. doi: 10.1155/2020/3040839

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Kabbout M, Garcia MM, Fujimoto J, Liu DD, Woods D, Chow CW, et al. ETS2 Mediated Tumor Suppressive Function and MET Oncogene Inhibition in Human Non-Small Cell Lung Cancer. Clin Cancer Res (2013) 19:3383–95. doi: 10.1158/1078-0432.CCR-13-0341

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Liao YL, Hu LY, Tsai KW, Wu CW, Chan WC, Li SC, et al. Transcriptional Regulation of miR-196b by ETS2 in Gastric Cancer Cells. Carcinogenesis (2012) 33:760–9. doi: 10.1093/carcin/bgs023

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Liu X, Zhang C, Zhang Z, Zhang Z, Ji W, Cao S, et al. E26 Transformation-Specific Transcription Factor ETS2 as an Oncogene Promotes the Progression of Hypopharyngeal Cancer. Cancer Biother Radiopharm (2017) 32:327–34. doi: 10.1089/cbr.2017.2296

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yang L, Li Q, Han K, Zhu L, Zhang Y, Ma S, et al. Ets2 Knockdown Inhibits Tumorigenesis in Esophageal Squamous Cell Carcinoma In Vivo and In Vitro. Oncotarget (2016) 20:61458–68. doi: 10.18632/oncotarget.11369

CrossRef Full Text | Google Scholar

27. Zhang GW, Tian X, Li Y, Wang ZQ, Li XD, Zhu CY. Down-Regulation of ETS2 Inhibits the Invasion and Metastasis of Renal Cell Carcinoma Cells by Inducing EMT via the PI3K/Akt Signaling Pathway. BioMed Pharmacother (2018) 104:119–26. doi: 10.1016/j.biopha.2018.05.029

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Zhu J, Xu C, Ruan L, Wu J, Li Y, Zhang X. MicroRNA-146b Overexpression Promotes Human Bladder Cancer Invasion via Enhancing ETS2-Mediated Mmp2 mRNA Transcription. Mol Ther Nucleic Acids (2019) 16:531–42. doi: 10.1016/j.omtn.2019.04.007

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Munera J, Cecena G, Jedlicka P, Wankell M, Oshima RG. Ets2 Regulates Colonic Stem Cells and Sensitivity to Tumorigenesis. Stem Cells (2011) 29:430–9. doi: 10.1002/stem.599

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Guan Z, Cheng W, Huang D, Wei A. High MYBL2 Expression and Transcription Regulatory Activity Is Associated With Poor Overall Survival in Patients With Hepatocellular Carcinoma. Curr Res Transl Med (2018) 66:27–32. doi: 10.1016/j.retram.2017.11.002

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Dai J, Yang L, Wang J, Xiao Y, Ruan Q. Prognostic Value of FOXM1 in Patients With Malignant Solid Tumor: A Meta-Analysis and System Review. Dis Markers (2015) 2015:1–10. doi: 10.1155/2015/352478

CrossRef Full Text | Google Scholar

32. Kim E, Kim D, Lee JS, Yoe J, Park J, Kim CJ, et al. Capicua Suppresses Hepatocellular Carcinoma Progression by Controlling the ETV4-MMP1 Axis. Hepatology (2018) 67:2287–301. doi: 10.1002/hep.29738

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Yang QX, Zhong S, He L, Jia XJ, Tang H, Cheng ST, et al. PBK Overexpression Promotes Metastasis of Hepatocellular Carcinoma via Activating ETV4-uPAR Signaling Pathway. Cancer Lett (2019) 452:90–102. doi: 10.1016/j.canlet.2019.03.028

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Zhang Y, Li Y, Yao Z, Li S, Yin Z, Xu M. Forkhead Box Q1: A Key Player in the Pathogenesis of Tumors (Review). Int J Oncol (2016) 49:51–8. doi: 10.3892/ijo.2016.3517

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Oleksiewicz U, Gladych M, Raman AT, Heyn H, Mereu E, Chlebanowska P, et al. TRIM28 and Interacting KRAB-ZNFs Control Self-Renewal of Human Pluripotent Stem Cells Through Epigenetic Repression of Pro-Differentiation Genes. Stem Cell Rep (2017) 9:2065–80. doi: 10.1016/j.stemcr.2017.10.031

CrossRef Full Text | Google Scholar

Keywords: digestive system carcinoma, prognosis, transcription factor, overall survival, bioinformatics

Citation: Fang G, Fan J, Ding Z, Li R, Lin K, Fu J, Huang Q, Zeng Y and Liu J (2021) Prognostic and Predictive Value of Transcription Factors Panel for Digestive System Carcinoma. Front. Oncol. 11:670129. doi: 10.3389/fonc.2021.670129

Received: 20 February 2021; Accepted: 24 September 2021;
Published: 21 October 2021.

Edited by:

Raluca Ioana Stefan-van Staden, National Institute of Research and Development for Electrochemistry and Condensed Matter (INCEMC), Romania

Reviewed by:

Nageswara Rao Reddy Neelapu, Gandhi Institute of Technology and Management (GITAM), India
Li Zhang, University of Minnesota Twin Cities, United States

Copyright © 2021 Fang, Fan, Ding, Li, Lin, Fu, Huang, Zeng and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yongyi Zeng, bGFtcDE5NzMxMUAxMjYuY29t; Jingfeng Liu, ZHJqaW5nZmVuZ0AxMjYuY29t

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.