Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 09 October 2023
Sec. Computational Genomics

Transcription factors direct epigenetic reprogramming at specific loci in human cancers

  • Department of General Surgery, Nanfang Hospital, The First School of Clinical Medicine, Southern Medical University, Guangzhou, Guangdong, China

The characterization of epigenetic changes during cancer development and progression led to notable insights regarding the roles of cancer-specific epigenetic reprogramming. Recent studies showed that transcription factors (TFs) are capable to regulate epigenetic reprogramming at specific loci in different cancer types through their DNA-binding activities. However, the causal association of dynamic histone modification change mediated by TFs is still not well elucidated. Here we evaluated the impacts of 636 transcription factor binding activities on histone modification in 24 cancer types. We performed Instrumental Variables analysis by using genetic lesions of TFs as our instrumental proxies, which previously discovered to be associated with histone mark activities. As a result, we showed a total of 6 EpiTFs as strong directors of epigenetic reprogramming of histone modification in cancers, which alters the molecular and clinical phenotypes of cancer. Together our findings highlight a causal mechanism driven by the TFs and genome-wide histone modification, which is relevant to multiple status of oncogenesis.

Introduction

Epigenetic reprogramming refers to the alteration of dynamic epigenetic marks (like histone modifications, DNA methylation, chromatin remodeling and some non-coding RNA molecules) on post-translational levels at various promoter or enhancer elements. Above all, modifications of histone tails are essential to define distinct gene expression states and other chromatin-based processes both actively (H3K4me3, H3K4me1 and H3K27ac, etc.) and silently (e.g., H3K9me3 and H3K27me3). Mounting evidence suggests that epigenetic modification plays a role in the diverse clinical behavior of cancers ranging from slow-growing to aggressive tumors, and thus contributes to the progression of cancers (Wang et al., 2019; Baksh et al., 2020; Chi et al., 2020; Li et al., 2021). Although various types of cancer exhibit common phenotypic characteristics like uncontrolled growth and apoptosis resistance, the epigenetic changes leading to these features can differ significantly among different cancers, resulting in a considerable landscape of heterogeneity. By characterizing epigenetic modification during cancer development and progression, researchers gained notable insights into cancer-specific epigenetic reprogramming.

Epigenetic modification is restricted locally and globally with varying substrate specificities created hierarchical modification patterns from individual promoters to the entire chromosomes. Such epigenetic modification patterns are induced by a set of regulatory enzymes, many of which are known oncogenes (Kurdistani, 2007). In fact, epigenetic modification enzymes are specific - but not loci-specific - to certain chemical groups. Hence the epigenetic reprogramming in cancer shows some general tendencies (McClellan et al., 2023). For example, tumor genome tends to be demethylated, and hypomethylation occurs only in specific genes. Nevertheless, more studies demonstrate loci-specific epigenetic modifications in cancer. The landscape of tumor epigenetic modification can result from complex determinations such as germline variation, somatic evolution, immune check-point and treatment.

Notably, recent studies report that transcription factors (TFs) are involved in the regulation of epigenetic reprogramming by introducing epigenetic modifications at specific loci in different cancer types, which provides evidence for the formation of specific epigenetic reprogramming signatures (Kant et al., 2022). AR-co-factors (e.g., FOXA1 and HOXB13) bring about H3K4me3/H3K27me3 bivalent marks at neural lineage–associated genes in Myc-driven advanced prostate cancer (Berger et al., 2019). IGFBP4 results in an AKT-EZH2 reciprocal loop to drive H3K27me3-mediated epigenetic reprogramming in hepatocellular carcinoma (Lee et al., 2018). Still, the current knowledge of TFs as general regulators of epigenetic reprogramming is limited in certain genes and cancer types. It remains unclear how many TFs are involved in epigenetic reprogramming and how these TFs impact cancer epigenomes. To inform future functional study in cancer epigenetics, we systematically evaluated the impacts of 636 transcription factor binding activities on histone modification in 90 cell lines of 24 cancer types. We used mutations of TFs as instrumental proxies and regressed histone modification to the binding activity of TF. To this end, we identified 10 TFs-Histone markers pairs which strongly regulated the epigenome in cancers. Cooperative interaction among regulatory factors is often required to achieve precise regulation. We further focused on the co-binding localization at specific regulatory elements within the genome to achieve accurate regulation of target genes to investigate the impacts on the molecular and clinical phenotypes of cancer. Our findings suggested a causal mechanism for epigenetic reprogramming in cancer driven by the TFs, which helps better understand the process of tumorigenesis and treatment.

Materials and methods

Data collection

We collected ChIP-seq data processed from the Encyclopedia of DNA Elements Project (ENCODE, http://encodeproject.org) (Moore et al., 2020) for the human genome (hg38) including seven histone markers (H3K4me3, H3K36me3, H3K4me1, H3K27ac, H3K79me2, H3K9ac, H4K20me1) and 759 TFs. The MACS2 algorithm was used to identify the influence of genome complexity to evaluate the significance of enriched ChIP regions with narrow peaks mode (TFs) and broad peaks mode (histone modifications). We defined the binding activities of TFs and histone modifiers as normalized signal enrichment of genomic region peaks. For each cell line, more than three histone markers were candidate. For each epigenome, at least two replicates of the input experiment were candidate. After filtration, 4,206 paired BED4+6 files from 636 TFs to 7 histone markers involved 24 cancer types were included in the next analysis.

We got the somatic variants and expression information from the Cancer Cell Line Encyclopedia (CCLE, https://sites.broadinstitute.org/ccle/, v19q2) (Nusinow et al., 2020). We defined the positive mutation status as that non-silent somatic mutation in the protein coding region of a gene, or any mutation identified in a non-coding gene, including nonsense, missense, frameshif indels, splice site mutations, stop codon readthroughs, change of start codon, in-frame indels.

We downloaded The CERES scores of 320 cancer cell lines from The Cancer Dependency Map Project at Broad Institute (DepMap, https://depmap.org, v19Q2) (Barretina et al., 2019). The CERES score is an algorithm used to calculate the degree of gene dependency based on results from CRISPR-Cas9 essentiality screens. The score took into consideration the impact of gene copy number variations on the results. (Meyers et al., 2017). Data on the sensitivity of 305 drugs, measured by IC50, across 988 cancer cell lines, was obtained from the Genomics of Drug Sensitivity in Cancer database (GDSC; https://www.cancerrxgene.org).

Generalized linear regression analyses suggest candidate TFs of epigenetic reprogramming

To estimate the linear effects of each TF on the epigenetic reprogramming, we performed Ordinary Least Squares (OLS) regression in order to infer the correlation between TFs binding activities and Histone marks binding activities in pan-cancer level. The regression coefficients of β represented the effect sizes of the binding activities of TFs. The regression coefficients of α represented the effect sizes of other covariates.

HiskiβijTFij*Ci+αiEiTi+εijk

Here, εijk∼ N (0, σ2) was a Gaussian error; Hiski referred to the binding activities of the kth Histone marks of ith cell lines; TFij referred to the binding activities of jth TFs of ith cell lines. Ei, Ci and Ti was the expression of TFs, classification of Histone marks (we set “activated” and “silent”) and the cancer types respectively. The regression coefficient αi was a 1*2 vector responded to two covariates of Ei and Ti. We corrected the multiple p values of the regression coefficients through “Benjamini–Hochberg” method, and we got the FDR for each TF. The significant TFs (FDR <0.1, lower than 10% of the false-positive rate associated with a p-value of 0.05 for each hypothesis testing) were candidate for the next analysis.

Instrumental variable regression analysis

Instrumental variable analysis was used to figure out the driver TFs by performing the R package ivpack(v1.2), of which, their binding activities at the genetic loci resulted in the epigenetic reprogramming. Briefly, the dependent variable was the binding activities of histone marks (His), and the independent variable was the binding activities of TFs that was significantly associated with His, and the genetic instruments were the binary mutation status (wild type or mutated) of the TFs and the covariates above (the expression of TFs and the cancer types respectively).

Hiskiβ0+β1TFij+Ei+Ti|Mutij+Ei+Ti+εijk

Here, εijk∼ N (0, σ2) was a Gaussian error; Hiski referred to the kth Histone marks of ith cell lines; TFij referred to the binding activities of jth TFs of ith cell lines; Mutij referred to the mutated status of jth TFs of ith cell lines. Ei, and Ti was the expression of TFs and the cancer types respectively. The IV models were estimated by Two-Stage least squares (2SLS). We corrected the multiple p values of β1 through “Benjamini–Hochberg” method and got the FDR for each TF to evaluate the significance of the independent variables. To figure out the significance of the instrumental variables, we recalculated the p values for the weak instruments test using the Kleibergen-Paap rank Wald F-statistic and estimated the false discovery rate. (FDRweak) The significant TFs were considered as “driver TFs”. (FDR <0.1 and FDRweak <0.1).

Identify the drug-related activities of EpiTFs in cancer cell lines

To evaluate the effects of the activities of EpiTFs on the drug sensitivity to therapy, we used a linear model to regress the mean mRNA expression of co-regulated genes for each TFs-HMs pair and the IC50 of drugs. According to the targets of drugs, we divided the drugs into different classes. Thus, we could calculate the OR values associated with different pathways.

IC50ikβ0+β1mRNAij+Ci+εijk

Here, εijk∼ N (0, σ2) was a Gaussian error; IC50ik referred to the kth drugs of ith cell lines; mRNAij referred to the mean mRNA expression of co-regulated genes of jth TFs in ith cell lines.

Gene sets enrichment analysis

To assess the overrepresentation of the target gene for each EpiTFs in established cancer gene sets, we used the R package GeneOverlap(v1.36.0) to conduct Fisher’s exact test. For the pathway enrichment analysis, we adopted the R package clusterProfiler(v4.8.1) to perform hypergeometric test in the target gene for each EpiTFs. The reference was hallmark gene sets (https://www.gseamsigdb.org/gsea/msigdb/collections.jsp).

Survival analysis

A total of 3,295 EpiTFs related TCGA patients, including Prostate Cancer (PRAD, N = 494), Colon Cancer (COAD, N = 287), Breast Cancer (BRCA, N = 789), Lung Adenocarcinoma (LUAD, N = 509), Pancreatic Cancer (PAAD, N = 170), Liver Cancer (LIHC, N = 358), Kidney Clear Cell Carcinoma (KIRC, N = 366), Cervical Cancer (CESC, N = 286), Bile Duct Cancer (CHOL, N = 36), were included to analysis clinical outcome.

We employed the Kaplan-Meier approach to estimate the overall survival (OS), and evaluated the divergence between groups by conducting the log-rank test. p < 0.05 was considered significant.

PPI network analysis

The search tool for retrieval of interacting genes (STRING, https://string-db.org) database (Szklarczyk et al., 2021) is served to predict the protein-protein interaction network (PPI network). To identify potential interactions between target genes according to different EpiTFs, R package STRINGdb(ver.2.10.1, database version = 11.5, species version = 9,606) was employed. Active interactions (score threshold = 700) were applied to construct the PPI networks. R packages igraph(ver.1.4.2) and ggraph(ver.2.1.0) were used to constructive and visualize the PPI network. In the networks, the nodes referred to the proteins and the edges represent the interactions. To display the highly connected regions of the network, we set the following criteria: minimum degree = 5, edge weights = combined score, the minimum average edge weight within the cluster = 0.05.

Results

TFs involved in epigenetic modification

To evaluate the dynamics of epigenetic modification in tumor genomes systematically and identify the reliable driver TFs, we collected ChIP-seq data for TFs and histone marks (HMs) from the ENCODE database. After a raw filtration (see Methods; Figure 1), the dataset included 636 transcription factors and 7 epigenetic marks (H3K4me3, H3K36me3, H3K4me1, H3K27ac, H3K79me2, H3K9ac, H4K20me1) from 54 cell lines among 17 different tumors. Then we performed an Ordinary least squares (OLS) regression model to screen for TFs and their corresponding epigenetic marks with consistent binding activities at their overlapped loci. To our surprise, we only obtained 6.9% (N = 44) candidate TFs (FDR <0.1), which can be considered to be associated with HMs, suggesting a finite correlation between TFs and epigenetic changes.

FIGURE 1
www.frontiersin.org

FIGURE 1. The schematic workflow of our study. In this study, we collected ChIP-seq data for TFs and HMs. Then, we inferred the causal biological mechanisms for the candidate determinants of the histone modification via EpiTFs mutation. In addition, we perform gene sets enrichment, PPI analysis, Drug resistance and Survival analysis for the co-regulated loci of EpiTFs and HMs to investigate the biological function and clinical outcome of EpiTFs.

In order to better understand the causal relationship between TFs and epigenetic changes, we further generated an instrumental variable (IV) regression for the 44 candidate driver TFs based on a mechanism of epigenetic reprogramming: the mutational status of a candidate TF directly alter its binding activity in cancers and then influence the epigenetic changes (Figure 1). We used the mutational status of the TFs as the instrumental variable to exogenously separate the changes in explanatory variable that are independent of the error term and ignore the part of explanatory variable that causes bias in the OLS estimator. This change could be considered as the causal effect in TF binding strength on corresponding tumor genomic HMs. As a result, we identified 10 significant TFs-HMs pairs that satisfied the condition of the mutational statuses of TFs being an exogenous variable (FDR <0.1 and FDRweak <0.1), among which 6 TFs (AR, EP300, FOXA1, GATA3, POLR2A, TP53) and 3 HMs (H3K4me1, H3K4me3, H3K27ac) were included (Table 1). Of note, H3K4me3 was considered as a marker of promoter regions, and H3K4me1 and H3K27ac were markers of enhancer regions. Thus, all of our driver TFs mainly impacted the promoter and enhancer elements: mutations in TFs such as AR, EP300, and TP53 tended to affect the enhancer regions of downstream target genes, while other transcription factors affected the binding intensity of both promoter and enhancer regions (Table 1).

TABLE 1
www.frontiersin.org

TABLE 1. The results of IV analysis for 10 EpiTFs-HMs pairs.

EpiTFs influence specific cancer pathways

As the epigenetic determinants of driver TFs (also we named it EpiTFs) influence many biological processes in tumors, we next explored how the EpiTFs impact carcinogenesis and the consequent biological–clinical characteristics of cancer.

Firstly, we obtained the downstream target genes through figuring out the overlapped regions for each TFs-HMs pair. The most recent advances in gene-editing technology has allowed for precise examination of the role of specific genes in driving the growth of cancer cells. According to the CERES scores for genes from Depmap portal, we categorized 10,343 genes of cancer dependency according to the median CERES score across 233 cell lines. We then compared the fold enrichment of our target genes among 10 TFs-HMs pairs. We found out that almost all the target genes of these pairs were significantly enriched as the oncogenes, whose CERES score is less than zero (CERES = −2 ∼ −0.4, Figure 2A). Among them, the target genes of H3K4me3 resulting from FOXA1, GATA3 and PLOR2A as well as H3K27ac resulting from EP300 and GATA3 were significantly enriched in the gene sets (CERES = −2∼−0.4, p < 0.001), signifying a broad dependency for tumor proliferation. Compared to H3K4me3 and H3K27ac, H3K4me1 target genes were weaker to cancer dependency in vitro (Figure 2A).

FIGURE 2
www.frontiersin.org

FIGURE 2. The gene sets enrichment analysis. (A) The grouped bar plot showed the fold enrichment results of the target genes come from 10 EpiTFs-HMs pairs in different groups of cancer dependency gene sets (the length of each intervals was 0.4 and CERES <1 meant cancer dependency). (B) The bubble chart showed the 50 hallmark gene sets enrichment results of the target genes come from 10 EpiTFs-HMs pairs (the color indicated FDR, the size of the dots indicated gene ratio).

Secondly, to better know the biological characteristics of these TFs-HMs pairs, we performed pathway enrichment analysis on their target genes using the Hallmark reference databases. The results showed that the target genes of AR:H3K4me1 were mainly enriched in the estrogen-ER activity, and the target genes of TP53:H3K4me1 were enriched in pathways related to hypoxia, apoptosis, UV, and the p53 pathway (Figure 2B), which is consistent with our understanding of these two transcription factors (Mills, 2014; Levine, 2020). It is worth noting that FOXA1 target genes were mainly enriched in pathways related to apoptosis, mitosis, mTOR, TNF-α (Figure 2B), indicating its possible involvement in tumor malignant proliferation and supporting our fold enrichment of CERES scores above (Figure 2A).

EpiTFs define specific subtype of cancer

To further validate whether the causal relationship among these 10 pairs define specific subtype of cancer, we plotted the binding activities at the overlapped loci of EpiTFs and HMs in different tumors, and calculated the Pearson correlation coefficient between them. We observed that for FOXA1, it was more closely associated with the enhancer marker H3K4me1 than the promoter marker H3K4me3. The FOXA1:H3K4me1 correlation coefficient was 0.86, 0.79, while the FOXA1:H3K4me3 coefficient was only 0.082, 0.29 in COAD and LIHC respectively (Figures 3A,B), indicating that FOXA1 is enhancer-dependent transcription factor in COAD and LIHC. In addition to FOXA1, we also found that three other transcription factors - TP53, AR, and EP300 - still had significant correlations in COAD (correlation coefficient R2 > 0.3), including TP53:H3K4me1 (R2 = 0.5), AR:H3K4me1 (R2 = 0.79), EP300:H3K4m1 (R2 = 0.38), and EP300:H3K27ac (R2 = 0.55) (Figures 3C–F). It suggested that these TFs were closely related to tumors in COAD. Interestingly, the corresponding epigenetic markers of these TFs were all enhancer markers such as H3K4me1 and H3K27ac (Figures 3B–F). Therefore, we believed that the epigenetic changes mediated by these TFs mainly concentrated on the enhancer regions of the genome. However, GATA3 (R2≤0.3, Supplementary Figures S1B–D) and PLOR2A (R2≤0.3, Supplementary Figure S1A) led less correlation to epigenetic changes.

FIGURE 3
www.frontiersin.org

FIGURE 3. The Pearson correlation analysis. (A, B) H3K4me1 rather than H3K4me3 showed consistent effects (Pearson correlation) in FOXA1. (C–F) TP53, AR, and EP300 had significant correlations in COAD.

Additionally, some TFs-HMs pairs were occurred in specific tumor. For example, only EP300 and H3K27ac were closely related to BRCA (R2 = 0.47, p < 2.2e-10, Figure 3F); FOXA1 and EP300 were both related to H3K4me1 in LIHC (R2 = 0.79, p < 2.2e-16; R2 = 0.7, p < 2.2e-16, respectively, Figures 3B,E). These prominent pairs in specific tumors may reflect the epigenetic subtypes as well as epigenetic features for the tumor biology.

EpiTF activities are associated with treatment responses

To better describe the biological function of these regulatory regions on genes, we generated a protein-protein interaction (PPI) network analysis on the EpiTFs and epigenetic marks. In PPI networks, “hub nodes” usually refer to nodes with extremely high degree. Researchers often focus on these “hub nodes”to gain deeper insights into the topology of PPI networks and related biological processes (Wang et al., 2023). We labeled the hub proteins that were associated with at least 5 other genes. As a result, the key proteins in the Wnt pathway, such as CTNNB1 (β-Catenin) and MEF2A/D (MEF2), which were influenced by FOXA1:H3K4me3, were located in the core position of the interaction network (Figure 4A). Similarly, among the target genes affected by AR:H3K4me1, we also spotted the presence of key proteins in the Wnt-β-Catenin pathway such as AXIN1, as well as targets of the PI3K/NF-κB pathway (Figure 4B), which respond to cell proliferation. HDAC4, notably, exists in the network, which may be an important auxiliary factor for FOXA1 to participate in epigenetic modifications.

FIGURE 4
www.frontiersin.org

FIGURE 4. The key molecular and drug sensitivities analysis. (A, B) The protein-protein interaction (PPI) analysis showed the labeled hub protein (degree >5) in FOXA1 (A) and AR (B) regulatory network. (C) The bubble plot showed the correlation between the sensitivities of 302 drugs targeted to different pathways and the mean expression of the target genes come from 10 EpiTFs-HMs pairs.

The expression of genes co-regulated by EpiTFs and HMs reflected the biological function levels of EpiTFs. To assess the therapeutic implications of the EpiTFs activities, we used a linear model to calculate the association between the mean expression of the co-regulated genes and the IC50 in 304 clinical small-molecule agents. As a result, four EpiTFs (TP53, AR, EP300 and FOXA1), which were strongly corelated with HMs derived a strong resistance to the similar agents (OR > 1 and p < 0.001), which targeted to the cell proliferation such as cell cycle, mitosis, and PI3K/MTOR signaling (Figure 4C). Notably, chromatin histone acetylation was also related to higher activities of these four EpiTFs in H3K4me1, H3K27ac (Figure 4C), suggesting that HDAC inhibitors may intervene in the development of corresponding tumors. This requires further verification in subsequent research. Moreover, the activities induced by H3K4me3 showed weaker drug effects than those induced by H3K4me1 and H3K27ac (Figure 4C), demonstrated that EpiTFs-driven epigenetic reprogramming tended to occur in enhancer elements rather than promoter elements when suffered from drug resistance.

EpiTFs are associated with clinical outcomes

To further clarify the impact of these EpiTFs on prognosis of clinical outcome, a total of 3295 TCGA patients were enrolled to investigate the effects of different mutation statuses and target gene expression on overall survival. The results indicated that among the four activated EpiTFs, AR mutation and TP53 mutation led to a poor prognosis in the EpiTFs positive cancers (p = 5.5e-4 and p = 0.024, Figure 5A). The 5-year OS were 50.2% (95% CI 28–89.9) and 67.05% (95% CI 64.33–69.85) respectively. While FOXA1 and EP300 had no significant effect on patient survival in all the involved tumors (p = 0.21 and p = 0.59, Figure 5A). In comparison to different types of tumors, FOXA1 mutation in BRCA (27/788 mutated) resulted in worse prognosis (p = 0.044, the 5-year OS was 53.5%, 95% CI 35–58.7, Figure 5B); COAD patients (3/287 mutated) with FOXA1 mutation had better prognosis but without statistically significance (p = 0.4, Figure 5B). Conversely, EP300 mutation did not affect the prognosis of overall survival at both pan-cancer level and cancer specific level (Figure 5A, Supplementary Figure S2C). Next, we grouped patients by the median mRNA expression of these EpiTFs target genes. We found that patients with lower co-regulated target genes expression of AR as well as TP53 had a worse prognosis (P = 5e-4 and P = 1e-04, Figure 5C). And both of them were associated with H3K4me1 modification (The 5-year OS rates of H3K4me1:AR and H3K4me1:TP53 were 79.12% (95% CI 73.66–85) and 65.09% (95% CI 60.44–70.1), respectively). To our surprise, the higher expression of target genes regulated by EP300:H3K27ac (the 5-year OS was 65.57%, 95% CI 62.44–68.84) rather than EP300:H3K4me1 led to a poorer prognosis (p = 0.001, Figure 5C).

FIGURE 5
www.frontiersin.org

FIGURE 5. Comparison of survival analysis between TCGA patients with different mutation statuses and transcriptional activities in four activated EpiTFs. (A) Kaplan‒Meier curves for patients with different mutation statuses in four activated EpiTFs in pan-cancer level. (B) Kaplan‒Meier curves for patients with different mutation statuses of FOXA1 in specific cancers. (C). Kaplan‒Meier curves for patients with different transcriptional activities in 6 Epi-HMs pairs at pan-cancer level.

Discussion

In this study, we aimed to evaluate the dynamics of epigenetic modification systematically and identify the reliable driver TFs in human cancers. In traditional multiple linear regression models, if there is a correlation between some independent variables and the error term, it will lead to biased OLS estimates, of which the obtained regression coefficients loss a causal interpretation (Lu et al., 2023). To eliminate endogeneity problems and obtain accurate estimates of causal effects, we used the IV regression to infer the significant causal relationship between TFs and HMs. We identified a total of 6 EpiTFs (FOXA1, AR, EP300, TP53, GATA3, LOR2A), which resulted in the corresponding histone modification (e.g., H3K4me3, H3K4me1 and H3K27ac) (Table. 1). Among them, FOXA1 was wildly regarded as pioneer transcription factor (pTF) in prostate cancers. Recent studies demonstrated that pTFs referred to a type of TFs that can bind to packed chromatin tightly and promote the opening and accessibility of gene promoter regions and even enhancer regions (Iwafuchi-Doi et al., 2016; Parolia et al., 2019). Similarly, the EpiTFs mainly caused H3K4me1 and H3K4me3 in our study (Table. 1), suggesting that both of EpiTFs and pTFs fulfill their functions through Cis- and trans-regulated. Moreover, pTF FOXA1 usually collaborated with other partners (like AR) to mediate processes such as histone modification and chromatin remodeling, enabling cells to respond rapidly to external stimuli and achieve complex regulatory networks (Barbieri et al., 2012). Thus, we supposed that the EpiTFs was an extensive concept including pTFs and their partners. However, since bulk TFs lacked their ChIP-seq data (86.25%, 276/320), it was unable to identify all the EpiTFs in human cancers. Increased sample size can improve the statistical power to identify more EpiTFs in the future study.

After further confirmed with the binding activities between TFs and HMs, 4 of 6 EpiTFs (FOXA1, AR, TP53, EP300) were thought to be more positive to induce corresponding histone modification in one or multiple tumors (Figure 3). Among them, EP300 was an acetyltransferase that directly catalyzed H3K27 acetylation modification and promoted chromatin relaxation and transcriptional initiation (Cui et al., 2023), which was in line with the modes of EpiTFs. Through comparing the CERES fold enrichment of our target genes among 10 TFs-HMs pairs, we found that the target genes of these EpiTFs were significantly enriched as cancer dependent genes. The target genes of FOXA1 and H3K4me1 were significantly enriched in both the negative interval (CERES = -1.6∼-0.4, p < 0.001) (Figure 3A). Indeed, it is still unknown how FOXA1 alterations affect the prognosis in human cancers, and how FOXA1 is able to serve as both tumor-suppressor (Jin et al., 2013; Jin et al., 2014; Song et al., 2019) and oncogenic genes (Robinson et al., 2011; Pomerantz et al., 2015). Consistent with this, FOXA1 mutations and transcriptional activities had no significant effect on overall survival in pan-cancer level (Figures 5A,C), and FOXA1 mutations only led to a worse prognosis in BRCA (p = 0.044, Figure 5B). FOXA1 mutations dysregulated estrogen-ER activity (Figure 2B) and were associated with worse outcome for metastatic ER + breast cancer (Hurtado et al., 2011; Arruabarrena-Aristorena et al., 2020). However, large breast cancer gene expression datasets revealed that histone acetyltransferases EP300 was correlated with the cancer stem cells and poor prognosis in triple negative breast cancer and basal-like Breast cancers (Ring et al., 2020). Compared to FOXA1, EP300 has higher association with H3K27ac in BRCA (R = 0.38, p < 0.001, Figure 3F), which could well support those viewpoints. Thus, TFs-HMs pairs characterized some biological features of tumors, and EpiTFs could define different subtype of tumors through their downstream histone modification.

In the PPI networks analysis, the results indicated that some key proteins (β-Catenin, MEF2, AXIN1) in the Wnt pathway were “hub nodes” in the interaction network of FOXA1:H3K4me1 and AR:H3K4me1 (Figures 4B,C). β-Catenin acted as an adhesion protein and accumulated in the nucleus when the Wnt signal was upregulated. As a coactivator of the TCF/LEF family of transcription factors, it can activate Wnt response genes, such as the genes encoding cell cycle proteins like cyclin-D and c-myc that promote cell proliferation, leading to tumor fast progression in cancers such as colon, ovarian, prostate, hepatoblastoma, and hepatocellular carcinoma (O'Brien et al., 2023). We assumed that both FOXA1 and AR resulted in the elevated H3K4me1 level, activating the Wnt/β-Catenin pathway to accelerate the process of downstream cell cycle and mitotic (Figures 2B, 4B). Moreover, HDAC4 was also the “hub node” occurred in the FOXA1 network (Figure 2B). It was reported that FOXA1 could be modulated by HDAC3 through the Wnt/β-catenin signaling in ovarian carcinoma (Lou et al., 2022). HDAC3 and HDAC4 are both histone deacetylases (HDACs), which participate in histone modification within the cell nucleus, altering the structure and function of chromatin by removing acetyl groups (Mustafa and Krämer, 2023). Despite these two HDACs belong to different classes (HDAC3 was HDACI, HDAC4 was HDACIIa), both of them were Zn2+ dependent HDACs and targeted by the pan-HDAC or HDACI/II inhibitors.

Regarding to TP53 and AR, the mutation of these two EpiTFs led to poor prognosis in involved patients (Figure 5A). Meanwhile, the diminished target gene expression of these two EpiTFs also resulted in a worse overall survival (Figure 5C). These results suggesting that the mutation of TP53 and AR caused the reduced binding of DNA. Indeed, the majority mutations in these 2 TFs resulted in defective gene function, including loss-of-function and gain-of-function gene alterations (Rebello et al., 2021; Soussi, 2022). However, in EP300, high transcriptional activities rather than genomic lesions were contributing to tumor progression at pan-cancer level (Figures 5A,C; Supplementary Figure S2C). In cancer cells, structural variations (SVs) in genome can disrupt three-dimensional chromosomal organization, so that the increased deposition of H3K27 (“enhancer hijacking”) promoted oncogene expression (Sungalee et al., 2021; Wang et al., 2021). Although mutated EP300 failed to convert the overall survival at pan-cancer level, high transcriptional activities in EP300:H3K27ac (rather than EP300:H3K4me1) target genes resulted in a worse prognosis (Figures 5A,C). It suggested that H3K27ac tend to aggravate the cancer biology of EP300 mutations though other mechanisms like enhancer-hijacking.

In summary, the study provides valuable insights into the role of EpiTFs in epigenetic modification and their association with cancer. We identified 6 driver EpiTFs via a causality inference including four strong EpiTFs (FOXA1, AR, TP53, EP300). They tended to induce histone modification in enhancer regions (H3K4me1 and H3K27ac). The downstream target genes of them were cancer dependent and enriched in some pathways related to cell proliferation as a whole. Of note, HDAC4 and Wnt/β-Catenin were played critical roles on FOXA1 and AR. These findings may have implications for the development of targeted therapies for cancer treatment.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding authors.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

HJ: Study design, Data collection, Data analysis, Drafting the manuscript. GL: Project supervision, Study design, Methodology, Writing–Reviewing and Editing. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1234515/full#supplementary-material

References

Arruabarrena-Aristorena, A., Maag, J., Kittane, S., Cai, Y., Karthaus, W., Ladewig, E., et al. (2020). FOXA1 mutations reveal distinct chromatin profiles and influence therapeutic response in breast cancer. Cancer Cell 38, 534–550. doi:10.1016/j.ccell.2020.08.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Baksh, S., Todorova, P., Gur-Cohen, S., Hurwitz, B., Ge, Y., Novak, J., et al. (2020). Extracellular serine controls epidermal stem cell fate and tumour initiation. Nat. Cell Biol. 22, 779–790. doi:10.1038/s41556-020-0525-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Barbieri, C., Baca, S., Lawrence, M., Demichelis, F., Blattner, M., Theurillat, J., et al. (2012). Exome sequencing identifies recurrent SPOP, FOXA1 and MED12 mutations in prostate cancer. Nat. Genet. 44, 685–689. doi:10.1038/ng.2279

PubMed Abstract | CrossRef Full Text | Google Scholar

Barretina, J., Caponigro, G., Stransky, N., Venkatesan, K., Margolin, A., Kim, S., et al. (2019). Addendum: the cancer cell line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 565, E5–E6. doi:10.1038/s41586-018-0722-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Berger, A., Brady, N., Bareja, R., Robinson, B., Conteduca, V., Augello, M., et al. (2019). N-Myc-mediated epigenetic reprogramming drives lineage plasticity in advanced prostate cancer. J. Clin. investigation 129, 3924–3940. doi:10.1172/JCI127961

PubMed Abstract | CrossRef Full Text | Google Scholar

Chi, Z., Chen, S., Xu, T., Zhen, W., Yu, W., Jiang, D., et al. (2020). Histone deacetylase 3 couples mitochondria to drive IL-1β-dependent inflammation by configuring fatty acid oxidation. Mol. Cell 80, 43–58. doi:10.1016/j.molcel.2020.08.015

PubMed Abstract | CrossRef Full Text | Google Scholar

Cui, J., Wang, Y., Tian, X., Miao, Y., Ma, L., Zhang, C., et al. (2023). LPCAT3 is transcriptionally regulated by YAP/ZEB/EP300 and collaborates with ACSL4 and YAP to determine ferroptosis sensitivity. Antioxidants & redox signaling.

Google Scholar

Hurtado, A., Holmes, K., Ross-Innes, C., Schmidt, D., and Carroll, J. (2011). FOXA1 is a key determinant of estrogen receptor function and endocrine response. Nat. Genet. 43, 27–33. doi:10.1038/ng.730

PubMed Abstract | CrossRef Full Text | Google Scholar

Iwafuchi-Doi, M., Donahue, G., Kakumanu, A., Watts, J., Mahony, S., Pugh, B., et al. (2016). The pioneer transcription factor FoxA maintains an accessible nucleosome configuration at enhancers for tissue-specific gene activation. Mol. Cell 62, 79–91. doi:10.1016/j.molcel.2016.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, H., Zhao, J., Wu, L., Kim, J., and Yu, J. (2014). Cooperativity and equilibrium with FOXA1 define the androgen receptor transcriptional program. Nat. Commun. 5, 3972. doi:10.1038/ncomms4972

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, H., Zhao, J., Ogden, I., Bergan, R., and Yu, J. (2013). Androgen receptor-independent function of FoxA1 in prostate cancer metastasis. Cancer Res. 73, 3725–3736. doi:10.1158/0008-5472.CAN-12-3468

PubMed Abstract | CrossRef Full Text | Google Scholar

Kant, R., Manne, R., Anas, M., Penugurti, V., Chen, T., Pan, B., et al. (2022). Deregulated transcription factors in cancer cell metabolisms and reprogramming. Seminars cancer Biol. 86, 1158–1174. doi:10.1016/j.semcancer.2022.10.001

CrossRef Full Text | Google Scholar

Kurdistani, S. (2007). Histone modifications as markers of cancer prognosis: A cellular view. Br. J. cancer 97, 1–5. doi:10.1038/sj.bjc.6603844

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, Y., Mok, M., Kang, W., Yang, W., Tang, W., Wu, F., et al. (2018). Loss of tumor suppressor IGFBP4 drives epigenetic reprogramming in hepatic carcinogenesis. Nucleic acids Res. 46, 8832–8847. doi:10.1093/nar/gky589

PubMed Abstract | CrossRef Full Text | Google Scholar

Levine, A. (2020). p53: 800 million years of evolution and 40 years of discovery. Nat. Rev. Cancer 20, 471–480. doi:10.1038/s41568-020-0262-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, K., Han, J., and Wang, Z. (2021). Histone modifications centric-regulation in osteogenic differentiation. Cell death Discov. 7, 91. doi:10.1038/s41420-021-00472-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Lou, T., Liu, C., Qu, H., Zhang, Z., Wang, S., and Zhuang, H. (2022). FOXA1 can be modulated by HDAC3 in the progression of epithelial ovarian carcinoma. J. Transl. Med. 20, 19. doi:10.1186/s12967-021-03224-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, B., Thomson, S., Blommaert, S., Tadrous, M., Earle, C., and Chan, K. (2023). Use of instrumental variable analyses for evaluating comparative effectiveness in empirical applications of Oncology: A systematic review. J. Clin. Oncol. official J. Am. Soc. Clin. Oncol. 41, 2362–2371. doi:10.1200/JCO.22.00023

CrossRef Full Text | Google Scholar

McClellan, B., Haase, S., Nunez, F., Alghamri, M., Dabaja, A., Lowenstein, P., et al. (2023). Impact of epigenetic reprogramming on antitumor immune responses in glioma. J. Clin. investigation 133, e163450. doi:10.1172/JCI163450

CrossRef Full Text | Google Scholar

Meyers, R., Bryan, J., McFarland, J., Weir, B., Sizemore, A., Xu, H., et al. (2017). Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nat. Genet. 49, 1779–1784. doi:10.1038/ng.3984

PubMed Abstract | CrossRef Full Text | Google Scholar

Mills, I. (2014). Maintaining and reprogramming genomic androgen receptor activity in prostate cancer. Nat. Rev. Cancer 14, 187–198. doi:10.1038/nrc3678

PubMed Abstract | CrossRef Full Text | Google Scholar

Moore, J., Purcaro, M., Pratt, H., Epstein, C., Shoresh, N., Adrian, J., et al. (2020). Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 583, 699–710. doi:10.1038/s41586-020-2493-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Mustafa, A., and Krämer, O. (2023). Pharmacological modulation of the crosstalk between aberrant janus kinase signaling and epigenetic modifiers of the histone deacetylase family to treat cancer. Pharmacol. Rev. 75, 35–61. doi:10.1124/pharmrev.122.000612

PubMed Abstract | CrossRef Full Text | Google Scholar

Nusinow, D., Szpyt, J., Ghandi, M., Rose, C., McDonald, E., Kalocsay, M., et al. (2020). Quantitative proteomics of the cancer cell line Encyclopedia. Cell 180, 387–402. doi:10.1016/j.cell.2019.12.023

PubMed Abstract | CrossRef Full Text | Google Scholar

O'Brien, S., Chidiac, R., and Angers, S. (2023). Modulation of wnt-β-catenin signaling with antibodies: therapeutic opportunities and challenges. Trends Pharmacol. Sci. 44, 354–365. doi:10.1016/j.tips.2023.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Parolia, A., Cieslik, M., Chu, S., Xiao, L., Ouchi, T., Zhang, Y., et al. (2019). Distinct structural classes of activating FOXA1 alterations in advanced prostate cancer. Nature 571, 413–418. doi:10.1038/s41586-019-1347-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Pomerantz, M., Li, F., Takeda, D., Lenci, R., Chonkar, A., Chabot, M., et al. (2015). The androgen receptor cistrome is extensively reprogrammed in human prostate tumorigenesis. Nat. Genet. 47, 1346–1351. doi:10.1038/ng.3419

PubMed Abstract | CrossRef Full Text | Google Scholar

Rebello, R., Oing, C., Knudsen, K., Loeb, S., Johnson, D., Reiter, R., et al. (2021). Prostate cancer. Nat. Rev. Dis. Prim. 7, 9. doi:10.1038/s41572-020-00243-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Ring, A., Kaur, P., and Lang, J. (2020). EP300 knockdown reduces cancer stem cell phenotype, tumor growth and metastasis in triple negative breast cancer. BMC cancer 20, 1076. doi:10.1186/s12885-020-07573-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Robinson, J., Macarthur, S., Ross-Innes, C., Tilley, W., Neal, D., Mills, I., et al. (2011). Androgen receptor driven transcription in molecular apocrine breast cancer is mediated by FoxA1. EMBO J. 30, 3019–3027. doi:10.1038/emboj.2011.216

PubMed Abstract | CrossRef Full Text | Google Scholar

Song, B., Park, S., Zhao, J., Fong, K., Li, S., Lee, Y., et al. (2019). Targeting FOXA1-mediated repression of TGF-β signaling suppresses castration-resistant prostate cancer progression. J. Clin. investigation 129, 569–582. doi:10.1172/JCI122367

PubMed Abstract | CrossRef Full Text | Google Scholar

Soussi, T. (2022). Benign SNPs in the coding region of TP53: finding the needles in a haystack of pathogenic variants. Cancer Res. 82, 3420–3431. doi:10.1158/0008-5472.can-22-0172

PubMed Abstract | CrossRef Full Text | Google Scholar

Sungalee, S., Liu, Y., Lambuta, R., Katanayeva, N., Donaldson Collier, M., Tavernari, D., et al. (2021). Histone acetylation dynamics modulates chromatin conformation and allele-specific interactions at oncogenic loci. Nat. Genet. 53, 650–662. doi:10.1038/s41588-021-00842-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Szklarczyk, D., Gable, A., Nastou, K., Lyon, D., Kirsch, R., Pyysalo, S., et al. (2021). The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic acids Res. 49, D605–D612. doi:10.1093/nar/gkaa1074

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, F., Yang, X., Ren, Z., Chen, C., and Liu, C. (2023). Alternative splicing in mouse brains affected by psychological stress is enriched in the signaling, neural transmission and blood-brain barrier pathways. Mol. psychiatry. doi:10.1038/s41380-023-02103-1

CrossRef Full Text | Google Scholar

Wang, X., Xu, J., Zhang, B., Hou, Y., Song, F., Lyu, H., et al. (2021). Genome-wide detection of enhancer-hijacking events from chromatin interaction data in rearranged genomes. Nat. methods 18, 661–668. doi:10.1038/s41592-021-01164-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Z., Li, K., Wang, X., and Huang, W. (2019). MiR-155-5p modulates HSV-1 replication via the epigenetic regulation of SRSF2 gene expression. Epigenetics 14, 494–503. doi:10.1080/15592294.2019.1600388

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: epigenetic reprogramming, oncology, transcription factors, histone modification, instrumental variable regression

Citation: Jiang H and Li G (2023) Transcription factors direct epigenetic reprogramming at specific loci in human cancers. Front. Genet. 14:1234515. doi: 10.3389/fgene.2023.1234515

Received: 04 June 2023; Accepted: 21 September 2023;
Published: 09 October 2023.

Edited by:

Qiyuan Li, Xiamen University, China

Reviewed by:

Tianhai Ji, Shanghai Jiao Tong University, China
Rongshan Yu, Xiamen University, China

Copyright © 2023 Jiang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Guoxin Li, Z3psaWd1b3hpbkAxNjMuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.