Identification of the novel FOXP3-dependent Treg cell transcription factor MEOX1 by high-dimensional analysis of human CD4+ T cells

Baßler, Kevin; Schmidleithner, Lisa; Shakiba, Mehrnoush Hadaddzadeh; Elmzzahi, Tarek; Köhne, Maren; Floess, Stefan; Scholz, Rebekka; Ohkura, Naganari; Sadlon, Timothy; Klee, Kathrin; Neubauer, Anna; Sakaguchi, Shimon; Barry, Simon C.; Huehn, Jochen; Bonaguro, Lorenzo; Ulas, Thomas; Beyer, Marc

doi:10.3389/fimmu.2023.1107397

ORIGINAL RESEARCH article

Front. Immunol. , 25 July 2023

Sec. Systems Immunology

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1107397

This article is part of the Research Topic Advances in Omics Data Analysis and Application in Immunology View all 8 articles

Identification of the novel FOXP3-dependent T_reg cell transcription factor MEOX1 by high-dimensional analysis of human CD4⁺ T cells

Kevin Baßler^1,2†

Lisa Schmidleithner^3†

Mehrnoush Hadaddzadeh Shakiba³

Tarek Elmzzahi^3,4

Maren Köhne³

Stefan Floess⁵

Rebekka Scholz³

Naganari Ohkura⁶

Timothy Sadlon⁷

Kathrin Klee²

Anna Neubauer³

Shimon Sakaguchi⁶

Simon C. Barry⁷

Jochen Huehn⁵

Lorenzo Bonaguro^1,2‡

Thomas Ulas^1,2,8‡

Marc Beyer^3,8‡*

¹Systems Medicine, German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany
²LIMES-Institute, Laboratory for Genomics and Immunoregulation, University of Bonn, Bonn, Germany
³Immunogenomics & Neurodegeneration, German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany
⁴Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
⁵Experimental Immunology, Helmholtz Centre for Infection Research, Braunschweig, Germany
⁶Laboratory of Experimental Immunology, WPI Immunology Frontier Research Center, Osaka University, Osaka, Japan
⁷Molecular Immunology, Robinson Research Institute, University of Adelaide, Norwich Centre, North Adelaide, SA, Australia
⁸PRECISE, Platform for Single Cell Genomics and Epigenomics at the German Center for Neurodegenerative Diseases and the University of Bonn, Bonn, Germany

CD4⁺ T cells play a central role in the adaptive immune response through their capacity to activate, support and control other immune cells. Although these cells have become the focus of intense research, a comprehensive understanding of the underlying regulatory networks that orchestrate CD4⁺ T cell function and activation is still incomplete. Here, we analyzed a large transcriptomic dataset consisting of 48 different human CD4⁺ T cell conditions. By performing reverse network engineering, we identified six common denominators of CD4⁺ T cell functionality (CREB1, E2F3, AHR, STAT1, NFAT5 and NFATC3). Moreover, we also analyzed condition-specific genes which led us to the identification of the transcription factor MEOX1 in T_reg cells. Expression of MEOX1 was comparable to FOXP3 in T_reg cells and can be upregulated by IL-2. Epigenetic analyses revealed a permissive epigenetic landscape for MEOX1 solely in T_reg cells. Knockdown of MEOX1 in T_reg cells revealed a profound impact on downstream gene expression programs and T_reg cell suppressive capacity. These findings in the context of CD4⁺ T cells contribute to a better understanding of the transcriptional networks and biological mechanisms controlling CD4⁺ T cell functionality, which opens new avenues for future therapeutic strategies.

1 Introduction

CD4⁺ helper T cells (T_H cells) are critically involved in most adaptive immune responses as they are responsible for activation of B cells, enhancing the response of cytotoxic T cells, promoting macrophage function and enabling them to mount an immune response against invading microorganisms (1). T_H cells can be divided into different subgroups depending on their function and cytokine production. For example, T_H1 cells secrete mainly IFN-γ and are important for the defense against intracellular pathogens, while T_H2 cells produce a variety of cytokines, including IL-4, IL-5, and IL-13, and are involved in mounting an immune response against extracellular parasites (2). T_H17 cells, on the other hand, produce primarily IL-17 and defend the organism against extracellular bacteria and fungi (3).

However, T_H cells are not only involved in the induction of the immune response but also play a vital role in regulation, modulation, and fine-tuning of immune responses through interactions with regulatory T cells (T_reg cells). T_reg cells exert their regulatory function through a variety of different mechanisms such as expression of inhibitory cytokines or surface markers, direct cytotoxicity or disruption of the metabolism of target cells (4, 5). T_reg cells, like all helper T cell subtypes, are dependent on complex interactions of signaling pathways converging in the activity of different transcription factors (6). The major transcription factor responsible for the induction of T_reg cell programing is Forkhead box protein 3 (FOXP3), which is involved in the generation, maintenance and function of T_reg cells (4, 5). However, even though our understanding of T_reg cell biology has greatly improved since their existence was first hypothesized in the 1970s, we still do not completely understand the underlying regulatory networks which mediate T_reg cell functionality (7). Modern Omics-technologies in combination with innovative bioinformatic analysis approaches have made it possible to analyze immune cells in more detail and to better understand the underlying mechanisms of the activation, functionality and polarization of different immune populations (8). For myeloid cells, transcriptome analysis of differently stimulated macrophages revolutionized our understanding of different macrophage polarization states (9). We hypothesized that utilizing a similar strategy by combining different complementary bioinformatic analysis approaches would enable us to better understand the orchestration of transcriptional and epigenetic events governing T_reg cell programing and provide new insights into T_reg cell biology. The usefulness of such approaches has also been documented for T cell biology as bioinformatic analyses have successfully been applied to advance our understanding of T cell differentiation. For instance, we and others identified novel genes which are important for the differentiation and functionality of T_reg cells (10–12), T_H17 cells (13, 14), and T_H1 cells and T_H2 cells (15).

The advent of single-cell transcriptomics over the last years has even further increased our appreciation that these transcriptional changes can be governed through the interaction of transcription factors, posttranscriptional changes induced by miRNAs and lncRNAs but also through epigenetic mechanisms including DNA methylation, chromatin accessibility and histone modifications. Previous epigenetic analyses have shown that a better understanding of the underlying cell type specific epigenetic events is important to better understand T cell biology. E.g. nucleosomes around loci of the different lineage-defining T_H-cell cytokines are differentially methylated between the different T_H-cell subgroups, while the histone methylation of lineage defining transcription factors is more plastic (16, 17). In addition, DNA methylation has been described as a key element for T cell differentiation with the FOXP3 locus serving as the prime example for DNA methylation being important for T_reg cell generation and stability (18).

In light of this, it is obvious that a multi-layered approach analyzing pre-existing or novel datasets has the potential to yield new insights into the mechanisms governing T cell proliferation, differentiation and function and thus may also shed light on different disease progresses. However, validating these findings experimentally can be challenging (11, 19).

In this study, we combined different bioinformatic analysis methods using a systems immunology approach to analyze previously published datasets from human CD4⁺ T_H cells including differently stimulated CD4⁺ T_H cells and T_reg cells to establish common traits for all CD4⁺ T_H cells but also to identify new T_reg cell signature genes.

Using this approach, we identified the transcription factor MEOX1 (Mesenchyme Homeobox 1) to be highly expressed particularly in activated T_reg cells with a similar expression pattern to FOXP3. MEOX1 has been primarily implicated in early development, where MEOX1 is necessary in all somatic compartments to ensure proper development (20), with expression levels dropping with gestational age. Frameshift mutations resulting in an unstable MEOX1 transcript or nonsense mutations of MEOX1 have been described to cause Klippel-Feil-Syndrome, a segmentation defect in the cervical spine (20, 21). Furthermore, MEOX1 has been implicated in the development and progression of breast and non-small cell lung cancer (22, 23). As such, MEOX1 expression has been correlated with breast cancer stage and poor survival (22). However, despite the importance of MEOX1 in both development and cancer, only recently a first report has indicated MEOX1 to be important for T_reg cells in the context of the tumor environment in intrahepatic cholangiocarcinoma and the acquisition of a tumor-infiltrating T_reg cell phenotype (24).

In an effort to better understand the role of MEOX1 in T_reg cells, we analyzed the MEOX1 locus and demonstrated that it is epigenetically more accessible in T_reg cells than in all other CD4⁺ T cell subsets and that it contains a FOXP3 binding site. Furthermore, we validated these data in human T_reg cells at both the mRNA and protein level, and by siRNA knockdown experiments established that transcriptional control through MEOX1 is downstream of FOXP3. Using transcription factor binding prediction, we identified target genes of MEOX1 which we experimentally verified to be MEOX1 target genes at the mRNA level. Furthermore, we could link expression of MEOX1 to T_reg cell suppressive function. Thus, by applying a systems immunology approach we discovered a new transcription factor in human T_reg cells operating downstream of FOXP3 important for human T_reg cell activity.

2 Materials and methods

Key resources table

2.1 Experimental model and subject details

2.1.1 Human subjects

Human T cells were purified from Buffy coats of healthy human donors obtained in compliance with institutional review board protocols (Ethics committee, University of Bonn, Reg. No. 288/13) after written consent. Due to privacy regulations, gender and age of these donors could not be ascertained.

2.1.2 Cell lines

Human embryonic kidney (HEK) 293T (ATCC CRL-11268; female) cells were maintained in DMEM containing 10% heat-inactivated fetal calf serum. Cells were cultivated at 37°C, 5%CO2.

2.2 Method details

2.2.1 Dataset compilation and primarydata handling

All microarray gene expression data (GSE15390 and GSE17241) were downloaded from the GEO database and compiled using GenomeStudio (Illumina). A total of 217 samples were imported into Partek Genomics Suite 5.0 (PGS) for further analysis including quantile normalization. Batch effects caused by separate array experiments were removed from normalized log2-transformed data. Background signal was calculated within R based on the coefficient of variation (the computed background for the entire dataset was 7.183). Genes were defined as expressed and kept for further analyses if their mean expression values were higher than background in at least one of the 217 samples. Afterwards, multi-probes were filtered to retain only a single probe with the highest mean expression as representative for the corresponding gene. To this end, principal component analysis (PCA) was performed and validated using networks based on Pearson’s correlations which were visualized in BioLayout Express3D. Only samples, which clearly deviated from other samples in both methods, were assumed to be outliers and hence removed from the dataset. Finally, 217 samples containing 14,632 expressed genes (also referred to as present genes) were kept for further analyses in R 3.4.1. Validation of MEOX1 expression in human T_reg cells was performed by reanalyzing a publicly available dataset downloaded from GEO (GSE11292) (11).

2.2.2 Principal component analysis and t-distributed stochastic neighbor embedding

PCA was applied on all present genes using the function prcomp of the R package stats by leaving the default setting unaltered. Moreover, nonlinear dimensionality reduction was performed to identify similarities between the CD4⁺ T cell samples by utilizing t-distributed stochastic neighbor embedding (t-SNE) (29). Therefore, the R package Rtsne was applied to all present genes by leaving the standard parameters unaltered despite of theta which was set to 0.0.

2.2.3 Correlation coefficient matrices combined with hierarchical clustering

The computation of the Pearson’s correlation coefficients (PCC) was done in a pairwise fashion between all CD4⁺ T cell conditions using PGS, which resulted in correlation coefficient matrices (CCMs). PCCs were computed using Pearson’s (Linear) correlation based on all present genes. The hierarchical clustering is based on Euclidean distance of the cells and was plotted as the standardized correlation coefficients (mean of zero and standard deviation of one) for the CD4⁺ T cell conditions. This resulted in 11 larger clusters representing all 48 conditions.

2.2.4 ANOVA calculation for differential gene identification

Data were analyzed in PGS by 2-way ANOVA for more than two CD4⁺ T cell conditions and student’s t-test for two conditions only. DE genes were defined by the 2-way ANOVA model (|FC| > 2, FDR adjusted p-value < 0.05) (30). The identified DE-genes were visualized in an UpSet plot using the R package ‘UpSetR’ (31).

2.2.5 K-means clustering combined with hierarchical clustering

In accordance to k-means clustering performed by Smeekens et al. (32) we used as input to the algorithm the most variable genes out of the 2-way ANOVA (p < 0.05; 9,925 genes) and calculated the fold-change (FC) between any sample and freshly isolated naïve CD4⁺ T cells (also referred to as cluster 11 or “T_conv cell resting”). To determine the optimal number of k clusters, the Davies-Bouldin index of absolute expression values was determined using PGS which resulted in 25 clusters. Similar to CCMs, the hierarchical clustering was calculated followed by ranking of the rows according to k-means cluster-affiliation and plotting of the standardized fold-changes (mean of zero and standard deviation of one) for the CD4⁺ T cell conditions. This resulted in 10 larger clusters representing all 48 conditions.

2.2.6 Self-organizing map clustering

Using the SOM-clustering implementation of PGS, the CCM-defined clusters were compared based on topological patterns and enabled the investigation of cluster-specific genes. First, the mean expression of the most variable genes based on a 2-way ANOVA was calculated for each cluster separately and standardized to a mean of zero and standard deviation of one. Next, 20,000 training iterations were used to cluster similar genes close to each other on the map. In our settings, the 9,925 genes were divided into 10 x 10 SOM-clusters (approximately 90 genes in each SOM-cluster), and the mean expression values of each SOM-cluster genes were used to calculate an eigenvalue, which represented the general expression value of this SOM-cluster. The resulting data were then visualized as a heatmap representing increased values in red, decreased values in blue and intermediate values in green.

2.2.7 Weighted gene co-expression network analysis

We utilized the WGCNA R package (33) to identify co-expressed genes associated with the 11 CCM-defined clusters. As input to this algorithm served the union of all DE-genes (|FC| > 2, FDR adjusted p-value < 0.05) between a certain cluster and the ‘T_conv cell resting’-cluster. The standard parameter of WGCNA was altered to a power of 15 and a minModuleSize of 10 resulting in 32 modules. For each module, the eigengene corresponding to the first principal component was calculated and subsequently correlated to the respective clusters. The correlation values were visualized in a heatmap.

2.2.8 Gene set enrichment analysis

To validate WGCNA, GSEA on the 32 modules in 10,000 permutations using PGS was performed (34). For each comparison (samples within a CCM-defined cluster versus all other samples of the dataset), normalized enrichment score (NES), allowing comparisons of overrepresentation between different gene sets, together with p-value of GSEA were plotted by Volcano plots. The two WGCNA modules, which exhibited both the highest eigengene-to-cluster correlation and a significant p-value (< 0.05), based on GSEA were selected. Genes within these modules were visualized in another Volcano plot by plotting expression ratios (reference: ‘T_conv cell resting’) versus p-values obtained by t-test statistics.

2.2.9 Prediction of potential FOXP3 targets

The Cytoscape plug-in iRegulon (35) was used to investigate the potential upstream transcription factors (TFs) controlling the expression of genes found within the two WGCNA modules with the highest correlation to ‘T_reg cell CD3/IL2’. Therefore, TF prediction was performed in a genomic region 500 bp upstream of the TSS. Subsequently, all genes which exhibited binding motifs for FOXP3 were visualized in a circular layout in Cytoscape.

2.2.10 Prediction of potential MEOX1 targets

To identify potential MEOX1 target genes, we utilized the database provided by the R package tftargets (https://github.com/slowkow/tftargets ) and queried genes that carry a binding site for MEOX1 in their promoter regions. The results were then filtered for genes found within the two WGCNA modules with the highest correlation to ‘T_reg cell CD3/IL2’ and that exhibited an expression fold change > |1.0|.

2.2.11 Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe)

The expression data was loaded into an integrated genomic analysis platform genomics Workbench 2.6.0 (36) to implement the ARACNe algorithm for network analysis (37). All present genes were taken into calculation of mutual information (MI) with p-values less than 0.05 without Bonferroni correction. The threshold of data processing inequality (DPI) theorem from information theory was set to 0.01. The resulting network consisted of 14,494 nodes with 179,876 edges

2.2.12 Tool for inferring network of genes (TINGe)

Similar to ARACNe, all present genes were used as input to TINGe (38). However, we used neither a p-value cutoff nor a DPI-value. The resulting network consisted of 14,494 nodes with 235,933 edges.

2.2.13 Context likelihood of relatedness (CLR)

We took advantage of the web server LegumeGRN that provides a CLR-implementation (39). We used the CLR-method (“plos”) which was used in the original publication (40) and left the default setting unaltered with the exception that the number of edges in the output file was limited to 250,000. The resulting network consisted of 12,641 nodes.

2.2.14 Gene network inference with ensemble of trees (GENIE3)

In the present study, we applied the R package genie3 (41) for the prediction of the regulatory network of all present genes by setting the number of trees to 1000 and limit report.max to 250,000. The resulting network consisted of 12,302 nodes with 227,449 edges.

2.2.15 Basic correlation

Pearson’s correlation was employed to compute the relationships between all gene pairs within the gene expression data. As input to this calculation, we used genes which were found in the output-networks of all of the abovementioned algorithms (10,721 genes). Pearson’s correlation was calculated using BioLayout Express3D. Setting the correlation cutoff to 0.7 resulted in a network consisted of 7,216 nodes with 700,136 edges.

2.2.16 Consensus network

All gene-gene interactions derived from each individual network prediction were ranked by its ranking function. By computing average rank for each gene pair using the GP-DREAM module AverageRank (42), 10,000 top ranking interactions was obtained. The resulting consensus network consisted of 3,845 nodes (R² = 0.938) and was visualized in a force-directed layout in Cytoscape.

2.2.17 Candidate gene prioritization

To link identified hub genes with transcriptional regulation of T cells, we supplemented the consensus network obtained by AverageRank with prior knowledge by applying the following strategy. The top 20% highly connected hub genes were prioritized by association with transcriptional regulation of T cells using the well-known T cell biology regulators TCF7, SATB1, EZH2 and GATA3 as test genes for the prioritization tools ToppGene (43) and Endeavour (44). As training parameters of ToppGene we used following features: “GO: Molecular Function”, “GO: Biological Process”, “GO: Cellular Component”, “Pathway”, “Pubmed”, “Coexpression” and “Coexpression Atlas”. For Endeavour following features were used: “EnsemblEst”, “GeneOntology”, “Kegg”, “Swissprot”, “Expression – SonEtAl” and “Expression – SuEtAl”. The results of the two approaches were combined by the Borda ranking method.

2.2.18 Identification of target genes of prioritized TFs using iRegulon

The 20% hub genes were used as input to iRegulon and potential upstream TFs were predicted in a genomic region of 500 bp of their respective TSS. Subsequently, the predicted TFs CREB1, E2F3, AHR, STAT1, NFAT5 and NFATC3 and their putative target genes were visualized in a network using the circular layout tool of Cytoscape. Moreover, the prioritized TFs and their target genes were used to perform GOEA.

2.2.19 Clustering of the consensus network using Markov clustering algorithm (MCL)

To identify sub-structures within the consensus network, we applied MCL which is implemented in the Cytoscape plug-in clusterMaker (45). In the present study we applied MCL using the default settings.

2.2.20 Single-cell RNA-seq analysis

We downloaded a recently published T cell single-cell RNA-seq dataset from the GEO database (GSE99254) that comprises CD4⁺ T cell- and CD8⁺ T cell-populations from non-small-cell lung cancer patients (28). The published t-SNE map of the single-cell study was reconstructed by utilizing the normalized and centered data provided on the GEO database and following the recommended pipeline of the authors. Briefly, we took advantage of the sscClust analysis pipeline (https://github.com/Japrin/sscClust) with which we determined the 1,500 most variable genes using the “sd” parameter followed by the calculation of the top 30 principal components, which were then used as input into the t-SNE construction. The cluster-annotation was extracted from published analysis results provided on the webpage http://lung.cancer-pku.cn. Identification of MEOX1 co-expressed genes in the single-cell RNA-seq dataset was performed using the TPM normalized expression data as input and calculating the Pearson’s correlation of MEOX1 with all other genes.

2.2.21 Analysis of histone modifications of the genomic MEOX1 locus

ChIP-seq data, which were provided by the NIH Roadmap Epigenomics Mapping Consortium, (46), was downloaded as WIG (wiggle) file and visualized using the integrative genomics viewer (IGV). For the analysis we used the available information about histone modifications for CD4⁺CD25^-CD45RA⁺ T cells and CD4⁺CD25⁺CD127^- T cells (both cell-types obtained from donor 63).

2.2.22 Isolation of human T_reg and T_conv cells from buffy coats

Human T cells were purified from Buffy coats of healthy human donors. CD4⁺ T cells were isolated by positive selection using pluribeads (Pluriselect) according to manufacturer’s instructions. To isolate T_reg and T_conv cells, cells were either isolated using CD25 MACS beads (Miltenyi Biotech) or sorted on a BD Aria III cell sorter (BD Biosciences) after staining with antibodies against CD3, CD4, CD25 and CD127. Gating strategy is shown in Figures S2A, B. As MEOX1 is an intracellular protein, during sorting for living cells it was not possible to stain for the expression of MEOX1. In addition, dead cells were excluded using the LIVE/DEAD fixable near-IR dead cell stain kit (Thermo Fisher Scientific).

2.2.23 Transfection of MEOX1

MEOX1 vector was transfected into HEK cells using the Turbofect transfection reagent (Thermo Fisher) according to manufacturer’s instructions. Transfection efficacy was tested by flow cytometry and only cells with a transfection efficacy of ≥85% were used for subsequent experiments.

2.2.24 Flow cytometric analysis

Antibodies for flow cytometric analyses were purchased from Biolegend or Thermo Fisher Scientific. Extracellular staining was performed at 4°C in the dark for 30 minutes. Intracellular staining of FOXP3 was performed using the Foxp3 Staining Buffer kit (Thermo Fisher Scientific) according to manufacturer’s instructions using the PCH101 clone for unstimulated T_reg cells, while staining of FOXP3 in stimulated T_reg cells was performed with the 206D clone. MEOX1 staining for flow cytometry was performed by first staining for MEOX1 for 1 hour at 4°C followed by staining with a fluorochrome conjugated anti-rabbit secondary antibody for 30 minutes at 4°C in the dark. Samples were acquired on a BD LSR II or Symphony A5 flow cytometer (BD Biosciences). Data were analyzed using FlowJo.

2.2.25 IL-2 stimulation

For IL-2 stimulation, MACS-isolated T_reg cells were cultured in 96 well plates at a concentration of 1x10⁵ cells/well in X-Vivo 15 medium in the presence of 0, 10, 100 or 1000 U/ml IL-2 for 0, 12, 24, 48 or 72 hours. Cells were subsequently harvested in Trizol (Thermo Fisher Scientific) prior to RNA isolation. RNA from higher cell numbers was isolated by isopropanol precipitation as previously described (47). For flow cytometry staining of MEOX1, PBMCs were isolated and seeded in 6 well plates at a concentration of 4x10⁶ cells/well in 3 ml RPMI stimulated with 100 U/ml IL-2 overnight.

2.2.26 RNA isolation, cDNA synthesis and qRT-PCR

Cells were resuspended in Trizol (Thermo Fisher Scientific) and RNA was isolated according to the manufacturer’s recommendations. If fewer than 50.000 cells were used for RNA isolation, the miRNeasy Mini Kit (Qiagen) was used instead according to the manufacturer’s instructions. cDNA was generated using the Transcriptor First Strand cDNA synthesis kit (Roche Diagnostics) according to manufacturer’s specifications. qRT-PCR was performed using LightCycler Taqman master kit and the Universal Probe Library assay (Roche Diagnostics). Primer Sequences are listed in Table S7.

2.2.27 siRNA-mediated knockdown

siRNAs were purchased from Biomers and used for transfection of MACS-isolated T_reg cells. Transfection was carried out with the human T cell nucleofector kit (Lonza) as per manufacturer’s specifications as previously described (10). Sequences of siRNAs can be found in Table S7.

2.2.28 DNA methylation analysis

Genomic DNA was isolated from sorted and stimulated T_reg and T_conv cells using the DNeasy Blood & Tissue Kit (Qiagen) and concentrated using the DNA Clean & Concentrator Kit (Zymo Research), both following the manufacturer’s instructions. The sample DNA was converted by bisulfite using the EZ DNA Methylation-Lightning Kit (Zymo Research) according to the manufacturers protocol and subjected to pyrosequencing as described previously (48). Amplification and sequencing of the regions A (chromosome position17: 43662392-43662439) and B (chromosome position 17: 43661827-43661888) in the MEOX1 gene locus was performed with the amplification/sequencing primers listed in Table S7. The indicated chromosome positions refer to genome assembly GRCh38.p13.

2.2.29 Immunoblotting

Immunoblotting was performed as previously described (12). Briefly, cells were lysed in RIPA buffer (10 mM Tris-Cl (pH 8.0), 1 mM EDTA, 0.5 mM EGTA, 1% Triton X-100, 0.1% sodium deoxycholate, 0.1% SDS, 140 mM NaCl) Lysates were denatured and run on a 10% SDS-PAGE gels and blotted onto nitrocellulose membranes. Primary AB incubation was performed in 5% milk in PBST or 2.5% BSA in PBST, according to AB manufacturer recommendations for at least 12 hours at 4°C. Blots were then incubated with fluorochrome coupled secondary AB. Following the secondary antibody incubation, protein signals were detected on the LICOR Odyssey Imaging System. The following antibodies were used: MEOX1 (abcam) ab23279, Actin (Sigma-Aldrich) MAB1501.

2.3 Quantification and statistical analysis

All statistical analysis except analysis of gene expression data were performed with Graph Prism software version 8.0 (GraphPad Software). Unless otherwise specified, data were plotted as mean ± SEM. To determine significant differences the two-tailed student’s t test was performed when comparing normally distributed data of two groups or post-hoc Bonferroni when comparing multiple groups. P values less than 0.05 were considered significant (n.s. indicates not significant, * = p < 0.05).

2.4 Data and software availability

GEO: microarray data, GSE15390 and GSE17241; tiling array data, GSE20995; public datasets: MeDIP-seq data, DRP000902 (http://trace.ddbj.nig.ac.jp); Histone modification dataset, GEO NIH Roadmap Epigenomics (http://www.roadmapepigenomics.org); FOXP3 ChIP-seq data: SRA: SRP006674; Human immune cell dataset, www.nextbio.com; Validation of MEOX1 expression in human T_reg cells: GSE11292. Single-cell RNA-seq: GSE99254.

3 Results

3.1 Transcriptome analysis of CD4⁺ T_conv and T_reg cells reveals activation-dependent clustering of cells

To better understand how activation influences gene expression in T_conv and T_reg cells and to identify key regulatory events responsible for T cell differentiation, we analyzed the transcriptome of human resting or activated T_conv and natural T_reg cells together with T_H1 and T_H2 differentiated cells (Table S1). To do this, we combined data from our previous work (10) with an analysis focusing on T_H1- and T_H2-differentiation of CD4⁺ T cells (26) resulting in a dataset comprising 48 different conditions.

To gain insight into the general sample-to-sample relationships, we first visualized the transcriptional variance of the complete dataset using principal component analysis (PCA, Figure 1A). This analysis revealed a bipolar structure in which cells stimulated by CD3/CD28 antibodies (activated T cells) clustered away from all other T cells, indicating that the activation stimulus is causing the highest variance within the dataset. Moreover, this analysis further indicated that the global transcriptional changes induced by CD3/CD28 stimulation were so prominent, that co-incubation with putative inhibitory molecules like IL-10, TGF-β, and VEGF were not able to revert or modulate this response. The only exceptions to this observation were activated CD4⁺ T cells which were co-incubated with blocking antibodies against immune checkpoint molecules, such as PD-1 or CTLA-4, as these cells exhibited a close transcriptional relationship to resting CD4⁺ T cells in the PCA. Interestingly, when plotting first and second principal components, CD3/CD28-activated CD4⁺ T cells were separated into two populations. Activated CD4⁺ T cells stimulated by plate-bound CD3 and soluble CD28 in combination with T_H1- and T_H2-polarizing cytokines formed a distinct population (26), while CD4⁺ T cells stimulated with CD3/CD28-coated beads were clearly separated from this population (10). Within the population of CD4⁺ T cells not activated by CD3/CD28, there was also a separation detectable between cells cultured in vitro for more than 12 hours and cells harvested directly after or within 12 hours after isolation. Unexpectedly, T_reg cells were grouped together with either the unstimulated or short-term cultivated cells, further supporting that the major difference between samples was driven by activation. In summary, using multidimensional scaling we were able to identify several distinct sample clusters within the dataset, with TCR/CD28 activation being the main driver of biological variance while lineage defining aspects, e.g. differences between T_conv and T_reg cells only contributing a small portion to the overall variance.

FIGURE 1

Figure 1 Data dimensionality reduction reveals transcriptional relationships between different CD4⁺ T cell samples (A) Visualization of the dataset by depicting the first (PC1) and the second principal components (PC2) of the computed PCA annotated with all 48 conditions. (B) Z-score normalized matrix of hierarchically clustered Pearson’s correlation coefficients between CD4⁺ T cell conditions. Conditions were annotated according to Table S1. Clusters of transcriptional similar CD4⁺ T cell conditions were annotated according to the predominant stimulation among the conditions. (C) PCA annotated according to the predominant stimulation among the conditions. (D) Visualization of the dataset using t-SNE. (E) Visualization of k-means clusters combined with hierarchical clustering. As input to k-means clustering served expression differences (fold changes with resting T_conv cells as reference), which were calculated for the most variable genes within the dataset. Z-score normalized fold changes are indicated by the coloring (blue to red). Conditions were annotated according to Table S1.

3.2 Reduction of data complexity using correlation coefficient matrices identifies subclusters of transcriptionally related CD4⁺ T cells

Since the PCA indicated that the samples within the dataset exhibited different degrees of transcriptional relationships and thus formed distinct clusters, we reasoned that we could reduce the complexity of the dataset by grouping transcriptionally related samples together and performing further analyses using these clusters, instead of analyzing each sample separately. To this end, we computed Pearson’s correlation coefficients between all conditions and visualized the results in a correlation coefficient matrix (CCM) combined with hierarchical clustering (HC) (Figure 1B). This analysis revealed that the samples clustered in a bipolar fashion, grouping activated separately from non-activated CD4⁺ T cells, supporting the observation that CD3/CD28 activation caused the biggest variance within the dataset. However, the agglomerative nature of HC enabled us to refine the bipolar structure of the dataset by identifying several sub-clusters. More specifically, we found that the complete dataset was characterized by at least 11 clusters. Next, cluster names were selected representing the majority of T cell conditions within the clusters (Table S2) and mapped to the PCA (Figure 1C). For example, the cluster on the right hand side was mainly composed of untreated CD4⁺ T cells and hence was termed ‘T_conv cell resting’. As this cluster represented the cell state with the lowest degree of activation within the dataset, we were surprised that these cells were most closely related to T_reg cells.

To substantiate the results obtained by the hierarchical clustering of the CCM, we visualized the data structure by utilizing t-Distributed Stochastic Neighbor Embedding (tSNE) as a non-linear data reduction approach and colored the samples according to their assignments in the respective CCM clusters (Figure 1D). We observed that the tSNE clusters nicely overlapped with the CCM clusters. Moreover, we also corroborated the validity of the CCM clusters by applying K-means clustering (Figure 1E). Taken together, the samples within the dataset were successfully grouped into biologically informative clusters based on their transcriptional profile.

3.3 Identification of common denominators of CD4⁺ T cell functionality using reverse network engineering

To identify the key genes for general CD4⁺ T cell functionality, we took advantage of reverse network engineering (RNE). To obtain a robust interaction network, we generated a consensus network by combining the following network inference methods: ARACNe (Algorithm for the Reconstruction of Accurate Cellular Networks), CLR (Context Likelihood of Relatedness), GENIE3 (Gene Network Inference with Ensemble of Trees), Pearson’s correlation networks, and TINGe (Tool for Inferring Network of Genes). Next, we combined the generated networks and used the 10,000 top ranking interactions of the resulting consensus network for further analysis (remaining gene/node number: 3845). As expected, given the scale-free nature of the network, a small percentage of genes accounts for most of the connections and thus are referred to as hub genes. We defined as major hub genes the largest 20% of hubs in the network which collectively participate in 7,948 interactions (Figure 2A). As we were interested in transcriptional key regulators of CD4⁺ T cell functionality, we used the TFCat database (49) for filtering the consensus network. Among the top 20% hub genes, we identified 23 common TFs and additional 43 zinc finger (ZNF) TFs (e.g. ZNF454, ZNF549, and ZNF136). Interestingly, we found that the TFs tended to form clusters within the network, suggesting a complex regulatory TF network in which TFs were strongly co-regulated (Figure 2A). To further reduce the list of important TFs for CD4⁺ T cell functionality, we applied two gene prioritization tools, ToppGene and Endeavour, on the top 20% hub-genes. As internal validation gene set, we decided to use the transcriptional regulators TCF7, SATB1, GATA3 and EZH2, which were already described to be important during T cell development, chromatin-organization, T cell differentiation, and T cell homeostasis, respectively (10, 50–53). Of note, 11 transcription factors (TFs) were found among the top 15 highest prioritized genes (Figure 2B). Reasoning that of those, the most highly expressed TFs within the dataset are the most relevant for CD4⁺ T cell functionality, we identified six central TFs (CREB1, E2F3, AHR, STAT1, NFAT5 and NFATC3). For all of them, important roles in T cell functionality have already been described (54–59). To investigate genes which were presumed to be regulated by these TFs, we used the top 20% hub genes as input to iRegulon. As expected, for all six TFs, a relatively high prediction score (normalized enrichment scores (NES) > 2.5) was computed which indicated that a multitude of the top 20% hub genes contained binding-motifs for these TFs. Next, we generated a network of the six TFs and their putative targets (Figure 2C). Remarkably, the network was highly interconnected and showed binding of the TFs between each other, supporting the idea of a sophisticated regulatory network built by TFs which control the central CD4⁺ T cell functionality. To link the genes within the network to biological information, we performed Gene Ontology Enrichment Analysis (GOEA) for each TF together with its direct neighbors (Figure 2D, Table S3). Most of the enriched GO-terms were related to regulation of metabolism and transcription, clearly supporting that metabolic and transcriptional adaptation are key events upon T cell activation (60, 61). In addition, NFAT5 and NFATC3 were found to regulate genes associated with immune response which is in line with previous observations that NFAT family members are critically involved in the induction of a T cell mediated adaptive immune response (54, 57, 62). Taken together, RNE analysis identified six putative central transcriptional regulators of CD4⁺ T cell functionality.

FIGURE 2

Figure 2 Reverse network engineering to infer common CD4⁺ T cell genes. (A) Visualization of the consensus network obtained by the combination of five different RNE-methods. TFs found among the top 20% hub genes were highlighted (common/regular TFs in red; ZNF-TFs in blue). Node size reflects degree of connectivity. (B) Top 23 highest interconnected common TFs were ranked according to Gene Prioritization (GP) among the top 20% hubs. Mean expression (log2) from each cluster is displayed as a heatmap. Degree refers to degree of connectivity. (C) Subnetworks of the six highest expressed TFs from the top 11 GP-ranked hubs. Direct targets (predicted by iRegulon) surround corresponding TFs. Node size reflects degree of connectivity.

3.4 Identification of genes specifically associated with distinct CCM clusters

After identifying common denominators of T cell biology, we wanted to identify the key genes specific for the respective CCM clusters. Therefore, we compared gene expression of each of the specific CCM clusters with resting T_conv cells. This resulted in a total of 2,385 genes which were differentially expressed (DE) (|FC| > 2, FDR adjusted p-value < 0.05) between ‘T_conv cell resting’ and all of the other CCM clusters in at least one condition (Table S4). To visualize the overlap between conditions we used an UpSet plot and observed distinct sets of DE genes for each condition but also overlap between the differentially expressed genes across conditions (Figure 3A). Therefore, we attempted to reduce the complexity of this analysis to identify groups of genes specific for each CCM cluster and applied Self-Organizing-Map (SOM) clustering to identify specific genes within each CCM cluster of samples (Figure 3B). Noteworthy, the color-coding of this clustering method revealed that every CCM cluster was characterized by a specific module-correlation structure. In addition, information about the relationships of certain CCM clusters visible in HC of CCM clusters was also preserved in SOM clustering. For example, all CCM clusters containing T_reg cells (‘T_reg cell resting’ and ‘T_reg cell CD3/IL-2’) exhibited high correlation with modules which were not associated with any of the other CCM clusters (Figure 3B). Examination of these modules unveiled T_reg cell specific genes such as FOXP3 but also potentially novel genes (e.g. mesenchyme homeobox 1 (MEOX1)) which have not yet been described in the context of T_reg cells. We also investigated modules, which were only correlated with activated CD4⁺ T cells (‘T_conv cell act. 8h’, ‘T_conv cell act. 20h’ and ‘T_H1/T_H2’). As expected, these modules mainly contained genes associated with T cell activation (e.g. NFKB1, INFG, TNF and ETS2).

FIGURE 3

Figure 3 Identification of genes associated with the identified CCM clusters (A) UpSet plot of calculated DE genes across the CCM clusters. DE genes found in the same clusters are binned and the size of the bins is represented as a bar chart. At the bottom, dots indicate which CCM clusters contained and shared these DE genes. Only bins with >2 DEgenes are depicted. (B) SOM-clustering using the most variable genes within the dataset as input. Correlation of SOM-cluster genes to CCM-defined clusters are indicated by z-scaled color coding; blue indicates low correlation and red indicates high correlation. SOM clusters specific for either T_reg cells or activated CD4⁺ T cells are marked with a black frame; exemplary genes within these clusters are displayed. (C) WGCNA heatmap showing the correlation of the module eigengene (first principal component; ME) to the traits (CCM clusters). Blue means negative correlation and red means positive correlation. (D) Volcano plots of normalized enrichment scores (NES) and enrichment p-values based on GSEA using WGCNA modules defined in (C). Shown are data for the clusters ‘T_conv cell act. 20h’, ‘T_conv cell TGF-β’, ‘T_reg cell CD3/IL-2’ and ‘T_reg cell resting’. Red circles show significantly enriched gene sets; blue circles show significantly depleted gene sets. Gene sets which exhibited the highest correlation to a certain cluster in WGCNA are indicated by red font. (E) Volcano plots genes within the two WGCNA modules with the highest correlation to the CCM cluster ‘T_conv cell act. 20h’ and ‘T_reg cell CD3/IL-2’. Depicted are the logarithmic gene ratios (‘T_conv cell act. 20h’ or ‘T_reg cell CD3/IL-2’ versus ‘T_conv cell resting’) and logarithmic p-values obtained by t-test. Red circles show upregulated genes (FC >2; p-value <0.05); blue circles show downregulated genes (FC <-2; p-value <0.05). On the right side of each plot, all module genes are shown; on the left side, transcriptional regulators (TRs) among the respective module genes are shown. Genes of interest are highlighted. (F) Microarray expression values of FOXP3 and MEOX1 across the CD4⁺ T cell conditions within the dataset. Conditions are colored according to the identified CCM clusters and annotated according to Table S1. Dashed lines indicate the computed background value of the microarray dataset.

To further investigate CCM-cluster specific gene sets, we applied weighted gene co-expression network analysis (WGCNA), which defines transcriptional modules based on the expression-correlations among genes across all samples. We identified 32 distinct modules containing 14 to 376 genes per module (Table S5). The expression data from all genes within a certain module were used to calculate its module eigengene (ME, the first principal component of a module), which were then correlated to the 11 clusters and visualized in a heatmap (Figure 3C). To identify modules containing genes which were most characteristic for a certain CCM cluster, we utilized WGCNA modules to perform gene-set enrichment analysis (GSEA). Therefore, we calculated NES, which were plotted against enrichment p-values in a Volcano plot. As representatives of the complete GSEA results, we only depicted Volcano plots for ‘T_conv cell act. 20h’, ‘T_conv cell TGF-β’, ‘T_reg cell CD3/IL-2’, and ‘T_reg cell resting’ (Figure 3D). As expected, among the modules with the highest positive NES and lowest p-value, we found those modules that were also most highly correlated in the WGCNA analysis to the respective CCM cluster. Next, we used the genes within the WGCNA modules with the highest ME-to-cluster correlation for further analysis. Since DE genes were utilized as input for WGCNA, we visualized the module genes by plotting the gene expression ratios between the respective cluster and ‘T_conv cell resting’ against p-values obtained by performing FDR-adjusted Student t-tests. The resulting Volcano plot of module genes correlated with ‘T_conv cell act. 20h’ revealed an enrichment of genes associated with tumor necrosis factor (TNF, LTA, TNFRSF4, TNFRSF18) and chemokine receptor ligands (CCL20, CCL4, CCL, 3, CCL3L1), whose expression is known to be increased upon T cell activation (Figure 3E). In addition, we found key molecules for T_reg cell functionality such as FOXP3, EOS, IL1R1, CTLA4, and HPGD among the WGCNA module genes correlated with ‘T_reg cell CD3/IL-2’. Interestingly, we also identified a transcriptional regulator, MEOX1, which was most significantly expressed in clusters containing stimulated T_reg cells (Figure 3E) and has not been associated with T_reg cells to date. Plotting the expression of MEOX1 across all stimulation conditions independent of CCM clustering, we identified MEOX1 expression to be highly upregulated only in condition 48, which corresponds to activated T_reg cells, while its expression was lower in resting T_reg cells (Figure 3F).

In summary, SOM clustering and WGCNA analysis unveiled CCM-cluster associated gene sets which were important for the biological characteristics of the respective CD4⁺ T cells. Moreover, in-depth examination of WGCNA modules associated with the cluster of ‘T_reg cell CD3/IL-2’ enabled us to identify MEOX1 as a novel TF associated with T_reg cells.

3.5 Further characterization of MEOX1 expression confirms its T_reg cell specificity

To validate T_reg cell specific expression of MEOX1, we examined MEOX1-expression in an additional dataset (11). In this microarray experiment, the genome-wide expression of genes was measured by performing a high-resolution time-series analysis during the activation process of human T_reg cells and T_conv cells by treating the cells for up to 360 min with a combination of CD3/CD28-coated beads and IL-2. At each of the measured time-points, we were able to observe a higher expression of MEOX1 in T_reg cells than in T_conv cells (Figure 4A), similar to our own data, supporting increased MEOX1 expression in T_reg cells and demonstrating that the combination of CD3/CD28-coated beads and IL-2 can maintain high expression levels of MEOX1 over 6 hours.

FIGURE 4

Figure 4 Assessment of T_reg cell specific expression of MEOX1 (A) Expression of MEOX1 in activated T_reg cells and T_conv cells over a time period of 360 min (n=2; dataset: GSE11929). (B) MEOX1 gene expression in different immune cells assessed by qRT-PCR and normalized to B2M (n=3). *p < 0.05 (paired Student’s t-test), ** p < 0.01 (paired Student’s t-test). (C) MEOX1 gene expression in different immune cells according to the NextBio database. (D) Application of Markov Clustering Algorithm ‘MCL’ to the consensus network generated in Figure 2. Visualized is a subnetwork consisting of only three genes (FOXP3, HPGD, and MEOX1). (E) Analysis of MEOX1 protein expression in either unstimulated T_reg cells or in T_reg cells stimulated with 100 U/ml IL-2 overnight by immunoblotting. (F) MFI (left) and exemplary histogram (right) of MEOX1 expression in human T_reg cells and naïve T_conv cells. PBMCs were isolated from buffy coats and stimulated overnight with 100 U/ml IL-2 (n=3 of different donors). T_reg cells (red) were gated on size, singlets, live, CD4⁺, CD3⁺, FOXP3⁺ (Clone 206D), CD45RA^- and T_conv cells (blue) were gated on size, singlets, live, CD4⁺, CD3⁺, FOXP3^- (Clone 206D), CD45RA⁺. Secondary antibody controls are depicted in light (T_reg cells) and dark grey (T_conv cells). *p < 0.05 (paired Student’s t-test). (G) MEOX1 gene expression in T_reg cells (red), T_reg cells stimulated with IL-2 (light ref), T_reg cells incubated with supernatant from stimulated T_conv cells (rose) and T_reg cells incubated with supernatant from stimulated T_conv cells in combination with anti-CD25 and anti-IL-2 antibodies (grey) assessed by qRT-PCR and normalized to B2M. Data is from one representative experiment of three (mean and s.e.m.) with cells derived from different donors. *p < 0.05 (two-way ANOVA).

To confirm specificity of MEOX1 expression in T_reg cells, we performed qRT-PCR for MEOX1 in other immune cells (Figure 4B). Expression of MEOX1 was solely detectable in CD4⁺ T cells, with significantly higher expression in T_reg cells and highest expression detectable in T_reg cells cultured in the presence of IL-2. We observed that the IL-2 dependent upregulation of MEOX1 mRNA is dose-independent and that a concentration of as little as 10 U/ml IL-2 is sufficient to induce a significant upregulation of MEOX1 (Figure S1A). This IL-2 dependent upregulation reaches its peak at around 12 hours post stimulation and then continually declines in a dose-independent manner (Figure S1A). This finding is in agreement with gene expression data from the public domain e.g. NextBio (Figure 4C) and VisuTranscript (Figure S1B), clearly indicating that MEOX1 is highly expressed in T_reg cells but not in other immune cells. Next, we asked how MEOX1 is expressed in expanded T_reg cells and could not observe an increase in MEOX1 expression in comparison to freshly isolated T_reg cells (Figure S1C).

Next, we reasoned that the specificity of MEOX1 expression could also be confirmed using network inference. To this end, we applied MCL (Markov Clustering Algorithm) to the consensus network obtained by the RNE and selected the generated sub-network which contained FOXP3 (Figure 4D). Interestingly, this network comprises only three genes: FOXP3, HPGD and MEOX1. As HPGD is also known to be important in T_reg cells (12), the network obtained by MCL further supported the hypothesis of T_reg cell specific MEOX1 expression.

Next, we validated MEOX1 expression at the protein level by western blotting. To this end, we generated a MEOX1 overexpressing vector which we transfected into HEK293 cells to show that the antibody specifically binds MEOX1 (Figure S1D). While we did not detect MEOX1 expression in steady-state T_reg cells, we observed a significant upregulation of MEOX1 in IL-2-stimulated T_reg cells by immunoblotting (Figures 4E). We confirmed these data by flow cytometry, where we showed a significant upregulation of MEOX1 in T_reg cells stimulated overnight with IL-2 compared with T_conv cells (Figures 4F).

Based on these results we further asked if T_conv cells themselves can contribute to the higher expression of MEOX1 in T_reg cells. To this end, we incubated T_reg cells with supernatants from activated T_conv cells and could observe increased MEOX1 expression (Figure 4G). To demonstrate that IL-2 is contributing to the increased MEOX1 expression, we blocked IL-2 by incubating cells with anti-IL-2Rα and anti-IL-2 antibodies and could observe a partial reduction of the increased MEOX1 expression, supporting the idea that IL-2 derived from T_conv cells can contribute to the increased MEOX1 expression in T_reg cells. Taken together, we demonstrate that MEOX1 is expressed in T_reg cells and upregulated in activated T_reg cells through IL-2.

3.6 Single-cell RNA-seq predicts a close co-regulation of MEOX1 and FOXP3 expression

To investigate the expression of MEOX1 in more detail, we reanalyzed a recently published single-cell RNA-seq dataset that comprised human CD4⁺ and CD8⁺ T cell populations from non-small-cell lung cancer patients (28). After reconstruction of the t-SNE representation and the clustering of the original publication (Figure 5A), we visualized the expression of MEOX1. As expected, we found a trend towards higher expression of MEOX1 in the clusters “CD4-C8-FOXP3” and “CD4-C9-CTLA4” (Figure S3A). To validate this observation, we visualized the density of MEOX1-expressing cells in a contour plot and found the highest cell-density in the aforementioned clusters (Figure 5B). Interestingly, the contour plot for MEOX1 was highly reminiscent of the contour plot for FOXP3-expressing cells (Figure 5B), indicating a close co-expression of the two genes on single-cell level, which was also visible when plotting expression for both genes on individual cells (Figure S3B). Indeed, when assessing the dataset for genes correlated with MEOX1 expression, the top ranked gene is FOXP3 followed by typical T_reg cell-markers, such as IL2RA, IL1R1, FANK1, HPGD and CTLA4 (Figure 5C). Moreover, when module genes associated with the ‘T_reg cell CD3/IL-2’ cluster (with a FC > 2 between ‘T_reg cell CD3/IL-2’ and ‘T_conv cell resting’) were visualized in the correlation plot, we found a tendency towards positive correlation values, showing that the results obtained on bulk-scale can be reproduced on single-cell level (Figure 5C). In summary, the analysis of single-cell RNA-seq data substantiated the hypothesis of T_reg cell specific MEOX1 expression and indicated a close correlation with the expression of FOXP3.

FIGURE 5

Figure 5 MEOX1 expression in single-cell RNA-seq data comprising different T cell populations (A) Visualization of single-cell RNA-seq data (GSE99254) in a t-SNE plot. Cells are colored according to the cell labels obtained by Guo et al. (28). Accumulation of cells with the same label are highlighted with colored background. (B) Contour plots showing the areas in the t-SNE plot of T cell clusters from GSE99254 with highest expression of FOXP3 and MEOX1. Accumulations of cells with the same cell label according to (A) are indicated by gray circles. (C) Pearson’s correlation between MEOX1 and all other genes across the CD4⁺ T cells in the single-cell RNA-seq dataset. Genes are grouped according to their respective correlation values. Genes found in the ‘T_reg cell CD3/IL-2’-associated WGCNA modules ‘26’ and ‘32’ (according to Figures 3B–D) are highlighted in light red and their accumulation in the ordered genes is indicated by the histogram at the bottom. A magnification of the 25 genes with the highest expression correlation to MEOX1 is additionally shown in the box at the top.

3.7 Epigenetic regulation of MEOX1 expression in CD4⁺ T cells

One important aspect of the specification of T_reg cells and their transcriptional regulation are epigenetic mechanisms (63). Therefore, we determined whether differential chromatin accessibility in human T_reg and T_conv cells at the genomic MEOX1 locus contributes to the differential expression of MEOX1. Analysis of chromatin accessibility using ATAC-seq (assay for transposase-accessible chromatin using sequencing) revealed a similarly open chromatin landscape in T_reg and T_conv cells (Figure 6A), supporting that additional mechanisms might regulate expression of MEOX1 in T_reg cells. An additional layer of epigenetic regulation orchestrating cell type-specific gene expression are permissive and repressive histone modifications, particularly in close proximity to promoter regions. For example, a combination of trimethylation of H3K4 (H3K4me3) and acetylation of H3K27 (H3K27ac) indicates open chromatin states with accessible promoters and active transcription. To further extend on the epigenetic regulation of MEOX1 expression, we made use of publicly available datasets of ChIP-seq experiments which were provided by the NIH Roadmap Epigenomics Consortium (46). We observed that T_reg cells have a higher tag-count for both H3K4me3 and H3K27ac at the MEOX1 promoter compared with T_conv cells, indicating a more permissive epigenetic landscape at the promoter site in T_reg cells which consequently eases the transcription of this gene, as evidenced by increased H3K36me3 at the MEOX1 gene body in T_reg cells (Figure 6B). Analysis of repressive histone marks (H34K9me3 and H3K27me3) supports the notion that the activation of the transcription of MEOX1 in T_reg cells is an active process mediated mainly by events promoting permissive modifications of histone proteins and TF activity as we did not detect significant trimethylation of any of the two repressive marks over the gene body in T_conv cells (Figure 6B).

FIGURE 6

Figure 6 Epigenetic state of the MEOX1 gene locus in CD4⁺ T_conv and T_reg cells (A) Open chromatin assessment of the MEOX1 locus in T_reg and T_conv cells using ATAC-seq data. (B) ChIP-seq data of histone modifications at the human genomic MEOX1 locus in T_reg and T_conv cells (data obtained from the NIH Roadmap Epigenomics Mapping Consortium). Data on epigenetic regulation of the MEOX1 locus were extracted from a publicly available dataset on genome-wide histone modifications in human T_conv and T_reg cells (46). ChIP sequencing analysis of T_conv and T_reg cells for the genomic MEOX1 locus with antibodies specific for the permissive histone modifications H3K4me3 and H3K27Ac, the transcription-associated mark H3K36me1, the enhancer-associated mark H3K4me1, and the repressive histone modifications H3K27me3 and H3K9me3. (C, D) Methylation of individual CpG motifs within two CpG-rich regions in upstream region of the genomic MEOX1 coding sequence for freshly isolated T_conv and T_reg cell as well as T_conv and T_reg cells stimulated overnight with 100 U/ml IL-2. Each box represents an individual CpG motif after normalization and quantification of methylation signals from pyrosequencing data by calculating ratios of T and C signals at CpG sites. The methylation status of individual CpG motifs is color coded according to the degree of methylation at that site. The color code ranges from yellow (0% methylation) to violet (100% methylation) according to the color scale on the right.

As DNA methylation has been reported as an additional important layer of T_reg cell specific gene expression, we reanalyzed publicly available datasets (64, 65) and identified two CpG islands within the promoter region of MEOX1 which were methylated in T_conv cells but showed significantly lower levels of methylation in T_reg cells (Figure 6C). To confirm this observation, we isolated T_reg and T_conv cells from peripheral blood, stimulated them for 24 hrs with IL-2 and assessed the methylation state of these regions by pyrosequencing (Figures 6D, S4). This analysis suggests that transcription of MEOX1 in T_reg cells is promoted through demethylation of the promoter region. Taken together, the epigenetic analyses of the promoter region of MEOX1 revealed an open chromatin structure with an activating histone landscape and demethylation of promoter-associated CpG islands in T_reg cells but not in T_conv cells.

3.8 Identification of FOXP3 as a putative upstream regulator of MEOX1 expression

As a next step, we investigated the transcriptional regulation of MEOX1. We postulated that potential master regulators upstream of this TF could be identified using TF-binding prediction tools based on motif-search algorithms such as iRegulon. As the input into this algorithm, we used WGCNA module-genes correlated with ‘T_reg cell CD3/IL-2’ cluster. This analysis identified 91 TFs with enriched binding motifs within this gene set including FOXP3 - the lineage-defining TF for T_reg cells (Table S6). We observed that several of the genes within this cluster, including MEOX1, exhibited FOXP3-binding motifs in their promoter region (Figure 7A), suggesting that FOXP3 is a direct regulator of MEOX1 expression. To confirm this hypothesis, we re-analyzed our ChIP-seq data for genome-wide detection of FOXP3 binding sites (10, 25), in which we detected an enrichment of FOXP3 in the promoter region of MEOX1 (Figure 7B).

FIGURE 7

Figure 7 FOXP3 as upstream regulator of MEOX1 expression. (A) Module genes correlated with ‘T_reg cell CD3/IL-2’ which exhibit a FOXP3 binding-motif in their promoter region. Genes were colored according to their respective fold change (reference: ‘T_conv cell resting’). (B) FOXP3 ChIP tiling array data from human expanded cord-blood T_reg cells. Data were analyzed with MAT and overlayed to the MEOX1 locus to identify binding regions (p < 10^-5 and FDR < 0.5%). Data are representative of two independent experiments with cells derived from different donors. (C) Overlay of MeDip-seq (66) and FOXP3 ChIP-seq data (SRA : SRP006674) for the human genomic MEOX1 locus. FOXP3 binding as well as DNA methylation is depicted for T_reg (red) and T_conv cells (blue). (D) mRNA expression of FOXP3 and MEOX1 in T_reg cells treated with scrambled (scrmbld, left) or FOXP3 specific (right) siRNA (E) mRNA expression of FOXP3 and MEOX1 in T_reg cells treated with scrambled (left) or MEOX1 specific (right) siRNA (F) mRNA expression of RPS27L in T_reg cells treated with scrambled, MEOX1 or FOXP3 specific siRNA. (D–F) Data were first normalized to B2M expression and shown in relation to donor-specific scrambled mRNA expression. (D,E) *p < 0.05 (Student’s t-test). (F) *p < 0.05 (two-way ANOVA). (D–F) Data are representative of three to five independent experiments (mean ± s.e.m.), each with cells derived from a different donor. (G, H) Suppression of allogeneic CD4⁺CD25^- T_conv cells labelled with the cytosolic dye CFSE by human T_reg cells transfected with siRNA targeting MEOX1 (MEOX1) or non-targeting siRNA (scrmbld) presented as CFSE dilution in responding T_conv cells cultured with CD3/CD28/anti-MHC-I antibody-coated beads and T_reg cells at a ratio of 1:1 (G), and as relative suppression (H). Data is from one representative experiment of three with cells derived from different donors. *p < 0.05 (paired Student’s t-test). n.s. = not significant.

FOXP3-mediated regulation of MEOX1 expression was further supported by re-assessing additional datasets combining DNA methylation and FOXP3 ChIP-seq data of human T_reg cells (27, 66). Recent work has highlighted the occurrence of hypomethylation of CpG-rich regions within FOXP3-binding regions in the genome (64, 66) and we identified MEOX1 as one of these genes showing FOXP3 binding at a hypomethylated promoter region (Figure 7C).

To determine whether FOXP3 is directly controlling MEOX1 expression, we performed siRNA-mediated knock-down of FOXP3 in human T_reg cells. This resulted in significantly reduced FOXP3 expression levels (Figures 7D, S5A) and reduced suppressive function (Figure S5B). When we now assessed MEOX1 expression after FOXP3 silencing, we observed significantly lower levels of MEOX1 mRNA (Figure 7D), supporting that MEOX1 expression is directly induced by FOXP3.

Next, we questioned whether MEOX1 also contributed to the reciprocal regulation of FOXP3 expression in T_reg cells. Therefore, we also silenced MEOX1 in T_reg cells (Figure 7E). While we detected a clear downregulation of MEOX1, we could not observe a direct influence of MEOX1 on FOXP3 expression (Figure 7E). To determine how MEOX1 influences potential downstream target genes in T_reg cells independent of FOXP3, we performed prediction of potential MEOX1 binding to genes specific for the cluster “T_reg cell CD3/IL-2”. We identified two potential MEOX1 target genes (RPL27L, PIM1) (67, 68). When we assessed their expression following knockdown of MEOX1 or FOXP3 in T_reg cells, we observed a MEOX1-specific upregulation of RPS27L (Figure 7F) and downregulation of PIM1 (Figure S6) independent of FOXP3, supporting the notion that MEOX1 can act as a TF supporting the transcriptional make-up of human T_reg cells as e.g. for PIM1 has a destabilizing function for FOXP3 as recently described (68). Considering the importance of FOXP3 for T_reg cell suppressive capacity, we investigated whether perturbing MEOX1 expression would also impact T_reg cell function. In line with our previous findings, knockdown of MEOX1 decreased the capacity of T_reg cells to suppress T_conv cell expansion in vitro demonstrating a functional role of MEOX1 in T_reg cell biology downstream of FOXP3 co-governing part of the classical T_reg cell properties (Figures 7G, H).

In summary, we used several bioinformatic approaches to identify MEOX1 as a potential novel TF important for T_reg cells and unveiled FOXP3 as one of the upstream regulators of MEOX1, which further corroborates the observation of a specific expression of MEOX1 in human T_reg cells.

4 Discussion

In the present study we analyzed the transcriptome of human CD4⁺ T cells by taking advantage of a highly diverse dataset. Using data dimensionality reduction methods, we observed that the highest transcriptional difference within the dataset can be attributed to the presence or absence of CD3/CD28 activation of CD4⁺ T_conv cells. Interestingly, these methods also revealed an unexpected close relationship between resting T_reg cells and resting T_conv cells. In several recent reports it was shown that several hundreds of transcripts are differentially expressed between T_reg cells and CD4⁺ T cells, by which these two cell types can be clearly defined as distinct populations at the transcriptional level (10–12, 27). Noteworthy, these reports often focused on one-to-one comparisons between resting CD4⁺ T cells and T_reg cells (11, 27). This approach has the tendency to overestimate the number of cell-type specific genes, as it has become evident over the last years that activation and differentiation of T cells can induce a broad spectrum of genes initially characterized as T_reg cell specific, e.g. CTLA-4. Hence, to our knowledge, the present study is one of the first which explicitly shows the close relationship of the transcriptional profile of human resting T_reg cells and resting T_conv cells, but also describes the distinct transcriptional modules for each subtype taking the enormous differences in gene expression into account that can be induced by T cell activation.

Global analysis of the dataset using reverse network engineering revealed CREB1, E2F3, AHR, STAT1, NFAT5 and NFATC3 as putative master regulators of CD4⁺ T cell functionality. Interestingly, we found that most of these genes were located in dense clusters of highly interconnected TFs within the consensus network. One limitation to this approach is that the importance of some of the classic T cell-related transcription factors, such as NF-κB and AP-1, as multi-protein complexes, cannot be properly captured, as their components besides FOS were not enriched in the analysis. Noteworthy, we observed several of these TF-enriched clusters which were mainly characterized by an accumulation of ZNF TFs. In regard to these observations, we postulate that the regulatory network of CD4⁺ T cells is defined by a highly interconnected TF-network consisting of a few master regulators and a plethora of adjacent ZNF TFs. This is in agreement with the observation that most of the predicted master regulators exhibit binding-motifs in their promoter region for one another, indicating a sophisticated mutual regulation of these TFs (69).

To analyze gene sets which were characteristic for clusters of transcriptionally related CD4⁺ T cells, we applied WGCNA and validated the calculated gene sets using GSEA. Besides the identification of cluster-specific transcriptional programs which have already been described to be central for T cell functionality (e.g. NFKB1 and TNF-related genes found in clusters associated with CD4⁺ T cell activation), gene set analysis led to the identification of MEOX1 in a cluster containing genes mainly expressed in T_reg cells stimulated with CD3 and IL-2. Next, we could show that IL-2 signaling can upregulate expression of MEOX1 in human T_reg cells while CD3/CD28 stimulation even in the presence of IL-2 is able to maintain the high expression of MEOX1 compared to T_conv cells. Our data further indicate that in humans FOXP3 is the direct transcriptional regulator of MEOX1 expression. Although FOXP3 is crucial for the suppressive function of T_reg cells, ectopic expression of FOXP3 in human CD4⁺ T_conv cells can result in induction of hypo-responsiveness and suppression of IL-2 production but might not always lead to the acquisition of suppressor activity which is characteristic for T_reg cells. This is in agreement with previous reports describing CD4⁺ T cells in humans which upregulate the expression of FOXP3 without the acquisition of a suppressive capacity (70, 71). These studies indicated that in humans, factors in addition to FOXP3 are required for the generation of T_reg cell function. To this end, we introduce MEOX1 as such a new candidate TF. The complexity of the interplay of FOXP3 with such additional factors is highlighted by the fact that expression of RPS27L as downstream target of MEOX1 is upregulated after knockdown of FOXP3 in T_reg cells. One further layer required for this fine-tuned transcriptional response of CD4⁺ T cells dedicated towards the T_reg cells fate is the respective epigenetic chromatin landscape. Here, we could show that while accessibility at the promoter region of this TF in T_conv cells is not different to T_reg cells, both histone landscape and DNA methylation pattern are distinct and might be responsible of the specific expression of MEOX1 in T_reg cells. In addition, this epigenetic configuration might explain why ectopic FOXP3 is not able to induce bona fide T_reg cells if expression of TFs like MEOX1 is prevented through epigenetic modifications.

Under the assumption that MEOX1 has an important role in regulating T_reg cell function, we could show that knockdown of MEOX1 in T_reg cells impacts T_reg cell suppressive activity. Furthermore, we could uncover that IL-2, potentially derived from stimulated CD4⁺ T_conv cells, but also other IL-2 producing cells, can upregulate MEOX-1 expression in T_reg cells with the contribution of other factors including costimulation still to be uncovered. With a recently published report of MEOX1 being important for the acquisition of a tumor-infiltrating T_reg cell phenotype (24) it is conceivable to speculate, that MEOX1 in humans is a key factor for the transcriptional acquisition of a T_reg cell effector phenotype downstream of IL-2 signaling. Taken together, by combining transcriptome analysis of a large dataset of T cell states with computational analysis including TF prediction, single-cell RNA-seq data of T cells, assessment of epigenetic regulation as well as in vitro knockdown experiments, we establish MEOX1 as a novel T_reg cell-specific TF. We propose that such T cell state-associated TFs and effector molecules can be identified in other modules established in this publicly available dataset of human T cell states.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accessionnumber(s) can be found below: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE15390, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE17241, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE20995, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE11292, https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE99254, https://ddbj.nig.ac.jp/resource/sra-study/DRP000902, https://www.ncbi.nlm.nih.gov/geo/roadmap/epigenomics/, https://trace.ncbi.nlm.nih.gov/Traces/?view=study&acc=SRP006674.

Ethics statement

The studies involving human participants were reviewed and approved by Ethics committee of the University of Bonn. The patients/participants provided their written informed consent to participate in this study.

Author contributions

Conceptualization: KB, LS, LB, TU, MB. Methodology: KB, LS, MHS, TE, MK, KK, AN, SF, NO, TS. Investigation: KB, LS, MHS, TE, MK, KK, SF, NO, TS. Visualization: KB, LS, LB, TU, MB. Supervision: JH, SS, SCB, LB, TU, MB. Writing - original draft: KB, LS, LB, TU, MB. Writing - review & editing: KB, LS, MHS, TE, MK, SF, RS, NO, SS, KK, AN, JH, TS, SCB, LB, TU, MB.

Funding

MB is supported by the Helmholtz Foundation and the German Research Foundation (DFG) (SFB1454 project number 432325352, IGK2168/2 project number 272482170). MB is a member of the excellence cluster ImmunoSensation2 (EXC2151 project number 390873048) and the EU project SYSCID (grant number 733100). LB is supported by the DFG (ImmuDiet BO 6228/2-1 - Project number 513977171). SCB is supported by NHMRC project grants 339123, 565314.

Acknowledgments

We thank S. Bourry, S. Weber, D. Huesson and M. Kraut for technical assistance.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1107397/full#supplementary-material

References

1. Dong C. Cytokine regulation and function in T cells. Annu Rev Immunol (2021) 39:51–76. doi: 10.1146/annurev-immunol-061020-053702

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Ruterbusch M, Pruner KB, Shehata L, Pepper M. In vivo CD4(+) T cell differentiation and function: revisiting the Th1/Th2 paradigm. Annu Rev Immunol (2020) 38:705–25. doi: 10.1146/annurev-immunol-103019-085803

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Stockinger B, Omenetti S. The dichotomous nature of T helper 17 cells. Nat Rev Immunol (2017) 17:535–44. doi: 10.1038/nri.2017.50

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Vignali DAA, Collison LW, Workman CJ. How regulatory T cells work. Nat Rev Immunol (2008) 8:523–32. doi: 10.1038/nri2343

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Josefowicz SZ, Lu LF, Rudensky AY. Regulatory T cells: mechanisms of differentiation and function. Annu Rev Immunol (2012) 30:531–64. doi: 10.1146/annurev.immunol.25.022106.141623

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Huehn J, Beyer M. Epigenetic and transcriptional control of Foxp3+ regulatory T cells. Semin Immunol (2015) 27:10–8. doi: 10.1016/j.smim.2015.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Savage PA, Klawon DEJ, Miller CH. Regulatory T cell development. Annu Rev Immunol (2020) 38:421–53. doi: 10.1146/annurev-immunol-100219-020937

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Yu J, Peng J, Chi H. Systems immunology: integrating multi-omics data to infer regulatory networks and hidden drivers of immunity. Curr Opin Syst Biol (2019) 15:19–29. doi: 10.1016/j.coisb.2019.03.003

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Xue J, Schmidt SV, Sander J, Draffehn A, Krebs W, Quester I, et al. Transcriptome-based network analysis reveals a spectrum model of human macrophage activation. Immunity (2014) 40:274–88. doi: 10.1016/j.immuni.2014.01.006

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Beyer M, Thabet Y, Muller RU, Sadlon T, Classen S, Lahl K, et al. Repression of the genome organizer SATB1 in regulatory T cells is required for suppressive function and inhibition of effector differentiation. Nat Immunol (2011) 12:898–907. doi: 10.1038/ni.2084

PubMed Abstract | CrossRef Full Text | Google Scholar

11. He F, Chen H, Probst-Kepper M, Geffers R, Eifes S, Sol A.d., et al. PLAU inferred from a correlation network is critical for suppressor function of regulatory T cells. Mol Syst Biol (2012) 8:624–4. doi: 10.1038/msb.2012.56

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Schmidleithner L, Thabet Y, Schonfeld E, Kohne M, Sommer D, Abdullah Z, et al. Enzymatic activity of HPGD in treg cells suppresses tconv cells to maintain adipose tissue homeostasis and prevent metabolic dysfunction. Immunity (2019) 50:1232–1248 e1214. doi: 10.1016/j.immuni.2019.03.014

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Ciofani M, Madar A, Galan C, Sellars M, Mace K, Pauli F, et al. A validated regulatory network for Th17 cell specification. Cell (2012) 151:289–303. doi: 10.1016/j.cell.2012.09.016

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Yosef N, Shalek AK, Gaublomme JT, Jin H, Lee Y, Awasthi A, et al. Dynamic regulatory network controlling TH17 cell differentiation. Nature (2013) 496:461–8. doi: 10.1038/nature11981

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Kanduri K, Tripathi S, Larjo A, Mannerström H, Ullah U, Lund R, et al. Identification of global regulators of T-helper cell lineage specification. Genome Med (2015) 7:122–2. doi: 10.1186/s13073-015-0237-0

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Wei G, Wei L, Zhu J, Zang C, Hu-Li J, Yao Z, et al. Global mapping of H3K4me3 and H3K27me3 reveals specificity and plasticity in lineage fate determination of differentiating CD4+ T cells. Immunity (2009) 30:155–67. doi: 10.1016/j.immuni.2008.12.009

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Farh KK-H, Marson A, Zhu J, Kleinewietfeld M, Housley WJ, Beik S, et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature (2014) 518:337–43. doi: 10.1038/nature13835

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Floess S, Freyer J, Siewert C, Baron U, Olek S, Polansky J, et al. Epigenetic control of the foxp3 locus in regulatory T cells. PloS Biol (2007) 5:e38–8. doi: 10.1371/journal.pbio.0050038

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Walhout AJM. What does biologically meaningful mean? a perspective on gene regulatory network validation. Genome Biol (2011) 12:109–9. doi: 10.1186/gb-2011-12-4-109

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Bayrakli F, Guclu B, Yakicier C, Balaban H, Kartal U, Erguner B, et al. Mutation in MEOX1 gene causes a recessive klippel-feil syndrome subtype. BMC Genet (2013) 14:95. doi: 10.1186/1471-2156-14-95

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Mohamed JY, Faqeih E, Alsiddiky A, Alshammari MJ, Ibrahim NA, Alkuraya FS. Mutations in MEOX1, encoding mesenchyme homeobox 1, cause klippel-feil anomaly. Am J Hum Genet (2013) 92:157–61. doi: 10.1016/j.ajhg.2012.11.016

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Sun L, Burnett J, Gasparyan M, Xu F, Jiang H, Lin CC, et al. Novel cancer stem cell targets during epithelial to mesenchymal transition in PTEN-deficient trastuzumab-resistant breast cancer. Oncotarget (2016) 7:51408–22. doi: 10.18632/oncotarget.9839

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Sun L, Yuan H, Burnett J, Gasparyan M, Zhang Y, Zhang F, et al. MEOX1 promotes tumor progression and predicts poor prognosis in human non-Small-Cell lung cancer. Int J Med Sci (2019) 16:68–74. doi: 10.7150/ijms.27595

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Alvisi G, Termanini A, Soldani C, Portale F, Carriero R, Pilipow K, et al. Multimodal single-cell profiling of intrahepatic cholangiocarcinoma defines hyperactivated tregs as a potential therapeutic target. J Hepatol (2022) 77:1359–72. doi: 10.1016/j.jhep.2022.05.043

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Sadlon TJ, Wilkinson BG, Pederson S, Brown CY, Bresatz S, Gargett T, et al. Genome-wide identification of human FOXP3 target genes in natural regulatory T cells. J Immunol (2010) 185:1071–81. doi: 10.4049/jimmunol.1000082

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Ahlfors H, Limaye A, Elo LL, Tuomela S, Burute M, Gottimukkala KV, et al. SATB1 dictates expression of multiple genes including IL-5 involved in human T helper cell differentiation. Blood (2010) 116:1443–53. doi: 10.1182/blood-2009-11-252205

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Birzele F, Fauti T, Stahl H, Lenter MC, Simon E, Knebel D, et al. Next-generation insights into regulatory T cells: expression profiling and FoxP3 occupancy in human. Nucleic Acids Res (2011) 39:7946–60. doi: 10.1093/nar/gkr444

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Guo X, Zhang Y, Zheng L, Zheng C, Song J, Zhang Q, et al. Global characterization of T cells in non-small-cell lung cancer by single-cell sequencing. Nat Med (2018) 24:978–85. doi: 10.1038/s41591-018-0045-3

PubMed Abstract | CrossRef Full Text | Google Scholar

29. van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res (2008) 9:2579–605.

Google Scholar

30. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodological) (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

CrossRef Full Text | Google Scholar

31. Conway JR, Lex A, Gehlenborg N. UpSetR: an r package for the visualization of intersecting sets and their properties. Bioinformatics (2017) 33:2938–40. doi: 10.1093/bioinformatics/btx364

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Smeekens SP, Ng A, Kumar V, Johnson MD, Plantinga TS, van Diemen C, et al. Functional genomics identifies type I interferon pathway as central for host defense against candida albicans. Nat Commun (2013) 4:1342. doi: 10.1038/ncomms2343

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Langfelder P, Horvath S. WGCNA: an r package for weighted correlation network analysis. BMC Bioinf (2008) 9:559. doi: 10.1186/1471-2105-9-559

CrossRef Full Text | Google Scholar

34. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci United States America (2005) 102:15545–50. doi: 10.1073/pnas.0506580102

CrossRef Full Text | Google Scholar

35. Janky R, Verfaillie A, Imrichova H, Van de Sande B, Standaert L, Christiaens V, et al. iRegulon: from a gene list to a gene regulatory network using large motif and track collections. PloS Comput Biol (2014) 10:e1003731. doi: 10.1371/journal.pcbi.1003731

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Floratos A, Smith K, Ji Z, Watkinson J, Califano A. geWorkbench: an open source platform for integrative genomics. Bioinformatics (2010) 26:1779–80. doi: 10.1093/bioinformatics/btq282

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Margolin AA, Nemenman I, Basso K, Wiggins C, Stolovitzky G, Dalla Favera R, et al. ARACNE: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinf (2006) 7 Suppl 1:S7. doi: 10.1186/1471-2105-7-S1-S7

CrossRef Full Text | Google Scholar

38. Aluru M, Zola J, Nettleton D, Aluru S. Reverse engineering and analysis of large genome-scale gene networks. Nucleic Acids Res (2013) 41:e24. doi: 10.1093/nar/gks904

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Wang M, Verdier J, Benedito VA, Tang Y, Murray JD, Ge Y, et al. LegumeGRN: a gene regulatory network prediction server for functional and comparative studies. PloS One (2013) 8:e67434. doi: 10.1371/journal.pone.0067434

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Faith JJ, Hayete B, Thaden JT, Mogno I, Wierzbowski J, Cottarel G, et al. Large-Scale mapping and validation of escherichia coli transcriptional regulation from a compendium of expression profiles. PloS Biol (2007) 5:e8. doi: 10.1371/journal.pbio.0050008

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P. Inferring regulatory networks from expression data using tree-based methods. PloS One (2010) 5:e12776. doi: 10.1371/journal.pone.0012776

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Marbach D, Costello JC, Kuffner R, Vega NM, Prill RJ, Camacho DM, et al. Wisdom of crowds for robust gene network inference. Nat Methods (2012) 9:796–804. doi: 10.1038/nmeth.2016

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res (2009) 37:W305–311. doi: 10.1093/nar/gkp427

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Tranchevent LC, Barriot R, Yu S, Van Vooren S, Van Loo P, Coessens B, et al. ENDEAVOUR update: a web resource for gene prioritization in multiple species. Nucleic Acids Res (2008) 36:W377–384. doi: 10.1093/nar/gkn325

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Morris JH, Apeltsin L, Newman AM, Baumbach J, Wittkop T, Su G, et al. clusterMaker: a multi-algorithm clustering plugin for cytoscape. BMC Bioinf (2011) 12:436. doi: 10.1186/1471-2105-12-436

CrossRef Full Text | Google Scholar

46. Roadmap Epigenomics C, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, et al. Integrative analysis of 111 reference human epigenomes. Nature (2015) 518:317–30. doi: 10.1038/nature14248

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Beyer M, Abdullah Z, Chemnitz JM, Maisel D, Sander J, Lehmann C, et al. Tumor-necrosis factor impairs CD4 T cell-mediated immunological control in chronic viral infection. Nat Immunol (2016) 17:593–603. doi: 10.1038/ni.3399

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Yang BH, Hagemann S, Mamareli P, Lauer U, Hoffmann U, Beckstette M, et al. Foxp3(+) T cells expressing RORgammat represent a stable regulatory T-cell effector lineage with enhanced suppressive capacity during intestinal inflammation. Mucosal Immunol (2016) 9:444–57. doi: 10.1038/mi.2015.74

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Fulton DL, Sundararajan S, Badis G, Hughes TR, Wasserman WW, Roach JC, et al. TFCat: the curated catalog of mouse and human transcription factors. Genome Biol (2009) 10:R29. doi: 10.1186/gb-2009-10-3-r29

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Ho IC, Tai TS, Pai SY. GATA3 and the T-cell lineage: essential functions before and after T-helper-2-cell differentiation. Nat Rev Immunol (2009) 9:125–35. doi: 10.1038/nri2476

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Yu Q, Sharma A, Oh SY, Moon HG, Hossain MZ, Salay TM, et al. T Cell factor 1 initiates the T helper type 2 fate by inducing the transcription factor GATA-3 and repressing interferon-gamma. Nat Immunol (2009) 10:992–9. doi: 10.1038/ni.1762

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Karantanos T, Chistofides A, Barhdan K, Li L, Boussiotis VA. Regulation of T cell differentiation and function by EZH2. Front Immunol (2016) 7:172. doi: 10.3389/fimmu.2016.00172

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Kitagawa Y, Ohkura N, Kidani Y, Vandenbon A, Hirota K, Kawakami R, et al. Guidance of regulatory T cell development by Satb1-dependent super-enhancer establishment. Nat Immunol (2017) 18:173–83. doi: 10.1038/ni.3646

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Oukka M, Ho IC, de la Brousse FC, Hoey T, Grusby MJ, Glimcher LH. The transcription factor NFAT4 is involved in the generation and survival of T cells. Immunity (1998) 9:295–304. doi: 10.1016/S1074-7613(00)80612-3

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Humbert PO, Verona R, Trimarchi JM, Rogers C, Dandapani S, Lees JA. E2f3 is critical for normal cellular proliferation. Genes Dev (2000) 14:690–703. doi: 10.1101/gad.14.6.690

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Afkarian M, Sedy JR, Yang J, Jacobson NG, Cereb N, Yang SY, et al. T-Bet is a STAT1-induced regulator of IL-12R expression in naive CD4+ T cells. Nat Immunol (2002) 3:549–57. doi: 10.1038/ni794

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Trama J, Go WY, Ho SN. The osmoprotective function of the NFAT5 transcription factor in T cell development and activation. J Immunol (2002) 169:5477–88. doi: 10.4049/jimmunol.169.10.5477

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Quintana FJ, Basso AS, Iglesias AH, Korn T, Farez MF, Bettelli E, et al. Control of t(reg) and T(H)17 cell differentiation by the aryl hydrocarbon receptor. Nature (2008) 453:65–71. doi: 10.1038/nature06880

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Hernandez JB, Chang C, LeBlanc M, Grimm D, Le Lay J, Kaestner KH, et al. The CREB/CRTC2 pathway modulates autoimmune disease by promoting Th17 differentiation. Nat Commun (2015) 6:7216. doi: 10.1038/ncomms8216

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Zhu J, Yamane H, Paul WE. Differentiation of effector CD4 T cell populations. Annu Rev Immunol (2010) 28:445–89. doi: 10.1146/annurev-immunol-030409-101212

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Maciolek JA, Pasternak JA, Wilson HL. Metabolism of activated T lymphocytes. Curr Opin Immunol (2014) 27:60–74. doi: 10.1016/j.coi.2014.01.006

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Macian F. NFAT proteins: key regulators of T-cell development and function. Nat Rev Immunol (2005) 5:472–84. doi: 10.1038/nri1632

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Kitagawa Y, Sakaguchi S. Molecular control of regulatory T cell development and function. Curr Opin Immunol (2017) 49:64–70. doi: 10.1016/j.coi.2017.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Zhang Y, Maksimovic J, Naselli G. Genome-wide DNA methylation analysis identifies hypomethylated genes regulated by FOXP3 in human regulatory T cells. Blood (2013) 122:2823–36. doi: 10.1182/blood-2013-02-481788

PubMed Abstract | CrossRef Full Text | Google Scholar

65. Morikawa H, Ohkura N, Vandenbon A, Itoh M, Nagao-Sato S, Kawaji H, et al. Differential roles of epigenetic changes and Foxp3 expression in regulatory T cell-specific transcriptional regulation. Proc Natl Acad Sci United States America (2014) 111:5289–94. doi: 10.1073/pnas.1312717110

CrossRef Full Text | Google Scholar

66. Ohkura N, Yasumizu Y, Kitagawa Y, Tanaka A, Nakamura Y, Motooka D, et al. Regulatory T cell-specific epigenomic region variants are a key determinant of susceptibility to common autoimmune diseases. Immunity (2020) 52:1119–1132 e1114. doi: 10.1016/j.immuni.2020.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Joller N, Lozano E, Burkett PR, Patel B, Xiao S, Zhu C, et al. Treg cells expressing the coinhibitory molecule TIGIT selectively inhibit proinflammatory Th1 and Th17 cell responses. Immunity (2014) 40:569–81. doi: 10.1016/j.immuni.2014.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

68. Li Z, Lin F, Zhuo C, Deng G, Chen Z, Yin S, et al. PIM1 kinase phosphorylates the human transcription factor FOXP3 at serine 422 to negatively regulate its activity under inflammation. J Biol Chem (2014) 289:26872–81. doi: 10.1074/jbc.M114.586651

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Rothenberg EV. The chromatin landscape and transcription factors in T cell programming. Trends Immunol (2014) 35:195–204. doi: 10.1016/j.it.2014.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Allan SE, Crome SQ, Crellin NK, Passerini L, Steiner TS, Bacchetta R, et al. Activation-induced FOXP3 in human T effector cells does not suppress proliferation or cytokine production. Int Immunol (2007) 19:345–54. doi: 10.1093/intimm/dxm014

PubMed Abstract | CrossRef Full Text | Google Scholar

71. Wang J, Ioan-Facsinay A, van der Voort EI, Huizinga TW, Toes RE. Transient expression of FOXP3 in human activated nonregulatory CD4+ T cells. Eur J Immunol (2007) 37:129–38. doi: 10.1002/eji.200636435

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Treg cells, MEOX1, Foxp3, regulatory T cells, human CD4

Citation: Baßler K, Schmidleithner L, Shakiba MH, Elmzzahi T, Köhne M, Floess S, Scholz R, Ohkura N, Sadlon T, Klee K, Neubauer A, Sakaguchi S, Barry SC, Huehn J, Bonaguro L, Ulas T and Beyer M (2023) Identification of the novel FOXP3-dependent T_reg cell transcription factor MEOX1 by high-dimensional analysis of human CD4⁺ T cells. Front. Immunol. 14:1107397. doi: 10.3389/fimmu.2023.1107397

Received: 24 November 2022; Accepted: 27 June 2023;
Published: 25 July 2023.

Edited by:

Zewen Kelvin Tuong, The University of Queensland, Australia

Reviewed by:

James William Wells, The University of Queensland, Australia
Ethan Menahem Shevach, National Institutes of Health (NIH), United States
Ye Zheng, Salk Institute for Biological Studies, United States

Copyright © 2023 Baßler, Schmidleithner, Shakiba, Elmzzahi, Köhne, Floess, Scholz, Ohkura, Sadlon, Klee, Neubauer, Sakaguchi, Barry, Huehn, Bonaguro, Ulas and Beyer. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Marc Beyer, bWFyYy5iZXllckBkem5lLmRl

^†These authors have contributed equally to this work and share first authorship

‡These authors share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Identification of the novel FOXP3-dependent Treg cell transcription factor MEOX1 by high-dimensional analysis of human CD4+ T cells

1 Introduction

2 Materials and methods

Key resources table

2.1 Experimental model and subject details

2.1.1 Human subjects

2.1.2 Cell lines

2.2 Method details

2.2.1 Dataset compilation and primarydata handling

2.2.2 Principal component analysis and t-distributed stochastic neighbor embedding

2.2.3 Correlation coefficient matrices combined with hierarchical clustering

2.2.4 ANOVA calculation for differential gene identification

2.2.5 K-means clustering combined with hierarchical clustering

2.2.6 Self-organizing map clustering

2.2.7 Weighted gene co-expression network analysis

2.2.8 Gene set enrichment analysis

2.2.9 Prediction of potential FOXP3 targets

2.2.10 Prediction of potential MEOX1 targets

2.2.11 Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNe)

2.2.12 Tool for inferring network of genes (TINGe)

2.2.13 Context likelihood of relatedness (CLR)

2.2.14 Gene network inference with ensemble of trees (GENIE3)

2.2.15 Basic correlation

2.2.16 Consensus network

2.2.17 Candidate gene prioritization

2.2.18 Identification of target genes of prioritized TFs using iRegulon

2.2.19 Clustering of the consensus network using Markov clustering algorithm (MCL)

2.2.20 Single-cell RNA-seq analysis

2.2.21 Analysis of histone modifications of the genomic MEOX1 locus

2.2.22 Isolation of human Treg and Tconv cells from buffy coats

2.2.23 Transfection of MEOX1

2.2.24 Flow cytometric analysis

2.2.25 IL-2 stimulation

2.2.26 RNA isolation, cDNA synthesis and qRT-PCR

2.2.27 siRNA-mediated knockdown

2.2.28 DNA methylation analysis

2.2.29 Immunoblotting

2.3 Quantification and statistical analysis

2.4 Data and software availability

3 Results

3.1 Transcriptome analysis of CD4+ Tconv and Treg cells reveals activation-dependent clustering of cells

3.2 Reduction of data complexity using correlation coefficient matrices identifies subclusters of transcriptionally related CD4+ T cells

3.3 Identification of common denominators of CD4+ T cell functionality using reverse network engineering

3.4 Identification of genes specifically associated with distinct CCM clusters

3.5 Further characterization of MEOX1 expression confirms its Treg cell specificity

3.6 Single-cell RNA-seq predicts a close co-regulation of MEOX1 and FOXP3 expression

3.7 Epigenetic regulation of MEOX1 expression in CD4+ T cells

3.8 Identification of FOXP3 as a putative upstream regulator of MEOX1 expression

4 Discussion

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good

Identification of the novel FOXP3-dependent T_reg cell transcription factor MEOX1 by high-dimensional analysis of human CD4⁺ T cells

2.2.22 Isolation of human T_reg and T_conv cells from buffy coats

3.1 Transcriptome analysis of CD4⁺ T_conv and T_reg cells reveals activation-dependent clustering of cells

3.2 Reduction of data complexity using correlation coefficient matrices identifies subclusters of transcriptionally related CD4⁺ T cells

3.3 Identification of common denominators of CD4⁺ T cell functionality using reverse network engineering

3.5 Further characterization of MEOX1 expression confirms its T_reg cell specificity

3.7 Epigenetic regulation of MEOX1 expression in CD4⁺ T cells