- 1Institute for Transfusion Medicine and Gene Therapy, Medical Center-University of Freiburg, Freiburg, Germany
- 2Center for Chronic Immunodeficiency (CCI), Medical Center-University of Freiburg, Freiburg, Germany
- 3Ph.D. Program, Faculty of Biology, University of Freiburg, Freiburg, Germany
- 4Institute of Medical Bioinformatics and Systems Medicine, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- 5German Cancer Consortium (DKTK), Partner Site Freiburg, and German Cancer Research Center (DKFZ), Heidelberg, Germany
- 6Faculty of Medicine, University of Freiburg, Freiburg, Germany
- 7Comprehensive Cancer Center Freiburg (CCCF), Medical Center-University of Freiburg, Freiburg, Germany
Transcription activator-like effector nucleases (TALENs) are programmable nucleases that have entered the clinical stage. Each subunit of the dimer consists of a DNA-binding domain composed of an array of TALE repeats fused to the catalytically active portion of the FokI endonuclease. Upon DNA-binding of both TALEN arms in close proximity, the FokI domains dimerize and induce a staggered-end DNA double strand break. In the present study, we describe the implementation and validation of TALEN-specific CAST-Seq (T-CAST), a pipeline based on CAST-Seq that identifies TALEN-mediated off-target effects, nominates off-target sites with high fidelity, and predicts the TALEN pairing conformation leading to off-target cleavage. We validated T-CAST by assessing off-target effects of two promiscuous TALENs designed to target the CCR5 and TRAC loci. Expression of these TALENs caused high levels of translocations between the target sites and various off-target sites in primary T cells. Introduction of amino acid substitutions into the FokI domains, which render TALENs obligate-heterodimeric (OH-TALEN), mitigated these off-target effects without loss of on-target activity. Our findings highlight the significance of T-CAST to assess off-target effects of TALEN designer nucleases and to evaluate mitigation strategies, and advocate the use of obligate-heterodimeric TALEN scaffolds for therapeutic genome editing.
1 Introduction
Since the advent of CRISPR-Cas technology for genome engineering in 2012 (Gasiunas et al., 2012; Jinek et al., 2012), genome editing has gradually moved towards clinical application (Cornu et al., 2017). Despite being used less frequently in research and development, first-in-line genome editing tools, such as zinc-finger nucleases (ZFNs) and transcription activator-like effector nucleases (TALENs), have been applied in numerous clinical trials to treat, inter alia, infection with human immunodeficiency virus (HIV) (Tebas et al., 2014), hematologic malignancies (Qasim et al., 2017; Benjamin et al., 2022), mucopolysaccharidosis and hemophilia (Harmatz et al., 2022), as well as sickle cell disease (NCT03653247). In contrast to the RNA-guided nucleases CRISPR-Cas9 and CRISPR-Cas12a/Cpf1, ZFNs and TALENs act as dimers and are all-protein. They consist of an engineered DNA-binding domain (DBD) fused to the C-terminal nuclease portion derived from the FokI endonuclease (Porteus and Carroll, 2005; Cathomen and Joung, 2008; Urnov et al., 2010; Gaj et al., 2013). The DBD of a TALEN arm is typically composed of 15–18 TALE repeats, each consisting of 34 amino acids (Christian et al., 2010; Miller et al., 2011; Mussolino et al., 2011). The two amino acids in positions 12 and 13, the so-called repeat-variable di-residues (RVDs), code for the binding specificity of each TALE repeat to a respective DNA base in a simple 1:1 code (Boch et al., 2009; Moscou and Bogdanove, 2009). However, although each RVD typically favors one particular base, there is some promiscuity in DNA binding, meaning that a given RVD can usually recognize more than one base (Streubel et al., 2012; Guilinger et al., 2014; Juillerat et al., 2014). To induce a DNA double strand break (DSB), a TALEN pair is designed such that the two DBDs bind opposite DNA strands of the target site in a tail-to-tail configuration, separated by a spacer of 10–25 bp (Mussolino et al., 2011; Christian et al., 2012).
Upon concomitant binding of both TALEN arms, the two FokI domains dimerize and introduce a staggered-end DNA double strand break (Christian et al., 2010).
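The 1:1 RVD code and its promiscuity can be illustrated with a minimal sketch. The four preferred pairings (NI→A, HD→C, NG→T, NN→G) are the canonical code; the tolerated secondary base for NN and the five-repeat array are illustrative assumptions, not measured affinities or a TALEN used in this study.

```python
# Canonical RVD-to-base preferences (Boch et al., 2009; Moscou and Bogdanove, 2009).
RVD_PREFERRED = {"NI": "A", "HD": "C", "NG": "T", "NN": "G"}
# Illustrative assumption: NN is commonly reported to also accept A.
RVD_TOLERATED = {"NN": {"A"}}


def preferred_target(rvds):
    """Return the DNA sequence (5'->3') that a TALE repeat array prefers."""
    return "".join(RVD_PREFERRED[rvd] for rvd in rvds)


def count_mismatches(rvds, site):
    """Count positions whose base is neither preferred nor tolerated."""
    if len(rvds) != len(site):
        raise ValueError("RVD array and site must have equal length")
    mismatches = 0
    for rvd, base in zip(rvds, site.upper()):
        allowed = {RVD_PREFERRED[rvd]} | RVD_TOLERATED.get(rvd, set())
        if base not in allowed:
            mismatches += 1
    return mismatches


# Hypothetical five-repeat array, for illustration only.
example_array = ["NI", "HD", "NG", "NN", "HD"]
print(preferred_target(example_array))
print(count_mismatches(example_array, "ACTAC"))
```

A binary preferred/tolerated split is a simplification; a graded substitution matrix, as used later in the T-CAST pipeline, captures varying affinities more faithfully.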
In various protocols to manufacture cells for clinical applications, TALENs have proven themselves as a highly efficient gene editing tool, reaching editing efficiencies of >80% in multiple primary human cells (Gautron et al., 2017; Alzubi et al., 2021; Romito et al., 2021; Yang et al., 2022). Also, much effort has been invested to improve the specificity of TALENs for clinical application. This includes the optimization of the length and composition of the linker that connects the TALE to the FokI domain (Mussolino et al., 2011; Christian et al., 2012; Guilinger et al., 2014), the expansion of the RVD repertoire (Juillerat et al., 2015; Miller et al., 2015), as well as preventing homodimerization by creating obligate heterodimeric FokI domains (Miller et al., 2007; Szczepek et al., 2007; Söllü et al., 2010; Doyon et al., 2011; Cade et al., 2012; Nakajima and Yaoita, 2013; Schwarze et al., 2021).
In recent years, genotoxicity, the umbrella term for all unwanted and potentially harmful gene editing events, has become a focal point in the gene editing field aimed towards clinical applications. Several methods have been developed to predict and detect off-target editing, which can be subdivided into three main categories: i) in silico prediction of off-target sites based on homology to the target site, ii) methods to determine off-target cleavage in vitro, and iii) assays to detect off-target effects in cellula, as reviewed in (Kim et al., 2019; Wienert and Cromer, 2022).
In silico prediction algorithms for TALENs, like PROGNOS (Fine et al., 2014), compare a given target sequence to the whole genome and return sequences with high similarity, which are therefore likely to be inadvertently targeted. They have the advantage of being easy to use but can suffer from a lack of sensitivity. In vitro based methods do not exist for TALENs because a sufficient amount of purified TALEN protein needed for such assays cannot be produced thus far. Cell-based methods and in situ assays generally return a low false positive rate, and have been employed successfully to detect TALEN off-target activity (Frock et al., 2015; Romito et al., 2021; Turchiano et al., 2021; Liu et al., 2022). Two of the cell-based assays detect chromosomal rearrangements between the on-target site and an off-target site (Frock et al., 2015; Turchiano et al., 2021) and can use this information to nominate the putative off-target site. CAST-Seq in particular can further classify the structural variations into off-target mediated translocations (OMTs), homology-mediated translocations (HMTs), and large deletions/inversions at the on-target site (Turchiano et al., 2021).
The first CAST-Seq pipeline for the analysis of designer nucleases was based on the analysis of off-target effects of several CRISPR-Cas nucleases and a single TALEN. Revisiting the pipeline with a different TALEN pair revealed some flaws in its application to TALENs. Because the original CAST-Seq pipeline was optimized for the evaluation of monomeric CRISPR-Cas nucleases, its use with dimeric TALENs can lead to false classification of translocation events and incorrect annotation of the off-target sites. This prompted us to implement T-CAST as an extension to CAST-Seq, specifically dedicated to and optimized for the analysis of TALEN-mediated off-target events. We improved off-target annotation by implementing a new substitution matrix combined with coverage plot analyses, and validated the T-CAST pipeline with previously published TALEN pairs targeting the clinically relevant loci CCR5 and TRAC (Mussolino et al., 2014; Alzubi et al., 2021). In this context, we verified that obligate-heterodimeric TALENs outperformed the wild type scaffold in terms of specificity without impact on on-target activity. Taken together, T-CAST is a novel tool for the unbiased evaluation of on- and off-target effects induced by TALENs, with the possibility to extend its use to other dimeric nucleases.
2 Materials and methods
2.1 TALEN design and production
All wild type TALEN pairs used were previously described. TRAC-targeting TALE nucleases were used in a proof-of-concept study to show feasibility of large scale production of off-the-shelf CAR T cells (Alzubi et al., 2021). TALE nucleases targeting CCR5 have been investigated in a comparative study benchmarking them against ZFNs (Mussolino et al., 2014). OH-TALEN encoding plasmids were produced in two steps by conventional cloning. First, the wild type FokI domains were excised using restriction enzymes PmeI and BamHI (NEB). Second, FokI domains encoding for OH-substitutions, ordered as gBlocks from IDT, were inserted using the NEBuilder® HiFi DNA Assembly Master Mix following manufacturer instructions. All TALEN-encoding mRNAs were produced by in vitro transcription using the HiScribe™ T7 ARCA mRNA Kit with tailing (NEB) following manufacturer instructions. Target sequences of TALENs used in this study are shown in Supplementary Table S1.
2.2 PBMC isolation
PBMCs were isolated from leukocyte reduction system (LRS) chambers obtained from the Blood Donation Center (University of Freiburg, Medical Center) by density gradient centrifugation using Ficoll. Appropriate aliquots were resuspended in CryoStor™ CS10 (StemCell Technologies) for long-term storage in liquid nitrogen.
2.3 T cell activation and culture
PBMCs/T cells were cultured at 37°C with 5% CO2. Upon thawing, PBMCs were washed with PBS (300 × g, 5 min) and resuspended at a density of 2 × 10⁶ cells/ml in X-VIVO™ 15 (Lonza) medium supplemented with 200 U/ml rhIL-2 (Immunotools) and seeded into 24-well plates (1 ml/well). Adherent cells were allowed to attach for 4 h. After 4 h, non-adherent cells were collected, counted, adjusted to a cell density of 1 × 10⁶ cells/ml with X-VIVO™ 15 (Lonza) medium supplemented with 200 U/ml rhIL-2 (Immunotools) and re-seeded into 24-well plates (1 ml/well). To each well, 5 µl of ImmunoCult™ Human CD3/CD28/CD2 T Cell Activator (StemCell Technologies) was added and the T cells were activated for 72–96 h prior to gene editing.
2.4 Gene editing of T cells
Prior to gene editing, activation of T cells was assessed by staining for CD25. Downstream experiments were performed only when T cells were highly activated (>85% CD25-expression). For small scale transfer of TALEN mRNAs, 1 × 10⁶ cells were harvested by centrifugation (300 × g, 5 min) and the supernatant removed. Cells were resuspended in 50 µl CliniMACS® Electroporation Buffer (Miltenyi Biotec). Just before electroporation, 7.5 µg each of the left and right (or left/left, right/right) TALEN mRNAs were mixed with the resuspended T cells and electroporated using a CliniMACS® Prodigy with Electroporator unit (Miltenyi Biotec) with the previously described Setting 3 (Alzubi et al., 2021). Post electroporation, cells were recovered in 400 µl of pre-warmed X-VIVO™ 15 (Lonza) medium supplemented with 200 U/ml rhIL-2 (Immunotools) and seeded into two wells of U-shaped 96-well plates. Half of the cells were subjected to a transient low temperature shift to 32°C for 24 h before being shifted to 37°C. The other half was directly cultured at 37°C. Approximately half of the medium was changed every 2 days and the cells were split every 3–4 days.
2.5 Antibodies and surface staining for flow cytometry
All flow cytometry measurements were carried out using a BD Accuri C6 device. Cells were stained for 45–60 min at 4°C–8°C. T cell activation was assessed by staining with anti-human CD3-APC (Miltenyi Biotec, clone BW264/56) and anti-human CD25-PE (Miltenyi Biotec, clone 4E3). TRAC knock-out was assessed 6–7 days after gene editing by staining with anti-human TCRα/β-PE (Miltenyi Biotec, clone BW242/412) and anti-human CD3-APC (Miltenyi Biotec, clone BW264/56).
2.6 Intracellular staining of TALEN
1 × 10⁶ cells were electroporated with 7.5 μg of left and right TALEN mRNAs. After 4 h of incubation at 37°C, 1 × 10⁵ cells were harvested (300 × g, 5 min) and washed with 500 μL FACS buffer (PBS supplemented with 5% FCS). Cells were subsequently treated with 100 μL BD Cytofix/Cytoperm™ (BD Biosciences). After 30 min incubation on ice, the cells were washed twice with 500 μL BD Perm/Wash Buffer (BD Biosciences). Permeabilized/fixed cells were stained with 50 μL rabbit anti-RVD antibody (Yang et al., 2022) (1:250) via incubation for 30 min on ice. After another wash step with 500 μL BD Perm/Wash Buffer, 100 μL of 1:500 diluted secondary goat anti-rabbit antibody (Life Technologies, clone A-11034) was added. Following an incubation for 30 min on ice, the cells were washed once with BD Perm/Wash Buffer and finally with FACS buffer. Afterwards, the cells were analyzed using a BD Accuri C6 device.
2.7 T-CAST library preparation
Library preparations were performed essentially as previously described (Turchiano et al., 2021) using samples that showed particularly high editing as determined by T7E1 assay. In contrast to the original protocol, agarose gel-extraction using the QIAquick Gel Extraction Kit (QIAGEN) was performed on PCR fragments with a size of 200–500 bp originating from PCRII. This step was included to remove non-informative, short PCR fragments (<200 bp) prior to the barcoding step. NGS libraries were sequenced by Genewiz (part of Azenta Life Sciences) on Illumina HiSeq or NovaSeq with 2 × 150 bp read lengths. Only sites detected as significant hits in two technical replicates are depicted throughout this study. CAST-Seq and T-CAST analysis results for all TALENs targeting CCR5 and TRAC are provided in Supplementary Tables S4–S10. Oligonucleotides used for T-CAST are listed in Supplementary Table S11.
2.8 T-CAST pipeline
Every single replicate, treated and untreated control, is processed independently from the alignment up to the cluster definition, as described in (39). Then, an overlap analysis is performed to unify the clusters from several replicates. Clusters overlapping or separated by less than 1,500 bp are merged and considered as a single translocation event [see (Turchiano et al., 2021) for details]. Based on the number of replicates, the user can define the minimum number of replicates where the site was found, and the minimum number of samples in which the site was significantly different from untreated control (i.e., the number of reads was significantly higher in treated vs. untreated based on Fisher’s exact test).
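The cluster-unification step can be sketched as a simple interval merge. This is a minimal illustration under the stated 1,500 bp rule; the function name, the `(chromosome, start, end)` tuple layout, and the example coordinates are assumptions and do not reflect the actual pipeline code.

```python
def merge_clusters(clusters, max_gap=1500):
    """Merge clusters that overlap or lie less than max_gap bp apart.

    clusters: iterable of (chromosome, start, end) tuples pooled from
    all replicates. Returns a sorted list of merged tuples.
    """
    merged = []
    for chrom, start, end in sorted(clusters):
        if merged and merged[-1][0] == chrom and start - merged[-1][2] < max_gap:
            # Same chromosome and close enough: extend the previous cluster.
            prev_chrom, prev_start, prev_end = merged[-1]
            merged[-1] = (prev_chrom, prev_start, max(prev_end, end))
        else:
            merged.append((chrom, start, end))
    return merged


# Hypothetical clusters from two replicates.
pooled = [
    ("chr3", 1000, 1200),
    ("chr3", 2500, 2700),   # 1,300 bp from the first cluster: merged
    ("chr3", 5000, 5100),   # 2,300 bp from the merged cluster: kept separate
    ("chr4", 1100, 1300),
]
print(merge_clusters(pooled))
```

Each merged cluster would then be filtered by the user-defined minimum number of replicates and by the per-sample significance test against the untreated control, which is not reproduced here.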
Barcode hopping: We introduced an additional filter to eliminate artifacts generated by barcode hopping events. Barcode hopping events are identified by their low reads:hits ratio in comparison to real translocation events: a site is flagged when its log10(reads:hits) value falls in the lower tail of the distribution, i.e., below Q1 − 2.5 × IQ (Q1, first quartile; IQ, interquartile range).
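A minimal sketch of such an outlier filter is given below. It assumes that IQ denotes the interquartile range and uses the default quartile method of Python's `statistics.quantiles`; the quartile definition used by the pipeline itself is not stated, so results near the threshold may differ. The function name and example counts are hypothetical.

```python
import math
import statistics


def barcode_hopping_flags(sites, k=2.5):
    """Flag sites whose log10(reads:hits) lies below Q1 - k * IQR.

    sites: list of (reads, hits) tuples with hits > 0.
    Returns a list of booleans, True meaning "likely barcode hopping".
    """
    log_ratios = [math.log10(reads / hits) for reads, hits in sites]
    q1, _median, q3 = statistics.quantiles(log_ratios, n=4)
    threshold = q1 - k * (q3 - q1)
    return [ratio < threshold for ratio in log_ratios]


# Hypothetical counts: the last site has one read per hit and is flagged.
example_sites = [(100, 10), (120, 10), (90, 10), (110, 10),
                 (105, 10), (95, 10), (115, 10), (10, 10)]
print(barcode_hopping_flags(example_sites))
```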
Coverage: For the remaining sites, the read coverage is calculated in order to identify highly covered regions. Sites are divided into 100 bins of equal size. For each site, the coordinates of the bin with the highest coverage across all replicates are used for downstream analysis instead of the whole site coordinates. This new feature restricts the alignment against the target sequence to a smaller, highly covered region, which makes the alignment more specific and less prone to identification of false-positive OMTs/HMTs.
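The binning step can be sketched as follows. This is an illustration only: it assumes per-base coverage vectors for each replicate and simply sums them across replicates, whereas the pipeline may normalize coverage per replicate first. The ±100 bp flank follows the window described in the Results; all names and example values are hypothetical.

```python
def best_bin(start, end, replicates, n_bins=100):
    """Return (bin_start, bin_end) of the bin with the highest mean coverage.

    replicates: list of per-base coverage lists, each of length end - start.
    Coverage is summed across replicates (an assumption of this sketch).
    """
    length = end - start
    if any(len(cov) != length for cov in replicates):
        raise ValueError("each coverage vector must span the whole site")
    best, best_score = None, -1.0
    for i in range(n_bins):
        lo = i * length // n_bins
        hi = (i + 1) * length // n_bins
        if hi == lo:
            continue  # site shorter than n_bins: skip empty bins
        score = sum(sum(cov[lo:hi]) for cov in replicates) / (hi - lo)
        if score > best_score:
            best, best_score = (start + lo, start + hi), score
    return best


def alignment_window(bin_coords, flank=100):
    """Extend the best bin by a flank on both sides for the alignment."""
    return (bin_coords[0] - flank, bin_coords[1] + flank)


# Hypothetical 1 kb site with a coverage peak at offsets 500-509.
rep1 = [1] * 1000
rep2 = [2] * 1000
rep1[500:510] = [50] * 10
rep2[500:510] = [40] * 10
peak = best_bin(1000, 2000, [rep1, rep2])
print(peak, alignment_window(peak))
```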
Alignment: A new TALEN-specific substitution matrix was implemented (Supplementary Table S12), inspired by (18), and the analysis was restricted to four TALEN combinations: LF.LR, LF.RR, RF.RR, and RF.LR (L/RX, left/right; XF/R, forward/reverse). In order to determine the best combination, i.e., the one that most likely cleaves an off-target site, spacer lengths from 8 to 28 bp are tested for each combination. To this end, artificial sequences representing the binding sites of two TALEN arms separated by a spacer "Nk" of k nucleotides (k = 8, …, 28) are aligned to the site. N can match any base without cost; therefore, the length of the spacer does not influence the alignment score by itself. An example sequence is shown in Supplementary Figure S2B. The alignment score is calculated using the pairwiseAlignment function from the Biostrings R package with a "local-global" alignment type. The different TALEN combinations and spacer lengths are first selected based on two criteria: a) the first (5′) aligned base is a T, and b) the last (3′) aligned base is an A. The remaining candidates are then ordered by alignment score, and the one with the highest score is defined as the most probable TALEN combination and spacer length for a given target site. The same approach was applied to randomly selected regions across the entire genome to determine the overall distribution of the alignment score on random sequences. p values of a given combination and spacer length are assessed based on the empirical cumulative distribution function. Sites with p values below 0.05 are considered OMTs. HMTs and NBSs were classified in the same way as described in (39).
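How the candidate query sequences and the empirical p value might be constructed can be sketched as below. The binding-site strings are hypothetical placeholders, the reverse arm is assumed to be represented by the reverse complement of its binding site, and the p value is taken as the fraction of random-region scores at or above the observed score. The alignment itself (Biostrings pairwiseAlignment with the TALEN-specific substitution matrix) is not reproduced.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def revcomp(seq):
    """Reverse complement of a DNA sequence (N stays N)."""
    return seq.translate(COMPLEMENT)[::-1]


def build_queries(left_site, right_site, spacers=range(8, 29)):
    """Build one artificial query per TALEN combination and spacer length.

    Returns a dict keyed by (combination, spacer_length), e.g. ("LF.RR", 14).
    """
    arms = {"L": left_site, "R": right_site}
    combos = [("L", "L"), ("L", "R"), ("R", "R"), ("R", "L")]
    queries = {}
    for fwd, rev in combos:
        for k in spacers:
            name = f"{fwd}F.{rev}R"
            # Forward arm, N spacer of length k, reverse arm on the other strand.
            queries[(name, k)] = arms[fwd] + "N" * k + revcomp(arms[rev])
    return queries


def empirical_p(score, random_scores):
    """Fraction of random-region alignment scores >= the observed score."""
    return sum(s >= score for s in random_scores) / len(random_scores)


# Hypothetical, shortened binding sites (each starts with the 5' T).
queries = build_queries("TACGT", "TGGCA")
print(len(queries), queries[("LF.RR", 8)])
print(empirical_p(9, list(range(1, 11))))
```

Because every forward site starts with T, every query built this way starts with T and ends with A, which mirrors the two selection criteria applied to the aligned sequence.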
2.9 Amplicon NGS
TALEN target sites as well as putative off-target sites were amplified from 100 ng genomic DNA by standard PCR using Q5® Hot Start High-Fidelity DNA Polymerase (NEB). PCR fragments were purified using either the QIAquick PCR Purification Kit or the QIAquick Gel Extraction Kit (both QIAGEN). Purified PCR products were pooled per sample and NGS libraries constructed using the NEBNext® Ultra™ II DNA Library Prep Kit for Illumina® (NEB). The resulting NGS libraries were quantified by ddPCR using the ddPCR™ Library Quantification Kit for Illumina TruSeq (Bio-Rad) and sequenced on Illumina HiSeq or NovaSeq platforms with 2 × 150 bp read length by NGS service provider Genewiz (part of Azenta Life Sciences). Reads were analyzed using the CRISPResso2 package (Clement et al., 2019) and p values were obtained from CRISPResso Compare. All primers used for amplicon NGS are listed in Supplementary Table S11.
2.10 On-target activity assessed by T7E1 assay
TALEN target sites were amplified from 100 ng of genomic DNA extracted using the NucleoSpin Tissue Mini Kit for DNA from cells and tissue (Macherey-Nagel) by standard PCR using Q5® Hot Start High-Fidelity DNA Polymerase (NEB). Fragments were purified using the QIAquick PCR Purification Kit (QIAGEN) and subsequently denatured by incubation at 95°C for 5 min. Denatured DNA fragments were allowed to re-anneal through slow cooling of the samples to room temperature. Heteroduplex cleavage as a surrogate readout for Indel formation/gene editing was visualized through enzymatic digestion of 100 ng of re-annealed sample with 7.5 U of T7 endonuclease I (NEB) for 30 min at 37°C. T7E1 cleavage efficiency was determined by agarose gel electrophoresis and subsequent analysis of the gel images using ImageJ 1.47v.
2.11 In silico prediction
In silico prediction of TALEN off-target sites was performed using the PROGNOS web tool (Cradick et al., 2014). For prediction, the individual RVDs were entered and up to six mismatches per TALEN half site were allowed. Further, a distance of 10–25 bp between TALEN binding sites was allowed and the formation of both hetero- and homodimers was enabled. PROGNOS results are listed in Supplementary Tables S2 (CCR5) and S3 (TRAC).
3 Results
3.1 The need for a specialized CAST-Seq pipeline for TALENs
CAST-Seq is a highly sensitive assay to nominate off-target (OT) sites by identifying gross chromosomal aberrations, such as large deletions and inversions at the on-target sites of designer nucleases. It was applied successfully to samples treated with various CRISPR-Cas9 nucleases (Turchiano et al., 2021; AlJanahi et al., 2022) but, to date, only to a single TALEN pair targeting the HBB locus (Turchiano et al., 2021). The fact that the bioinformatic nomination of an off-target site is fundamentally different for a dimeric TALEN pair as compared to a monomeric CRISPR-Cas nuclease prompted us to revisit the OT activity of a previously published TALEN pair targeting the CCR5 gene (Mussolino et al., 2014). To this end, TALEN-encoding mRNAs were produced and transferred into primary human T cells expanded from PBMCs. Amplicon NGS revealed that the CCR5-targeting TALEN caused small insertions and deletions (Indels) in up to 55% of alleles at the on-target site (Figure 1A). In addition, Indel formation at a frequency of 11.5% was observed at the previously described off-target site in CCR2 (Figure 1A) (Mussolino et al., 2014). Indeed, alignment of the TALEN target site in CCR5 with the off-target site in CCR2, situated 15 kb upstream, revealed a single mismatch for the right TALEN arm, while the left TALEN arm can bind to CCR2 without mismatches and with an optimal spacer of 14 bp with reference to the right arm (Supplementary Figure S1A).
FIGURE 1. CAST-Seq analysis for CCR5-targeting TALEN. (A) NGS-based genotyping of CCR5 and CCR2. Indicated is the fraction of alleles bearing Indels at the CCR5 on-target site and the off-target site in CCR2 in untreated (UT) T cells, and T cells edited with TALEN. (B) Structural variations. Circos plot illustrates CAST-Seq results with enlargement of the chromosome 3 region encompassing the CCR5 and CCR2 loci. Lines represent chromosomal rearrangements with the CCR5 target site: OMTs with >20 hits in red, NBSs with >40 hits in grey, ambiguous classification (OMT/HMT) in yellow. Red and blue layers represent the alignment and homology scores, respectively. (C) Alignment. Illustrated is the CAST-Seq alignment for ANKRD55 (OMT#1) to two right TALEN arms. Mismatched bases are highlighted in red. Positions of primers P#1/P#2 for NGS validation are indicated. (D) Genotyping of ANKRD55. NGS was performed with primers P#1/P#2. (E) Coverage plot of ANKRD55. Plot shows chromosomal position vs. number of reads, the putative TALEN binding sites (green and purple dashed lines), as well as the positions of primers P#1/P#2 and P#3/P#4. Red vertical lines show the boundary of the region. (F) Genotyping of OMT#1. NGS was performed with primers P#3/P#4. *** specifies p-value < 0.001 (Fisher’s exact test, Bonferroni corrected p-values).
CAST-Seq analysis performed on these samples nominated, in addition to CCR2, six high-scoring off-target mediated translocations (OMTs) harboring >20 CAST-Seq hits. In addition to those seven off-target sites, three sites on chromosome 4 (347 hits), chromosome 3 (66 hits), and chromosome 2 (41 hits) were identified (Figure 1B; Table 1). The current algorithm could classify them neither as OMTs nor as homology-mediated translocations (HMTs) but instead categorized them as natural break sites (NBSs). Of note, according to the number of CAST-Seq hits, translocations between the CCR5 on-target site and NBS#1 (347 hits) seem to be more frequent than translocations between CCR5 and OMT#1 (211 hits) (Table 1). We subsequently probed the predicted TALEN off-target site in OMT#1 for the formation of Indels by targeted amplicon NGS. To this end, we designed primers flanking the site at which two right TALEN arms were proposed to bind (four or six mismatches, respectively) with an 18 bp spacer (Figure 1C), but did not find signs of TALEN-associated off-target activity (Figure 1D).
TABLE 1. CAST-Seq results for CCR5-targeting TALEN. Listed are OMTs with >20 hits and NBSs with >40 hits. TALEN combinations: LF, left TALEN arm in forward orientation; LR, left TALEN arm in reverse orientation; RF, right TALEN arm in forward orientation; RR, right TALEN in reverse orientation.
In order to better understand which chromosomal regions had actually translocated from chromosome 5 (OMT#1) to the on-target site, as well as how the deletion landscape at the on-target site looks, we produced plots showing the read coverage at these chromosomal regions (Figure 1E; Supplementary Figure S1B). The read coverage at the on-target site (Supplementary Figure S1B) revealed large deletions of several kilobases. It furthermore showed two distinct peaks, one at the CCR5 on-target site and a second peak at the known off-target site in CCR2, 15 kb upstream (Supplementary Figure S1B). This observation showed that read coverage is a strong indicator for nominating an off-target site. In contrast, the predicted TALEN binding sites in OMT#1 (Figure 1E, green and purple dotted lines) were distant from the area of high coverage, as were the primers initially used to assess Indel formation at this site (Figure 1E, arrows P#1 and P#2). We therefore designed primers P#3/P#4 flanking the highly covered region and repeated amplicon NGS, uncovering Indels in 6% of alleles (Figure 1F).
Coverage plots for OMT#2-#6 revealed that in only two cases (OMT#3 and OMT#6) the predicted TALEN off-target sites (green and purple dotted lines) were close to the region with the highest read coverage (Supplementary Figure S1C).
3.2 T-CAST nominates TALEN off-target sites with high fidelity
Prompted by these results, we set out to improve the accuracy and fidelity of CAST-Seq through development of an optimized pipeline for TALENs called T-CAST. Taking advantage of the strong predictive value of read coverage at off-target sites, T-CAST first splits each translocated region into 100 bins of equal size. Next, the single bin with the highest relative read coverage obtained from two (or more) CAST-Seq replicates is identified. A restricted region of ±100 bp around this bin was chosen, and the TALEN pair was aligned to this shorter region using a 5′-T constraint, which is a prerequisite for TALEN-DNA engagement (Cornu et al., 2017). Moreover, to improve the predictive power of identifying the correct off-target site, we implemented a TALEN-specific substitution matrix, which accounts for the fact that RVDs can bind to multiple DNA bases with varying affinity (Supplementary Table S12).
As shown in Figure 2A and Supplementary Figure S2A, TALEN binding site nomination is now restricted to the most highly covered regions identified by T-CAST. The implemented changes to the bioinformatics pipeline changed not only the identification of TALEN off-target sites but also the classification of the observed chromosomal rearrangements. In addition to the known CCR2 OMT, 12 sites with more than 20 hits were classified as OMTs by T-CAST, on top of two NBSs (Figure 2B; Table 2). While the translocation with chromosome 4 (347 hits, Table 1) was now classified as an OMT, two sites previously classified as OMTs (OMT#2 and OMT#5) were now re-classified as NBSs (Table 2). Of note, the T-CAST predicted TALEN target site for OMT#2 (ANKRD55, listed as OMT#1 in Table 1) displayed seven mismatches in the first and eight mismatches in the second binding site of the TALEN left arm (Supplementary Figure S2B).
FIGURE 2. T-CAST analysis for CCR5-targeting TALEN. (A) Coverage plot for ANKRD55. Plots showing the coverage of two replicates (turquoise, light red) at the OMT site in ANKRD55. Black dashed lines indicate the region (±100 bp) flanking the bin with the highest coverage (grey). (B) Structural variations. Circos plot illustrates T-CAST results with enlargement of the chromosome 3 region encompassing the CCR5 and CCR2 loci. Lines represent chromosomal rearrangements with the CCR5 target site: OMTs with >20 hits in red, NBSs with >40 hits in grey, ambiguous classification (OMT/HMT) in yellow. (C) Genotyping. NGS at denoted sites was performed on untreated (UT) T cells, and T cells edited with a combination of left and right TALEN arm (L + R), just left (L + L) or just right (R + R) TALEN arms. *** specifies p-value < 0.001 (Fisher’s exact test, Bonferroni corrected p-values).
TABLE 2. T-CAST results for CCR5-targeting TALEN. Listed are OMTs with >20 hits and NBSs with >40 hits. TALEN combinations: LF, left TALEN arm in forward orientation; LR, left TALEN arm in reverse orientation; RF, right TALEN arm in forward orientation; RR, right TALEN in reverse orientation.
In order to validate the nominated off-target sites, we designed amplicon NGS primers flanking OMTs#1-9 and NBS#1/2 (Table 2). We obtained specific PCR products for all sites except OMT#7. NGS revealed significant Indel formation above background (0.1%) at all sites except NBS#2 (Figure 2C), with Indel frequencies ranging from 0.12% (OMT#4) to more than 5% (OMT#1 and OMT#5). We challenged the T-CAST prediction for the most likely TALEN conformation to cause off-target activity by transferring mRNA encoding either only the left TALEN arm (L + L) or only the right TALEN arm (R + R) into activated T cells. In line with the T-CAST annotation, formation of Indels at off-target sites was observed in all samples treated with only the left TALEN arm but not in cells exposed to the right TALEN arm only (Figure 2C). The exception is off-target activity at CCR2, for which both TALEN arms are necessary (L + R). Of note, the coverage plots in Supplementary Figure S1B uncovered all the events in the whole region between CCR5 and CCR2, thus exposing CCR2 as an off-target site combined with large deletions. It is also worth mentioning that, with the exception of CCR2, PROGNOS (Fine et al., 2014) did not predict any of these sites, even when allowing up to six mismatches per TALEN arm (Supplementary Table S2).
3.3 Obligate-heterodimeric (OH) TALEN mitigate off-target effects while retaining full on-target activity
TALEN-mediated DNA cleavage is performed through dimerization of the two FokI domains upon binding of the two subunits in a tail-to-tail orientation. This can be brought about by two different TALEN arms but also through the formation of homodimers as described above (Figure 2C). In order to prevent homodimerization, charged residues within the FokI dimer interface can be substituted by oppositely charged residues, exerting repellent electrostatic forces and preventing dimer formation of two identical TALEN subunits (Cade et al., 2012; Nakajima and Yaoita, 2013; Schwarze et al., 2021). These obligate-heterodimeric TALEN pairs reduce the number of possible TALEN conformations by 50% (Figure 3A).
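The 50% reduction follows directly from enumerating the pairing combinations, as the short sketch below shows. It uses the LF/LR/RF/RR notation from the table legends; the function name is hypothetical and the sketch covers only the combinatorics, not FokI biochemistry.

```python
def allowed_pairings(obligate_heterodimeric):
    """List the tail-to-tail TALEN pairings permitted by the FokI scaffold.

    L/R denote the left/right arm; F/R after the arm denote forward/reverse
    orientation, e.g. "LF.RR" is left arm forward with right arm reverse.
    """
    combos = [("L", "L"), ("L", "R"), ("R", "R"), ("R", "L")]
    if obligate_heterodimeric:
        # Homodimers (same arm on both sides) can no longer form.
        combos = [(a, b) for a, b in combos if a != b]
    return [f"{a}F.{b}R" for a, b in combos]


print(allowed_pairings(obligate_heterodimeric=False))
print(allowed_pairings(obligate_heterodimeric=True))
```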
FIGURE 3. Obligate-heterodimerization mitigates off-target activity of CCR5-TALEN. (A) Schematic of possible TALEN pairing combinations. Combinations are displayed for TALENs with wild-type (WT) and obligate-heterodimeric (OH) FokI domains. (B) Genotyping. Results of T7E1 assay of WT-TALEN and OH-TALEN (KKR-ELD and KVR-EAD configuration) are shown. 32°C indicates that T cells were subjected to transient cold-shock after electroporation, while control cells were constantly cultured at 37°C. * and ** specify p-values of <0.05 or <0.01 (Student’s t-test, n = 3–5). (C) TALEN expression. Percentage of TALEN-expressing T cells upon mRNA transfer as determined by flow cytometry (n = 1). (D,E) Structural variations. Circos plots illustrate T-CAST results with enlargement of the chromosome 3 region encompassing the CCR5 and CCR2 loci. Lines represent chromosomal rearrangements with the CCR5 target site: OMTs with >20 hits in red, ambiguous classification (OMT/HMT) in yellow. (F,G) Genotyping. NGS at denoted sites was performed on untreated (UT) T cells, and T cells edited with WT-TALENs or OH-TALENs as indicated. The group in which the samples were originally identified is indicated at the bottom. *** specifies p-value < 0.001 (Fisher’s exact test, Bonferroni corrected p-values).
Here, we validated two OH-TALEN scaffolds harboring either KKR-ELD or KVR-EAD substitutions in the FokI domain, either linked to the right or to the left TALEN arm. On-target activity of WT and OH-TALENs was assessed on genomic DNA isolated from primary T cells that were cultured constantly at 37°C or subjected to a transient temperature shift to 32°C for 24 h post-electroporation with TALEN-encoding mRNA (Figure 3B). Genotyping by T7E1 assay revealed significantly higher Indel formation for both OH-TALEN scaffolds, averaging 56% and 60% mutated alleles for KKR-ELD and KVR-EAD TALEN, respectively, in contrast to 36% mutated alleles upon transfer of WT TALEN under transient cold-shock (32°C) conditions. Under constant temperature at 37°C, no significant differences were observed between the activities of WT-TALEN and OH-TALEN, with 35%–47% cleavage in T7E1 assay (Figure 3B). Flow cytometric analysis upon intracellular staining 4 h after mRNA transfer revealed similar expression for all TALEN scaffolds (Figure 3C).
Primary T cells that were edited with CCR5-targeting OH-TALENs were subsequently subjected to T-CAST analysis. In contrast to WT-TALEN, no high-scoring chromosomal aberrations were observed in samples treated with KKR-ELD TALEN (Figure 3D) and only three structural variations with >20 T-CAST hits were identified in KVR-EAD TALEN treated cells (Figure 3E). Since no chromosomal translocations were detected in KKR-ELD samples, we decided to probe the two top-scoring NBSs (Table 3) for Indel formation. No Indels were detected at these sites in either WT or OH-TALEN treated samples (Figure 3F). Likewise, we analyzed all three nominated off-target sites identified in KVR-EAD TALEN treated samples as well as the two highest-scoring NBSs (Table 3) with respect to the presence of Indels by amplicon NGS. Indel formation above the limit of detection was only observed at OMT#1 and OMT#2 in KVR-EAD TALEN treated samples, albeit hardly above background (Figure 3F). In addition, we tested for Indel formation at the six top-scoring OMTs identified previously for WT TALEN (Table 2) as well as the on-target site and the CCR2 off-target site. In line with the T7E1 results, the fraction of mutated alleles ranged from 56% (WT) to 68% and 72% for KKR-ELD and KVR-EAD, respectively (Figure 3G). Off-target mutagenesis in CCR2 was highest in KVR-EAD edited samples (20%) and lowest in KKR-ELD treated samples (6%). In line with off-target activity at CCR2, we detected a large 15 kb deletion between the CCR5 and CCR2 loci in all samples, irrespective of the scaffold (Supplementary Figure S3). In conclusion, both OH scaffolds greatly improved the specificity of the CCR5-targeted TALEN by abrogating activity at off-target sites that were bound by a single TALEN arm in a homodimeric tail-to-tail configuration.
TABLE 3. T-CAST results for CCR5-targeting OH-TALEN. Listed are OMTs with >20 hits and the two top scoring NBSs. TALEN combinations: LF, left TALEN arm in forward orientation; LR, left TALEN arm in reverse orientation; RF, right TALEN arm in forward orientation; RR, right TALEN in reverse orientation.
3.4 T-CAST unravels off-target activity of TRAC-targeting TALEN used in large-scale production of universal CAR T cells
The constant region of the T cell receptor α chain (TRAC) is an interesting target in CAR T cell immunotherapy. Disruption of TRAC results in the loss of expression of the entire T cell receptor complex (TCR), a major road block in allogeneic CAR T cell therapy (Lin et al., 2021). We have previously shown that a TALEN targeting TRAC can be used in large scale manufacturing of universal CAR T cells (Alzubi et al., 2021). Using T-CAST, we revisited the safety profile of the previously used WT-TALEN in comparison to OH-TALEN scaffolds. Genotyping by T7E1 assays revealed high on-target activity for all three TALEN scaffolds, each one achieving some 90% of T7E1 cleavage (Figure 4A). In line with our findings for CCR5-targeting TALENs, OH-TALEN targeting TRAC outperformed their WT counterpart when the primary T cells were subjected to a transient 32°C cold-shock post electroporation. These findings are mirrored by the phenotypic analysis of TCRα/β surface expression performed 1 week post-transfer of TALEN-encoding mRNA. Under transient cold-shock conditions, the fraction of TCRα/β-negative cells increased from 55% TCR-negative T cells upon transfer of WT-TALENs to 85% and 83% in T cells edited with KVR-EAD and KKR-ELD OH-TALENs respectively (Figure 4B). T cells cultured at 37°C revealed no significant differences with regard to knockout efficacies between the various scaffolds (Figures 4A, B). A representative flow cytometric analysis (Supplementary Figure S4) shows the two clearly separated populations of TCRα/β positive and TCRα/β negative cells, in agreement with monoallelic expression of the T cell receptor α chain.
FIGURE 4. T-CAST analysis for TRAC-targeting TALENs (A) Genotyping. Results of T7E1 assay of WT-TALEN and OH-TALEN (KKR-ELD and KVR-EAD configuration) are shown. 32°C indicates that T cells were subjected to transient cold-shock after electroporation, while control cells were constantly cultured at 37°C. * specifies p-value<0.05 (Student’s t-test, n = 3–6). (B) TCR expression. Displayed is the fraction of TCRα/β-negative T cells upon transfer of TALEN-encoding mRNA as determined by flow cytometry. Where indicated (32°C) T cells were subjected to a transient cold-shock. *** specifies p-value<0.001 (Student’s t-test; n = 3–7). (C–E) Structural variations. Circos plots illustrate T-CAST results with enlargement of the chromosome 14 region encompassing TRAC. Red lines represent chromosomal rearrangements (OMTs with >20 hits) with the TRAC target site. (F) Genotyping. NGS at denoted sites was performed on untreated (UT) T cells, and T cells edited with WT-TALENs or OH-TALENs as indicated. The group in which the samples were originally identified is indicated on the bottom. *** specifies p-value<0.001 (Fisher’s exact test, Bonferroni corrected p-values).
Similar to our observations for TALEN targeting CCR5, the number of chromosomal rearrangements was highest in samples treated with the WT-TALEN scaffold (Table 4). T-CAST identified a total of five translocations with more than 20 T-CAST hits (Figure 4C). In KKR-ELD TALEN treated samples, no structural variation was identified (Figure 4D), while T cells edited with KVR-EAD TALEN displayed three chromosomal aberrations (Figure 4E). Of note, the highest-scoring rearrangement observed in the KVR-EAD samples, OMT#1 with 79 T-CAST hits, is in close proximity to the on-target site, thus likely representing a large deletion rather than an off-target site. A similar phenomenon was observed in KKR-ELD edited T cells (OMT#2, Table 4).
TABLE 4. T-CAST results for TRAC-targeting wild type and OH-TALEN. Listed are OMTs with >20 hits or the two top scoring OMTs (KKR-ELD) and NBSs with >40 hits. TALEN combinations: LF, left TALEN arm in forward orientation; LR, left TALEN arm in reverse orientation; RF, right TALEN arm in forward orientation; RR, right TALEN in reverse orientation. Large deletions (Del.) at the TRAC on-target site are specified.
Validation of T-CAST results by targeted amplicon NGS confirmed high on-target activity for all TALEN scaffolds, with 76%–88% Indels at TRAC (Figure 4F). Assessment of off-targets OMT#1–4 identified for WT-TALEN confirmed off-target activity at OMT#1–3, with 6%–24% Indels at those sites in the WT-TALEN treated samples and absence of off-target mutagenesis in T cells edited with the OH-TALEN scaffolds. No Indels were observed at OMT#4 in any of the edited samples (Figure 4F), while OMT#5 could not be amplified. We furthermore probed OMTs associated with KKR-ELD TALEN (OMT#1, 17 CAST-hits) and KVR-EAD TALEN (OMT#2, 32 CAST-hits). Despite the low number of T-CAST hits, a small but significant fraction of reads in both OH-TALEN edited T cell samples revealed activity at this off-target site on chromosome 2, with 0.5%–0.7% of alleles carrying Indels. No Indels were detected at KVR-EAD OMT#2 (Figure 4F). Of note, similarly to the CCR5-targeting TALENs, there is no overlap between the PROGNOS-predicted sites for TRAC-targeting TALEN (Supplementary Table S3) and experimentally confirmed off-target sites identified by T-CAST.
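The Figure 4F legend reports Fisher's exact test with Bonferroni-corrected p-values for the amplicon NGS comparisons. As a minimal sketch of that kind of calculation, and not the authors' actual analysis code, the following compares Indel-bearing versus unmodified read counts between an edited and an untreated sample and then corrects for the number of sites tested. The function names and the 2x2 layout are assumptions made for illustration, and the implementation is only practical for modest read counts.

```python
from math import comb
from typing import List


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Hypothetical layout: a/b = Indel/unmodified reads in the edited sample,
    c/d = Indel/unmodified reads in the untreated control.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x: int) -> float:
        # Hypergeometric probability of observing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum all tables that are at least as extreme (as improbable) as the observed one.
    total = sum(p for p in (prob(x) for x in range(lo, hi + 1))
                if p <= p_obs * (1 + 1e-7))
    return min(1.0, total)


def bonferroni(p_values: List[float]) -> List[float]:
    """Multiply each p-value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]
```

With sequencing depths in the tens of thousands of reads, a library routine built on log-probabilities would be the more practical choice; the exact-integer arithmetic here is kept only to make the test transparent.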
In conclusion and in agreement with the results for the CCR5-targeting TALENs, both OH scaffolds greatly improved the specificity of the TRAC-targeted TALEN by abrogating off-target activity at sites that are cleaved by a homodimeric TALEN.
4 Discussion
Gene editing tools entered the clinical stage almost 10 years ago. Some 50 active interventional clinical trials (ClinicalTrials.gov, accessed on Dec. 22, 2022) employ genome editors to treat devastating diseases through ex vivo or in vivo applications. While the CRISPR-Cas system is by far the most frequently used platform, other customizable nucleases, such as TALENs or ZFNs, are used as well.
With an ever-increasing number of gene editing clinical trials, the demand for highly sensitive assays to detect unwanted side effects is equally on the rise. While the safety of CRISPR-Cas nucleases can be assessed by in vitro methods (Tsai et al., 2017), such assays are not transferable to TALENs and ZFNs because these proteins cannot be produced in the amounts needed for these assays. Moreover, in contrast to CRISPR-Cas, for which several in silico off-target prediction tools exist (Cradick et al., 2014; Montague et al., 2014; Park et al., 2015; Stemmer et al., 2015; Haeussler et al., 2016), off-target prediction algorithms are rather scarce for TALENs and they have neither been further developed nor updated in many years (Fine et al., 2014). It is therefore not surprising that the prediction algorithm PROGNOS was not able to forecast most of the off-target sites identified in this study. It is thus highly relevant to have powerful off-target detection methods at hand, which can be applied in cellula. While some assays rely on the concomitant delivery of short DNA fragments to tag DSBs (Tsai et al., 2015), others use chromatin-immunoprecipitation of DNA repair factors to nominate the off-target sites (Wienert et al., 2019). CAST-Seq and other assays (Frock et al., 2015; Turchiano et al., 2021; Liu et al., 2022) identify off-target sites through detection of chromosomal rearrangements triggered by the expression of designer nucleases. These methods are of eminent importance, not only to develop and implement safe therapeutic genome editing strategies but also to monitor the patients during the clinical follow-up phase.
Here, we described the implementation of T-CAST, a CAST-Seq based pipeline that has been optimized to identify off-target effects triggered by TALENs. While CAST-Seq based detection of chromosomal rearrangements is agnostic to the designer nuclease platform, the nomination of the off-target sites is not. The annotation of TALEN off-target sites is particularly challenging because TALE RVDs are, despite showing a strong preference for a particular base, rather promiscuous in their DNA binding behavior (Miller et al., 2011; Streubel et al., 2012; Guilinger et al., 2014; Juillerat et al., 2014). In order to improve the reliability of TALEN off-target site nomination, we took advantage of the fact that the CAST-Seq read coverage at a given chromosomal region is a strong indicator for the presence of an off-target site. The T-CAST pipeline restricts TALEN alignment to this region, and it furthermore accounts for the requirement of a 5′-T for efficient DNA binding by a TALEN subunit (Boch et al., 2009; Moscou and Bogdanove, 2009). The current T-CAST pipeline will hence not work properly if TALENs bearing a modified N-terminal domain that recognizes all bases at the 5′-end (Lamb et al., 2013) are employed. However, T-CAST can be readily adapted for novel RVD scaffolds and also alternative TALE formats if needed.
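The two constraints described above, restricting the search to a covered region and requiring a 5′-T immediately upstream of each candidate binding site, can be illustrated with a small sketch. This is not the T-CAST implementation: the function names, the plain mismatch count (instead of an RVD-aware scoring scheme), and the `max_mismatches` threshold are all assumptions made for illustration.

```python
from typing import List, Tuple

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def _scan_strand(seq: str, target: str, max_mismatches: int) -> List[Tuple[int, int]]:
    """Return (start, mismatches) for windows on one strand preceded by a 5'-T."""
    n = len(target)
    hits = []
    for start in range(1, len(seq) - n + 1):
        if seq[start - 1] != "T":  # enforce the 5'-T requirement
            continue
        mismatches = sum(x != y for x, y in zip(seq[start:start + n], target))
        if mismatches <= max_mismatches:
            hits.append((start, mismatches))
    return hits


def scan_half_sites(region: str, target: str,
                    max_mismatches: int) -> List[Tuple[int, str, int]]:
    """Find candidate binding sites for one TALEN arm within a covered region.

    `target` is the sequence read by the repeat array, excluding the 5'-T.
    Returns (forward-strand start, orientation, mismatches), where
    orientation is "F" (forward) or "R" (reverse).
    """
    region, target = region.upper(), target.upper()
    n, length = len(target), len(region)
    hits = [(s, "F", mm) for s, mm in _scan_strand(region, target, max_mismatches)]
    hits += [(length - (s + n), "R", mm)
             for s, mm in _scan_strand(revcomp(region), target, max_mismatches)]
    return sorted(hits)
```

Running such a scan once per arm yields forward and reverse candidates for the left and the right arm, which correspond to the LF, LR, RF and RR labels used in Tables 3 and 4.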
T-CAST identified various off-target mediated structural variations triggered by the expression of TALENs designed to target CCR5 or TRAC. Almost all of the nominated off-target sites could be verified by targeted amplicon NGS. In three cases, we did not detect Indels at a predicted OMT. The most likely explanation for this discrepancy is the lower limit of detection (LLOD) of CAST-Seq versus targeted amplicon sequencing. While the LLOD of CAST-Seq was determined to be about 0.01%, the LLOD for reliably detecting Indels by amplicon NGS is about 10-fold higher. However, we cannot exclude with certainty that the three sites classified as OMTs are false positives. In another instance, we detected Indel formation at a site classified as NBS, suggesting that although T-CAST was able to identify the chromosomal translocation, the classification as an NBS was incorrect. The erroneous classification stems from the aforementioned promiscuity of RVDs: the T-CAST classifier fails if a TALEN arm tolerates seven or more mismatches.
Having set up a reliable bioinformatics pipeline to identify TALEN off-target sites, we used T-CAST to evaluate whether obligate-heterodimeric TALEN scaffolds can mitigate off-target activity. We established that OH-TALENs displayed on-target activity similar to or higher than that of WT-TALENs. Interestingly, OH-TALEN outperformed WT-TALEN whenever the cells were subjected to a transient cold-shock post-nucleofection. In contrast, no significant differences were observed when the primary T cells were constantly cultured at 37°C. Since we demonstrated comparable expression levels of the different TALEN scaffolds, we speculate that the diverging behavior is due to an altered affinity between the FokI dimer interfaces of the three TALEN scaffolds tested here.
Regardless, our main focus was to validate reduced off-target activity of OH-TALEN scaffolds. Indeed, the occurrence of chromosomal aberrations in T cells edited with KKR-ELD and KVR-EAD TALENs was strongly reduced. They were detected only at off-target sites that were engaged by a left and a right TALEN subunit, confirming that the OH-TALEN scaffolds effectively prevent off-target events at ‘homodimeric binding sites’. Our observations are consistent with other studies that reported a decrease in OT-activity when using obligate-heterodimeric nucleases (Miller et al., 2007; Szczepek et al., 2007; Söllü et al., 2010; Doyon et al., 2011; Cade et al., 2012; Nakajima and Yaoita, 2013; Schwarze et al., 2021).
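The pairing logic behind this observation (Figure 3A) can be written down as a short enumeration. The sketch below is a deliberately simplified model and not part of T-CAST: it assumes that WT FokI domains dimerize in any arm combination, whereas obligate-heterodimeric domains are productive only when a left and a right arm meet, and it ignores any residual homodimer activity. The names `allowed_pairings` and `is_cleavable` are hypothetical.

```python
from itertools import product
from typing import List, Tuple

ARMS = ("L", "R")  # left and right TALEN arm


def allowed_pairings(scaffold: str) -> List[Tuple[str, str]]:
    """List arm pairs expected to form an active FokI dimer for a scaffold."""
    pairs = list(product(ARMS, repeat=2))
    if scaffold == "WT":
        # Wild-type FokI: homodimers (L/L, R/R) and heterodimers can all cleave.
        return pairs
    if scaffold == "OH":
        # Obligate heterodimer: only a left arm paired with a right arm.
        return [pair for pair in pairs if pair[0] != pair[1]]
    raise ValueError(f"unknown scaffold: {scaffold!r}")


def is_cleavable(arm_a: str, arm_b: str, scaffold: str) -> bool:
    """Return True if a site bound by arm_a and arm_b can be cut in this model."""
    return (arm_a, arm_b) in allowed_pairings(scaffold)
```

In this model, a site bound by two copies of the same arm is cleavable only with the WT scaffold, which matches the loss of 'homodimeric binding site' events reported for KKR-ELD and KVR-EAD TALENs.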
In summary, our results show that T-CAST is a genome-wide and sensitive method to detect chromosomal rearrangements induced by TALENs, and to nominate TALEN off-target sites with high precision in primary human cells. It can easily be envisioned to adapt T-CAST to additional dimeric genome editing platforms, including ZFNs, double nickase approaches, or other customized endonucleases, for which in silico or in vitro methods are challenging or not possible. Moreover, as shown for CAST-Seq, T-CAST can be performed on any primary cell type with only 500 ng of genomic DNA required, thus proving its value for monitoring chromosomal aberrations during preclinical development, as well as for monitoring the final product and patients during the follow-up phase.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE222763.
Author contributions
TIC, TC, and MR conceived the study and designed experiments. MR, KG and JR performed experiments. GA and MB developed the T-CAST bioinformatics pipeline. GA and MR performed bioinformatics analyses. TIC, MR, TC, GA, MB, and KG analysed data. MR, TIC, and TC wrote the manuscript which was revised by all co-authors. TIC, TC, GA, and MB acquired funds.
Funding
This study was supported by the Job Research Foundation (ZVK2020121800 to TC and TIC), the German Research Foundation (DFG: FANedit CA311/4-1 to TC; SFB1160-Z02 to MB), and the German Federal Ministry of Education and Research (BMBF: editCCR5 FKZ 01EK2205 to TC and TIC; MIRACUM FKZ 01ZZ 1801B to MB; EkoEstMed FKZ 01ZZ2015 to GA). We further acknowledge support by the Open Access Publication Fund of the University of Freiburg.
Acknowledgments
We thank Viviane Dettmer-Monaco and Jamal Alzubi for support with T cell cultures, Melina el Gaz for technical support, the Lighthouse Core Facility (Medical Center–University of Freiburg) for help with flow cytometry, the Blood Donation Center, Medical Center–University of Freiburg, for providing LRS chambers, and the members of our laboratories for fruitful discussions and suggestions.
Conflict of interest
TIC has a sponsored research collaboration with Cimeio Therapeutics. TC and TIC have a sponsored research collaboration with Cellectis. TC is an advisor to Cimeio Therapeutics, Excision BioTherapeutics, and Novo Nordisk. TC, MB and GA hold a patent on CAST-Seq (US11319580B2). All other authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgeed.2023.1130736/full#supplementary-material
References
AlJanahi, A. A., Lazzarotto, C. R., Chen, S., Shin, T. H., Cordes, S., Fan, X., et al. (2022). Prediction and validation of hematopoietic stem and progenitor cell off-target editing in transplanted rhesus macaques. Mol. Ther. J. Am. Soc. Gene Ther. 30 (1), 209–222. doi:10.1016/j.ymthe.2021.06.016
Alzubi, J., Lock, D., Rhiel, M., Schmitz, S., Wild, S., Mussolino, C., et al. (2021). Automated generation of gene-edited CAR T cells at clinical scale. Mol. Ther. Methods & Clin. Dev. 20, 379–388. doi:10.1016/j.omtm.2020.12.008
Benjamin, R., Jain, N., Maus, M. V., Boissel, N., Graham, C., Jozwik, A., et al. (2022). UCART19, a first-in-class allogeneic anti-CD19 chimeric antigen receptor T-cell therapy for adults with relapsed or refractory B-cell acute lymphoblastic leukaemia (CALM): A phase 1, dose-escalation trial. Lancet Haematol. 9 (11), e833–e843. doi:10.1016/S2352-3026(22)00245-9
Boch, J., Scholze, H., Schornack, S., Landgraf, A., Hahn, S., Kay, S., et al. (2009). Breaking the code of DNA binding specificity of TAL-type III effectors. Science 326 (5959), 1509–1512. doi:10.1126/science.1178811
Cade, L., Reyon, D., Hwang, W. Y., Tsai, S. Q., Patel, S., Khayter, C., et al. (2012). Highly efficient generation of heritable zebrafish gene mutations using homo- and heterodimeric TALENs. Nucleic acids Res. 40 (16), 8001–8010. doi:10.1093/nar/gks518
Cathomen, T., and Joung, J. K. (2008). Zinc-finger nucleases: The next generation emerges. Mol. Ther. J. Am. Soc. Gene Ther. 16 (7), 1200–1207. doi:10.1038/mt.2008.114
Christian, M., Cermak, T., Doyle, E. L., Schmidt, C., Zhang, F., Hummel, A., et al. (2010). Targeting DNA double-strand breaks with TAL effector nucleases. Genetics 186 (2), 757–761. doi:10.1534/genetics.110.120717
Christian, M. L., Demorest, Z. L., Starker, C. G., Osborn, M. J., Nyquist, M. D., Zhang, Y., et al. (2012). Targeting G with TAL effectors: A comparison of activities of TALENs constructed with NN and NK repeat variable di-residues. PLoS One 7 (9), e45383. doi:10.1371/journal.pone.0045383
Clement, K., Rees, H., Canver, M. C., Gehrke, J. M., Farouni, R., Hsu, J. Y., et al. (2019). CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat. Biotechnol. 37 (3), 224–226. doi:10.1038/s41587-019-0032-3
Cornu, T. I., Mussolino, C., and Cathomen, T. (2017). Refining strategies to translate genome editing to the clinic. Nat. Med. 23 (4), 415–423. doi:10.1038/nm.4313
Cradick, T. J., Qiu, P., Lee, C. M., Fine, E. J., and Bao, G. (2014). Cosmid: A web-based tool for identifying and validating CRISPR/cas off-target sites. Mol. Ther. Nucleic Acids 3 (12), e214. doi:10.1038/mtna.2014.64
Doyon, Y., Vo, T. D., Mendel, M. C., Greenberg, S. G., Wang, J., Xia, D. F., et al. (2011). Enhancing zinc-finger-nuclease activity with improved obligate heterodimeric architectures. Nat. methods 8 (1), 74–79. doi:10.1038/nmeth.1539
Fine, E. J., Cradick, T. J., Zhao, C. L., Lin, Y., and Bao, G. (2014). An online bioinformatics tool predicts zinc finger and TALE nuclease off-target cleavage. Nucleic acids Res. 42 (6), e42. doi:10.1093/nar/gkt1326
Frock, R. L., Hu, J., Meyers, R. M., Ho, Y. J., Kii, E., and Alt, F. W. (2015). Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases. Nat. Biotechnol. 33 (2), 179–186. doi:10.1038/nbt.3101
Gaj, T., Gersbach, C. A., and Barbas, C. F. (2013). ZFN, TALEN, and CRISPR/Cas-based methods for genome engineering. Trends Biotechnol. 31 (7), 397–405. doi:10.1016/j.tibtech.2013.04.004
Gasiunas, G., Barrangou, R., Horvath, P., and Siksnys, V. (2012). Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc. Natl. Acad. Sci. U. S. A. 109, E2579–E2586. doi:10.1073/pnas.1208507109
Gautron, A. S., Juillerat, A., Guyot, V., Filhol, J. M., Dessez, E., Duclert, A., et al. (2017). Fine and predictable tuning of TALEN gene editing targeting for improved T cell adoptive immunotherapy. Mol. Ther. Nucleic Acids 9, 312–321. doi:10.1016/j.omtn.2017.10.005
Guilinger, J. P., Pattanayak, V., Reyon, D., Tsai, S. Q., Sander, J. D., Joung, J. K., et al. (2014). Broad specificity profiling of TALENs results in engineered nucleases with improved DNA-cleavage specificity. Nat. methods 11 (4), 429–435. doi:10.1038/nmeth.2845
Haeussler, M., Schonig, K., Eckert, H., Eschstruth, A., Mianne, J., Renaud, J. B., et al. (2016). Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biol. 17 (1), 148. doi:10.1186/s13059-016-1012-2
Harmatz, P., Prada, C. E., Burton, B. K., Lau, H., Kessler, C. M., Cao, L., et al. (2022). First-in-human in vivo genome editing via AAV-zinc-finger nucleases for mucopolysaccharidosis I/II and hemophilia B. Mol. Ther. J. Am. Soc. Gene Ther. 30 (12), 3587–3600. doi:10.1016/j.ymthe.2022.10.010
Jinek, M., Chylinski, K., Fonfara, I., Hauer, M., Doudna, J. A., and Charpentier, E. (2012). A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337 (6096), 816–821. doi:10.1126/science.1225829
Juillerat, A., Dubois, G., Valton, J., Thomas, S., Stella, S., Maréchal, A., et al. (2014). Comprehensive analysis of the specificity of transcription activator-like effector nucleases. Nucleic acids Res. 42 (8), 5390–5402. doi:10.1093/nar/gku155
Juillerat, A., Pessereau, C., Dubois, G., Guyot, V., Marechal, A., Valton, J., et al. (2015). Optimized tuning of TALEN specificity using non-conventional RVDs. Sci. Rep. 5, 8150. doi:10.1038/srep08150
Kim, D., Luk, K., Wolfe, S. A., and Kim, J. S. (2019). Evaluating and enhancing target specificity of gene-editing nucleases and deaminases. Annu. Rev. Biochem. 88, 191–220. doi:10.1146/annurev-biochem-013118-111730
Lamb, B. M., Mercer, A. C., and Barbas, C. F. (2013). Directed evolution of the TALE N-terminal domain for recognition of all 5' bases. Nucleic acids Res. 41 (21), 9779–9785. doi:10.1093/nar/gkt754
Lin, H., Cheng, J., Mu, W., Zhou, J., and Zhu, L. (2021). Advances in universal CAR-T cell therapy. Front. Immunol. 12, 744823. doi:10.3389/fimmu.2021.744823
Liu, Y., Yin, J., Gan, T., Liu, M., Xin, C., Zhang, W., et al. (2022). PEM-seq comprehensively quantifies DNA repair outcomes during gene-editing and DSB repair. Star. Protoc. 3 (1), 101088. doi:10.1016/j.xpro.2021.101088
Miller, J. C., Holmes, M. C., Wang, J., Guschin, D. Y., Lee, Y. L., Rupniewski, I., et al. (2007). An improved zinc-finger nuclease architecture for highly specific genome editing. Nat. Biotechnol. 25 (7), 778–785. doi:10.1038/nbt1319
Miller, J. C., Patil, D. P., Xia, D. F., Paine, C. B., Fauser, F., Richards, H. W., et al. (2019). Enhancing gene editing specificity by attenuating DNA cleavage kinetics. Nat. Biotechnol. 37 (8), 945–952. doi:10.1038/s41587-019-0186-z
Miller, J. C., Tan, S., Qiao, G., Barlow, K. A., Wang, J., Xia, D. F., et al. (2011). A TALE nuclease architecture for efficient genome editing. Nat. Biotechnol. 29 (2), 143–148. doi:10.1038/nbt.1755
Miller, J. C., Zhang, L., Xia, D. F., Campo, J. J., Ankoudinova, I. V., Guschin, D. Y., et al. (2015). Improved specificity of TALE-based genome editing using an expanded RVD repertoire. Nat. methods 12 (5), 465–471. doi:10.1038/nmeth.3330
Montague, T. G., Cruz, J. M., Gagnon, J. A., Church, G. M., and Valen, E. (2014). CHOPCHOP: A CRISPR/Cas9 and TALEN web tool for genome editing. Nucleic acids Res. 42, W401–W407. doi:10.1093/nar/gku410
Moscou, M. J., and Bogdanove, A. J. (2009). A simple cipher governs DNA recognition by TAL effectors. Science 326 (5959), 1501. doi:10.1126/science.1178817
Mussolino, C., Alzubi, J., Fine, E. J., Morbitzer, R., Cradick, T. J., Lahaye, T., et al. (2014). TALENs facilitate targeted genome editing in human cells with high specificity and low cytotoxicity. Nucleic acids Res. 42 (10), 6762–6773. doi:10.1093/nar/gku305
Mussolino, C., Morbitzer, R., Lutge, F., Dannemann, N., Lahaye, T., and Cathomen, T. (2011). A novel TALE nuclease scaffold enables high genome editing activity in combination with low toxicity. Nucleic acids Res. 39 (21), 9283–9293. doi:10.1093/nar/gkr597
Nakajima, K., and Yaoita, Y. (2013). Comparison of TALEN scaffolds in Xenopus tropicalis. Biol. open 2 (12), 1364–1370. doi:10.1242/bio.20136676
Park, J., Bae, S., and Kim, J. S. (2015). Cas-designer: A web-based tool for choice of CRISPR-cas9 target sites. Bioinformatics 31 (24), 4014–4016. doi:10.1093/bioinformatics/btv537
Porteus, M. H., and Carroll, D. (2005). Gene targeting using zinc finger nucleases. Nat. Biotechnol. 23 (8), 967–973. doi:10.1038/nbt1125
Qasim, W., Zhan, H., Samarasinghe, S., Adams, S., Amrolia, P., Stafford, S., et al. (2017). Molecular remission of infant B-ALL after infusion of universal TALEN gene-edited CAR T cells. Sci. Transl. Med. 9 (374), eaaj2013. doi:10.1126/scitranslmed.aaj2013
Romito, M., Juillerat, A., Kok, Y. L., Hildenbeutel, M., Rhiel, M., Andrieux, G., et al. (2021). Preclinical evaluation of a novel TALEN targeting CCR5 confirms efficacy and safety in conferring resistance to HIV-1 infection. Biotechnol. J. 16 (1), e2000023. doi:10.1002/biot.202000023
Schwarze, L. I., Głów, D., Sonntag, T., Uhde, A., and Fehse, B. (2021). Optimisation of a TALE nuclease targeting the HIV co-receptor CCR5 for clinical application. Gene Ther. 28 (9), 588–601. doi:10.1038/s41434-021-00271-9
Söllü, C., Pars, K., Cornu, T. I., Thibodeau-Beganny, S., Maeder, M. L., Joung, J. K., et al. (2010). Autonomous zinc-finger nuclease pairs for targeted chromosomal deletion. Nucleic acids Res. 38 (22), 8269–8276. doi:10.1093/nar/gkq720
Stemmer, M., Thumberger, T., Del Sol Keyer, M., Wittbrodt, J., and Mateo, J. L. (2015). CCTop: An intuitive, flexible and reliable CRISPR/Cas9 target prediction tool. PLoS One 10 (4), e0124633. doi:10.1371/journal.pone.0124633
Streubel, J., Blücher, C., Landgraf, A., and Boch, J. (2012). TAL effector RVD specificities and efficiencies. Nat. Biotechnol. 30 (7), 593–595. doi:10.1038/nbt.2304
Szczepek, M., Brondani, V., Buchel, J., Serrano, L., Segal, D. J., and Cathomen, T. (2007). Structure-based redesign of the dimerization interface reduces the toxicity of zinc-finger nucleases. Nat. Biotechnol. 25 (7), 786–793. doi:10.1038/nbt1317
Tebas, P., Stein, D., Tang, W. W., Frank, I., Wang, S. Q., Lee, G., et al. (2014). Gene editing of CCR5 in autologous CD4 T cells of persons infected with HIV. N. Engl. J. Med. 370 (10), 901–910. doi:10.1056/NEJMoa1300662
Tsai, S. Q., Nguyen, N. T., Malagon-Lopez, J., Topkar, V. V., Aryee, M. J., and Joung, J. K. (2017). CIRCLE-Seq: A highly sensitive in vitro screen for genome-wide CRISPR-cas9 nuclease off-targets. Nat. methods 14 (6), 607–614. doi:10.1038/nmeth.4278
Tsai, S. Q., Zheng, Z., Nguyen, N. T., Liebers, M., Topkar, V. V., Thapar, V., et al. (2015). GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 33 (2), 187–197. doi:10.1038/nbt.3117
Turchiano, G., Andrieux, G., Klermund, J., Blattner, G., Pennucci, V., El Gaz, M., et al. (2021). Quantitative evaluation of chromosomal rearrangements in gene-edited human stem cells by CAST-Seq. Cell stem Cell 28, 1136–1147.e5. doi:10.1016/j.stem.2021.02.002
Urnov, F. D., Rebar, E. J., Holmes, M. C., Zhang, H. S., and Gregory, P. D. (2010). Genome editing with engineered zinc finger nucleases. Nat. Rev. Genet. 11 (9), 636–646. doi:10.1038/nrg2842
Wienert, B., and Cromer, M. K. (2022). CRISPR nuclease off-target activity and mitigation strategies. Front. genome Ed. 4, 1050507. doi:10.3389/fgeed.2022.1050507
Wienert, B., Wyman, S. K., Richardson, C. D., Yeh, C. D., Akcakaya, P., Porritt, M. J., et al. (2019). Unbiased detection of CRISPR off-targets in vivo using DISCOVER-Seq. Science 364 (6437), 286–289. doi:10.1126/science.aav9023
Keywords: TALEN, designer nuclease, chromosomal translocations, off-target effects, FokI, preclinical risk assessment, chromosomal rearrangements, obligate heterodimer
Citation: Rhiel M, Geiger K, Andrieux G, Rositzka J, Boerries M, Cathomen T and Cornu TI (2023) T-CAST: An optimized CAST-Seq pipeline for TALEN confirms superior safety and efficacy of obligate-heterodimeric scaffolds. Front. Genome Ed. 5:1130736. doi: 10.3389/fgeed.2023.1130736
Received: 23 December 2022; Accepted: 06 February 2023;
Published: 20 February 2023.
Edited by:
Karim Benabdellah, Andalusian Autonomous Government of Genomics and Oncological Research (GENYO), Spain
Reviewed by:
Eli J. Fine, National Resilience, United States
Xiaoxia Cui, Washington University in St. Louis, United States
Copyright © 2023 Rhiel, Geiger, Andrieux, Rositzka, Boerries, Cathomen and Cornu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tatjana I. Cornu, Tatjana.Cornu@uniklinik-freiburg.de
†These authors have contributed equally to this work and share first authorship