ORIGINAL RESEARCH article

Front. Immunol., 06 September 2023
Sec. Cancer Immunity and Immunotherapy
This article is part of the Research Topic Cancer Immunotherapy – Diagnostic and Therapeutic Strategies to Enhance Antitumoral Efficacies whilst Minimizing Toxicity

Chimeric RNAs reveal putative neoantigen peptides for developing tumor vaccines for breast cancer

Brandon Mistretta1†, Sakuni Rankothgedera1†, Micah Castillo1†, Mitchell Rao1, Kimberly Holloway1, Anjana Bhardwaj2, Maha El Noafal3, Constance Albarracin4, Randa El-Zein3, Hengameh Rezaei1, Xiaoping Su5, Rehan Akbani5, Xiaoshan M. Shao6, Brian J. Czerniecki7, Rachel Karchin6, Isabelle Bedrosian2*, Preethi H. Gunaratne1,8,9*
  • 1Department of Biology & Biochemistry, University of Houston, Houston, TX, United States
  • 2Department of Breast Surgical Oncology, University of Texas, MD Anderson Cancer Center, Houston, TX, United States
  • 3Department of Medicine, Houston Methodist Research Institute, Houston, TX, United States
  • 4Department of Pathology, The UT MD Anderson Cancer Center, Houston, TX, United States
  • 5Department of Bioinformatics & Computational Biology, University of Texas, MD Anderson Cancer Center, Houston, TX, United States
  • 6Biomedical Engineering Department, Institute for Computational Medicine, Johns Hopkins School of Medicine, Baltimore, MD, United States
  • 7Department of Molecular & Cellular Biology, Baylor College of Medicine, Houston, TX, United States
  • 8Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, United States
  • 9Department of Breast Oncology, H. Lee Moffitt Cancer Center, Tampa, FL, United States

Introduction: We present here a strategy to identify immunogenic neoantigen candidates from unique amino acid sequences at the junctions of fusion proteins which can serve as targets in the development of tumor vaccines for the treatment of breast cancer.

Method: We mined the sequence reads of breast tumor tissue that are usually discarded as discordant paired-end reads and discovered cancer specific fusion transcripts using tissue from cancer free controls as reference. Binding affinity predictions of novel peptide sequences crossing the fusion junction were analyzed by the MHC Class I binding predictor, MHCnuggets. CD8+ T cell responses against the 15 peptides were assessed through in vitro Enzyme Linked Immunospot (ELISpot).

Results: We uncovered 20 novel fusion transcripts from 75 breast tumors of 3 subtypes: TNBC, HER2+, and HR+. Of these, the NSFP1-LRRC37A2 fusion transcript was selected for further study. The 3833 bp chimeric RNA predicted by the consensus fusion junction sequence is consistent with read-through transcription of the 5’-gene, the pseudogene NSFP1 (NSF truncation at exon 12/13), followed by trans-splicing to connect with LRRC37A2, located immediately 3’, through exon 1/2. A total of 15 different 8-mer neoantigen peptides discovered from the NSFP1 and LRRC37A2 truncations were predicted to bind to a total of 35 unique MHC class I alleles with a binding affinity of IC50 < 500 nM, 1 of which elicited a robust immune response.

Conclusion: Our data provide a framework to identify immunogenic neoantigen candidates from fusion transcripts and suggest a potential vaccine strategy to target the immunogenic neopeptides in patients with tumors carrying the NSFP1-LRRC37A2 fusion.

1 Introduction

Tumor vaccines capable of promoting immune response have the potential to make significant contributions to the treatment and prevention of cancer. The antigenic repertoire that arises during tumorigenesis through somatic alterations in tumors provides a plethora of non-self-antigens (neoantigens) that can form the basis of vaccination-based cancer immunotherapies. Many of the neoantigens discovered have been shown to be capable of inducing anti-tumor immune responses with minimal side effects in the treatment setting (1, 2). Neoantigen load has been reported to be strongly correlated with clinical response to immunotherapy (3) and with high somatic mutational burden. A high density of candidate neoantigens has also been shown to improve survival in patients treated with immune checkpoint blockade in non-small cell lung cancer (NSCLC) (4) and melanoma (5, 6). However, many neoantigens caused by non-synonymous mutations are patient specific and thus can only be used as personalized vaccines; they are not available as an ‘off the shelf’ option for treatment that would facilitate widespread adoption (7). Therefore, identification of shared neoantigens generated through aberrant transcripts which are prevalent in cancer patients would help overcome one of the current challenges in the advancement of vaccination-based cancer immunotherapies.

Much of the work on neoantigens relates to single nucleotide variants (SNVs) and small insertions and deletions (indels) (8). However, for cancers with a low to moderate mutation burden, such as breast cancer, these approaches provide a limited neoantigen repertoire that can be harnessed for therapeutic cancer vaccines. Non-mutated, over-expressed peptides have thus been of interest in this context, with much of the clinical research focused on peptides derived from HER2-Neu (9, 10). Additional approaches that expand the available immunogenic peptides for use in cancer vaccines in these tumors with a limited repertoire of neoantigens derived from non-synonymous mutations are needed if this promising immunotherapy strategy is to be fully utilized clinically.

Here, we focused on identifying neoantigens in fusion transcripts from two separate genes identified from RNA-sequencing (RNA-seq) data of breast cancer samples. The unique sequences at the fusion junctions form new open reading frames (ORFs) that can result in fusion proteins representing a hybrid of the two founding genes and/or truncated versions of the two wild type proteins due to premature termination of the 5’-gene, yielding a unique amino acid sequence in the C-terminus and a novel N-terminal region in the 3’-gene. Our main objective was to discover whether such intergenic spliced chimeric mRNAs can provide novel neoantigens that can be processed and presented by major histocompatibility complex (MHC) Class I molecules to CD8+ T cells. The ultimate goal of this work is to establish a framework for using immunogenic neopeptides generated from the novel amino acids at the fusion junctions of chimeric RNAs for the development of “off the shelf” tumor vaccines for breast cancer.

2 Materials and methods

2.1 Samples and controls

All tissue samples were obtained from archival formalin fixed, paraffin embedded (FFPE) blocks under a protocol approved by the MD Anderson Cancer Center Institutional Review Board. Tumor samples were obtained from women who met the following criteria: i) newly diagnosed breast cancer, ii) no prior history of breast cancer (primary disease), iii) undergoing surgery as the initial treatment modality, iv) no prior receipt of chemotherapy. In addition, only tumors from women with no known germline mutations and without a significant family history were included in order to enrich for sporadic cancers. Stage was not specifically selected for; however, all patients had non-metastatic disease. Seventy-five cases from cancer patients were used, 25 from each of the 3 main clinical subtypes: i) estrogen and/or progesterone positive and HER2 negative (referred to as hormone receptor [HR] positive), ii) HER2 positive regardless of HR status and iii) HR negative and HER2 negative (TNBC; triple negative). Four breast tissue samples from women without a cancer diagnosis were used as controls.

2.2 RNA extraction

RNA extraction was conducted using the Ambion Recoverall Total Nucleic Acid isolation kit (cat# AM19750, ThermoFisher) following the manufacturer’s recommendations. Briefly, tissue cores were crushed, placed in 1.5 ml tubes and washed three times with 100% xylene for 10 min. Tissues were then washed in 100% ethanol twice for 10 min followed by one wash in 95% ethanol for 10 min and another wash in 10% PBS, then allowed to air dry for 5 min. Tissues were then incubated in protease digestion buffer at 50°C for 3 hours followed by a 15 min incubation at 80°C, after which tissues were stored at -20°C until RNA isolation. At the time of RNA extraction, isolation additive and ethanol mix were added to each sample, which was then placed into the filter cartridge and centrifuged for 30 sec at 10,000xg. This was repeated 3 times followed by the addition of wash solutions and centrifugation. DNase was then added to the filter cartridge and incubated at room temperature for 30 min. RNA was then eluted by adding nuclease free water to the center of the filter cartridge, incubating for 5 min and centrifuging at maximum speed for 1 min. RNA was then stored at -80°C.

2.3 FFPE RNA quality control

Extracted RNA samples underwent quality control assessment using the RNA tape on a Tapestation 4200 (Agilent, RRID: SCR_019398). DV200 was calculated as the percentage of RNA fragments that are >200 nucleotides in size. All samples had a DV200 >30%, which is the recommended cutoff for RNA sequencing (Illumina Technical Pub. No. 470-2014-001, 2016). Samples were then quantified with a Qubit Fluorometer (ThermoFisher) for input into library preparation.
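
The DV200 definition above can be illustrated with a short sketch. It assumes the fragment-size trace is available as (fragment size in nucleotides, signal) pairs; the function name, the input layout, and the example values are hypothetical and do not reproduce the instrument software's own calculation.

```python
def dv200(trace, cutoff_nt=200):
    """Percentage of total signal contributed by fragments longer than cutoff_nt.

    trace: iterable of (fragment_size_nt, signal) pairs (hypothetical layout).
    """
    trace = list(trace)  # allow generators; the data are traversed twice
    total = sum(signal for _, signal in trace)
    if total <= 0:
        raise ValueError("trace contains no signal")
    above = sum(signal for size, signal in trace if size > cutoff_nt)
    return 100.0 * above / total


# Illustrative values only, not data from this study.
example_trace = [(100, 20.0), (150, 30.0), (250, 30.0), (500, 20.0)]
passes_qc = dv200(example_trace) > 30.0  # 30% cutoff cited in the text
```

In this toy trace half of the signal lies above 200 nucleotides, so the sample would clear the 30% cutoff.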

2.4 Transcriptome sequencing

The RNA libraries were prepared and sequenced at the University of Houston Seq-N-Edit Core per standard protocols. RNA libraries were prepared with the TruSeq RNA Exome kit (Illumina) using 30 ng input RNA. RNA was fragmented, reverse transcribed into cDNA and ligated with sequence adaptors. The size selection for libraries was performed using SPRIselect beads (Beckman Coulter). Enrichment for coding RNA was performed with coding region specific biotinylated capture probes, which were selected with streptavidin magnetic beads. Library purity was analyzed using the DNA 1000 tape on a Tapestation 4200 (Agilent, RRID: SCR_019398) and quantified with Qubit Fluorometer 2.0 (ThermoFisher, RRID: SCR_020553). The prepared libraries were pooled and sequenced using the NextSeq 500 (Illumina, RRID: SCR_016381), generating ~15 million 2×76 bp paired-end reads per sample.

2.5 RNA fusion detection

The RNA-seq raw fastq data was processed with CLC Genomics Workbench 20 (Qiagen). The Illumina sequencing adaptors were trimmed, and reads were mapped to the human reference genome hg38 Refseq GRCh38.p9 from the Biomedical Genomics Analysis Plugin 20.0.1 (Qiagen). Read alignment was represented as integer counts by using parameters of mismatch cost 2, insertion cost 3, deletion cost 3, length fraction 0.8, similarity fraction 0.8, max of 10 hits for a read. Integer read counts were normalized by Trimmed Means of M-values (TMM) algorithm (11). RNA fusions were detected using the detect fusion gene algorithm under the parameters of minimum length of unaligned sequence 15, maximum distance to exon boundary 10, maximum distances for broken pair fusions 1,000, assumed error rate 0.001, promiscuity threshold 7. The algorithm identifies fusion events based on the number of fusion crossing reads and fusion spanning reads. Refine fusion gene tool was used to re-count the number of fusion crossing reads and the novel RNA seq reads mapped against the fusion reference created in detect fusion genes. The fusion list was further refined by excluding those that were detected in both normal breast tissue controls and in paired adjacent normal tissue samples. Details of the false positive and negative filters applied are shown below.

False Positive filter: Most fusion callers that rely only on discordant paired-end reads are associated with false positive rates of ~50%. To reduce this rate, we introduced a filter that first extracts fusion candidates based on discordant paired-end reads and then removes candidates that are not supported by at least 1 junction-crossing read, i.e., a read that has to be split to map to two different genes on the reference genome.

False Negative filter: To capture fusions associated with small subpopulations of cells in pre-cancerous lesions and/or ‘cancer stem cells’ driving drug resistance and disease recurrence, we relaxed filters that eliminate candidates based on read numbers and included fusions supported by junction-crossing split reads mapping to two different genes, with at least 1 read in three independent patients across the 3 subtypes studied. Additionally, using the CLC Genomics Workbench, we included a secondary alignment of unmapped RNA-seq reads to a fusion reference sequence created in the initial detect fusion genes pipeline. This reduced the false negatives encountered with other fusion callers.
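
The two filters can be sketched as follows. This is a minimal illustration under stated assumptions, not the CLC Genomics Workbench implementation: the `Candidate` record, its field names, and the example calls are hypothetical, and the thresholds are simply those given in the text (at least 1 junction-crossing split read, in at least three independent patients).

```python
from collections import defaultdict
from typing import NamedTuple


class Candidate(NamedTuple):
    # Hypothetical record layout; not the CLC Genomics Workbench output format.
    fusion: str            # e.g. "GENEA-GENEB"
    patient: str
    discordant_pairs: int  # fusion-spanning (discordant paired-end) reads
    split_reads: int       # junction-crossing reads split across two genes


def filter_fusions(candidates, min_split_reads=1, min_patients=3):
    """Return fusions with split-read support in enough independent patients."""
    patients_by_fusion = defaultdict(set)
    for c in candidates:
        # False positive filter: discordant pairs alone are not sufficient.
        if c.discordant_pairs > 0 and c.split_reads >= min_split_reads:
            patients_by_fusion[c.fusion].add(c.patient)
    # Relaxed read-count filter: low depth is tolerated, recurrence is required.
    return sorted(
        fusion
        for fusion, patients in patients_by_fusion.items()
        if len(patients) >= min_patients
    )


# Toy calls for illustration only.
calls = [
    Candidate("A-B", "P1", 4, 1),
    Candidate("A-B", "P2", 2, 1),
    Candidate("A-B", "P3", 3, 2),
    Candidate("C-D", "P1", 9, 0),  # many discordant pairs, no split read
    Candidate("C-D", "P2", 5, 0),
    Candidate("C-D", "P4", 6, 0),
    Candidate("E-F", "P1", 3, 5),  # well supported, but in one patient only
]
kept = filter_fusions(calls)
```

Here only "A-B" survives: "C-D" lacks any junction-crossing read, and "E-F" is not recurrent across patients.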

2.6 Validation of junction sequence

cDNA from whole transcriptome sequencing underwent PCR amplification across the NSFP1-LRRC37A2 fusion junction site using Forward Primer (5’-GCCTGCAAGTGACGAGAG-3’) and Reverse Primer (5’-CGGTCCAACTGTATGCTTTC-3’). DreamTaq DNA polymerase (ThermoFisher Scientific; Cat.# EP0701) was used in a 30-cycle PCR reaction. Amplicon size was analyzed using the High Sensitivity DNA 1000 tape on a Tapestation 4200 (Agilent, RRID: SCR_019398).

2.7 Validation of junction sequence: cloning & Sanger sequencing

The PCR amplicon was inserted into a pJET1.2 vector as per the sticky-end cloning protocol provided by the manufacturer (CloneJET PCR Cloning Kit; ThermoFisher Scientific; Cat.# K1232). The ligation mixture was directly transformed into the provided competent cells and plated on Ampicillin-LB agar plates. Plates were incubated overnight at 37°C. After incubation, 4 colonies were selected per plate to confirm the DNA insert. A PCR was performed to validate the junction sequence using the primers for NSFP1-LRRC37A2. Colonies carrying the amplicon were grown in Ampicillin LB broth at 37°C in a shaking incubator overnight. Plasmid extraction from the bacterial cultures was carried out using manufacturer supplied protocols (QIAprep Spin Miniprep Kit; Qiagen; Cat.# 27104) and the plasmids were verified using Sanger sequencing.

2.8 Neoantigen predictions

Our neoantigen prediction pipeline is described in Shao et al. (12). Neopeptide regions were delineated from the 2 major ORFs predicted from the NSFP1 [Exon 1-13] - LRRC37A2 [Exon 2-14] fusion. To assess the immunogenicity of our predicted neopeptides in relation to 118 MHC class I haplotypes found in humans, we utilized a neoantigen prediction platform, MHCnuggets. Peptides of 8 amino acids encompassing two major ORFs generated from the NSFP1-LRRC37A2 fusion were analyzed. The HLA genotypes extracted by the RNA-seq fusion caller from the 75 samples served as input to MHCnuggets to predict the MHC class I binding potential (IC50 nM) of each peptide region from wild-type and neoantigen peptide regions of the two truncated proteins. Neoantigen candidates meeting an IC50 affinity < 500 nM were subsequently ranked based on MHC binding. Anchor and auxiliary anchor residues for neopeptide-HLA class I allele pairs were evaluated by the SYFPEITHI online tool (13).
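
The step of collecting 8-mer peptides around the novel junction residues can be sketched as a sliding window. This is an illustration only: it assumes the novel residues occupy a known index range within a predicted ORF, the function name, toy sequence, and indices are hypothetical, and the sketch does not call MHCnuggets or reproduce the pipeline of Shao et al.

```python
def junction_kmers(protein, novel_start, novel_end, k=8):
    """Return every k-mer that contains at least one novel residue.

    The novel residues are protein[novel_start:novel_end] (0-based, end exclusive).
    """
    kmers = []
    for i in range(len(protein) - k + 1):
        # Window [i, i + k) overlaps the novel region [novel_start, novel_end).
        if i < novel_end and i + k > novel_start:
            kmers.append(protein[i:i + k])
    return kmers


# Toy 20-residue sequence; residues 10-13 stand in for the novel junction region.
toy_protein = "ACDEFGHIKLMNPQRSTVWY"
candidates = junction_kmers(toy_protein, novel_start=10, novel_end=14)
```

Each candidate window would then be scored against every HLA allele of interest, and only peptide-allele pairs with a predicted IC50 below 500 nM would be retained.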

2.9 Peptide library generation

The peptide library consisted of 15 neoantigenic 8-mer peptides discovered from the NSFP1 Exon 1-13 truncation ORF and the LRRC37A2 Exon 2-14 truncation ORF. It was synthesized and purified using standard solid-phase synthetic peptide chemistry and Reverse Phase High Performance Liquid Chromatography (ThermoFisher Scientific PEPotec). These peptides were reconstituted to 1 mg/mL concentrations under sterile conditions. An 8-mer peptide used by the manufacturer to standardize the peptide library, which was confirmed to be a peptide of no biological significance, was used as a Negative Peptide Control (NCP) to validate the effect of stimulation by a synthetic peptide. A commercially available Cytomegalovirus (CMV) peptide pool (MabTech; Cat.# 3619-1) containing 42 CMV peptides, of which 28 are MHC class I restricted and 14 are MHC class II restricted, was used as the positive control.

2.10 Human primary cells

The HLA class C07:02 matched human Peripheral Blood Mononuclear Cells (PBMCs) from a healthy donor were acquired (STEMCELL Technologies) and were stored in liquid nitrogen until use.

2.11 Culture medium

Complete media consisted of RPMI-1640 growth media with L-glutamine (Gibco; Cat.# 61870036) supplemented with 10% heat-inactivated fetal bovine serum (GenDEPOT; Cat.# F0601-050), 0.1 mmol/L nonessential amino acids (Corning; Cat.# 25-025-CI), 10 μg/ml Cellmaxin (GenDEPOT; Cat.# C3319-006), and 0.5 mg/mL Amphotericin B (Gibco; Cat.# 15290026).

2.12 In vitro stimulation of PBMCs using peptides

PBMCs were retrieved from liquid nitrogen, thawed in a water bath at 37°C, and washed with culture medium warmed to 37°C, as previously described in the primary cell thawing protocol by Stem Cell Technologies. Cells were incubated at 37°C, 5% CO2 for 24 hours (Cell Resting). After resting, cells were seeded at a concentration of 1 × 10⁶/mL in 6-well plates with culture medium containing IL-2 (10 IU/ml), IL-7 (10 ng/ml), and IL-15 (10 ng/ml). The cells of the Negative (Unstimulated) control (NC) wells were not treated with any peptides but were supplemented with the growth medium and cytokines required for growth and proliferation and were maintained at the same growth conditions as the cells of wells treated with the neoantigenic peptides. The cells of the CMV positive control wells were treated with 1 μg/ml of the CMV peptide pool and were supplemented with media and growth conditions identical to those of the test peptide wells. The 15 neoantigenic 8-mer test peptides were added to the respective wells at 2 μg/ml and the plates were incubated at 37°C, 5% CO2 for 4 days. On day 5, 50% of the medium was replaced with fresh medium, and cells were cultured for an additional 5 days. A second round of peptide restimulation was carried out with the corresponding peptides coupled with the cytokine medium before the cells were used for the ELISpot assay.

2.13 Isolation of CD8+ T cells from PBMCs

On Day 13, untouched CD8+ T cells were isolated from PBMCs by magnetic negative selection using the MojoSort™ Human CD8+ T Cell Isolation Kit (BioLegend; Cat.# 480012) following the manufacturer’s instructions.

2.14 IFN-γ ELISpot assay

To evaluate the peptide-stimulated CD8+ T cell immune response, IFN-γ production by cells stimulated with the predicted neoantigenic peptides was quantified using a commercially available Human IFN-γ ELISpot kit (CTL ImmunoSpot, Cellular Technology Ltd), following the instructions of the manufacturer. The plate was read with an ELISpot reader (CTL counter, Cellular Technology Ltd). The cell culture medium used to incubate the cells in the ELISpot plate was augmented with anti-CD28 antibody (1 μg/ml) and corresponding peptides (2 μg/ml).

2.15 Statistical analysis

Positive response to the assay was defined using a threshold minimum of 20 Spot Forming Units (SFUs)/10⁶ cells in experimental wells after subtracting the unstimulated background (mean number of SFUs generated by the NC wells). To compare immune responses generated by the neoantigenic peptides, SFUs generated by the wells stimulated with the neoantigenic peptides were compared with those of the wells stimulated with the CMV peptide pool. ELISpot data were analyzed by Mann-Whitney U test, without correction for multiple comparisons, using GraphPad Prism 9.0 (RRID: SCR_002798). Each row was analyzed individually, without assuming consistent standard deviation. Data are represented as mean ± SEM. For all analyses, significance threshold was considered as *, P ≤ 0.05.
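
The background subtraction and positivity threshold can be sketched as below. The function names and well counts are hypothetical, and applying the 20 SFU per 10⁶ cells threshold to the mean of the background-subtracted replicate wells is an assumption of this sketch; the Mann-Whitney comparison performed in GraphPad Prism is not reproduced here.

```python
from statistics import mean


def background_subtracted(well_sfu, negative_control_sfu):
    """Subtract the mean SFU count of the unstimulated (NC) wells from each well."""
    background = mean(negative_control_sfu)
    return [sfu - background for sfu in well_sfu]


def is_positive(well_sfu, negative_control_sfu, min_sfu_per_million=20.0):
    """Assumed rule: mean background-subtracted SFUs per 10^6 cells meets the threshold."""
    corrected = background_subtracted(well_sfu, negative_control_sfu)
    return mean(corrected) >= min_sfu_per_million


# Toy SFU counts per 10^6 cells, for illustration only (not study data).
nc_wells = [10, 14, 12]
peptide_a_wells = [40, 36, 44]
peptide_b_wells = [25, 30, 20]

peptide_a_positive = is_positive(peptide_a_wells, nc_wells)
peptide_b_positive = is_positive(peptide_b_wells, nc_wells)
```

With a background of 12 SFUs, the first toy peptide averages 28 SFUs above background and passes, while the second averages 13 and does not.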

3 Results

3.1 Twenty highly prevalent fusion transcripts were discovered across 3 breast cancer subtypes

With the goal of discovering RNA-fusions that can be targeted for neoantigen peptide candidates, we performed RNA-Sequencing of triple negative (TNBC), HER2+ and hormone receptor positive (HR+) breast cancer samples (n=25 each). Mining the sequence reads (i) that were discarded due to discordant paired-end reads and (ii) that were supported by split-reads (junction crossing reads) we found a large number of chimeric fusion RNAs. These were then cross referenced with the TCGA Multi-Center Breast Cancer Dataset. We uncovered 20 fusion RNAs with high prevalence across the set of 75 tumor samples and also detected in 1 or more of the TCGA samples. To eliminate false positives, we also required a given fusion to be present within more than one dataset discovered by an independent fusion caller (CLC Genomics Workbench and University of Chicago fusion caller). Table 1 shows the comprehensive list of fusion transcripts with the number of samples in each subtype that was found to carry the fusion in the tumor.

Table 1 Top 20 novel prevalent chimeric RNAs discovered in TNBC, HER2+, and HR+ patient sample gene fusions after comparison to normal samples.

The average number of junction crossing reads as well as the exon boundaries of the 5’ and 3’ genes in both our dataset and TCGA are also presented. Of the 20 novel fusions found, 4 were identified with a frequency of 10% or greater in the MD Anderson Cancer Center (MDACC) cohort. The NSFP1-LRRC37A2 fusion transcript was selected for further study because it was associated with the highest number of junction crossing reads (TNBC=218, HER2+=274, HR+=217) and was detected with the highest frequency across the 75 tumor samples (9/75 = 12%; TNBC=2 samples, HER2+=2 samples and HR+=5 samples). Furthermore, it was also present in 5 samples in the TCGA breast cancer dataset previously analyzed with filters that traditionally exclude fusions found in adjacent normal tissue. TCGA, however, did not remove fusions found in cancer free controls, as was done in this study.

3.2 Exon boundaries of the NSFP1-LRRC37A2 fusion map to exon 13 of NSFP1 (5’-boundary) and exon 2 of LRRC37A2 (3’-boundary)

NSFP1 (N-ethylmaleimide sensitive factor, vesicle fusing ATPase, transcript variant 1 pseudogene) and LRRC37A2 (Leucine Rich Repeat Containing 37 Member A2) are located in 17q21.31. To compile the NSFP1-LRRC37A2 fusion junction, we mapped the consensus junction sequence compiled from the complete set of junction crossing reads extracted from fusion positive samples to hg38 Refseq GRCh38.p9. The 5’-boundary of NSFP1-LRRC37A2 was found to be located on Exon 13 of NSFP1 (NR_033799.1) and the 3’-boundary mapped to Exon 2 (NM_001006607.3) of LRRC37A2, located immediately 3’ to NSFP1 on the coding strand of both genes. The boundaries were consistent and supported by 986 junction-crossing reads (TNBC=218, HER2+=274 and HR+=217) with the breakpoint sequence always AAACCA-3’ on the NSFP1 gene and 5’-AAATTC on LRRC37A2. The 5 samples found to be positive for the NSFP1-LRRC37A2 fusion in the TCGA dataset (an independent set of samples) also contained the same exon boundaries. The fusion junction and the exon boundaries model for the NSFP1-LRRC37A2 fusion are shown in Figure 1. The consensus junction sequence and the cDNA for the fusion transcript are shown in Supplemental Figure 1. The fusion junction supported by 986 junction crossing reads was validated by amplicon PCR assay as shown in Figure 2. We expected a 121 bp PCR fragment from the PCR amplicon generated using a Forward Primer located on NSFP1 (5’-GCCTGCAAGTGACGAGAG-3’) and a Reverse Primer located on LRRC37A2 (5’-CGGTCCAACTGTATGCTTTC-3’). The PCR amplicons of 121 bp cloned into the positive selection cloning vector were Sanger sequenced to further validate the presence of the fusion junction. The chromatogram acquired through Sanger sequencing is also shown in Figure 2. The same exon boundary of NSFP1 Exon 13 and LRRC37A2 Exon 2 identified by the CLC Genomics Workbench 20.0 (Qiagen) on the breast cancer dataset presented here was also found in the fusions uncovered in the TCGA and MDACC datasets.

Figure 1 Genomic mapping of junction crossing reads for NSFP1-LRRC37A2. (A) The fusion junction sequence. The sequence of the junction-crossing read extracted from 986 sequence reads from 75 samples (25 Tumor samples – 3 subtypes) is shown. The segment of the reads that map to NSFP1 and LRRC37A2 is shown in Blue and Red respectively. (B) A model of the novel fusion transcript NSFP1-LRRC37A2. The junction site is shown in green between exon 13 of NSFP1 and exon 2 of LRRC37A2.

Figure 2 NSFP1-LRRC37A2 Fusion PCR validation. (A) One fusion junction positive sample from each subtype was chosen to be validated by PCR. Capillary gel electrophoresis was used to detect the 121 bp amplicon fragment, representing the NSFP1-LRRC37A2 fusion. (B) The Sanger sequencing chromatogram of the PCR amplicons cloned into plasmids and sequenced. The junction site of the fusion between exon 13 of NSFP1 and exon 2 of LRRC37A2 is shown in blue in the chromatogram.

3.3 Novel fusion junctions from the NSFP1-LRRC37A2 fusion transcript variants contain two major ORFs generating two truncated proteins

The major open reading frames (ORFs) predicted from the NSFP1 [Exon 1-13] - LRRC37A2 [Exon 2-14] fusion are shown in Figure 3. Two regions of unique amino acid residues carrying neopeptides were uncovered from the 2 major ORFs predicted from the NSFP1 [Exon 1-13] - LRRC37A2 [Exon 2-14] fusion. The truncated NSFP1 protein yielded the unique peptide fragment KFPRKLYFLH at the C-terminal end of NSFP1 Exon 13 fused with the beginning of LRRC37A2 Exon 2. The truncated LRRC37A2 protein yielded the unique peptide fragment MISNQN at the N-terminal end of LRRC37A2 Exons 2-14 (unique amino acids contributed by Exon 13 of NSFP1). To assess the immunogenicity of our predicted neoantigens, a total of 15 peptides of 8–11 amino acids extracted from the 2 major ORFs generated from the NSFP1-LRRC37A2 fusion were processed through the neoantigen prediction platform, MHCnuggets, which evaluates binding of somatic peptides to MHC class I, antigen processing, self-similarity and gene expression (12). A total of 106 HLA genotypes served as input to MHCnuggets to predict the MHC class I binding potential (IC50 nM) of each peptide region. Neoantigen candidates meeting an IC50 affinity < 500 nM were subsequently ranked based on MHC binding. Anchor and auxiliary anchor residues for neopeptide-HLA class I allele pairs were evaluated by the SYFPEITHI online tool (13). These peptides were then rank ordered for binding affinity to the greatest number of MHC class I alleles (promiscuity), antigen processing, and self-similarity. To identify the most promiscuous peptides, which have been shown to be strong vaccine candidates (14), we ranked the peptides by the number of HLA Class I alleles that each peptide bound to at a binding affinity threshold of IC50 < 500 nM. The promiscuity distribution plot for the complete set of peptides generated from the NSFP1-LRRC37A2 fusion is shown in Figure 4.
While many of the peptides bind to fewer than 10 MHC class I alleles, a small fraction bind to >20 MHC alleles; these were investigated further. We uncovered 10 and 5 immunogenic neoantigen peptides from the truncated NSFP1 protein variant and the truncated LRRC37A2 protein variant, respectively. Table 2 presents data from the selected neoepitopic regions with HLA class I IC50 affinities of < 1000 nM, < 500 nM and < 50 nM. Previous studies have reported that predicted antigens with IC50 < 50 nM bind too strongly and do not initiate an immune response, so we chose to pursue MHC class I alleles with a binding affinity of IC50 < 500 nM (15). A total of 10 different 8-mer neoantigen peptides discovered from the NSFP1 Exon 1-13 truncation ORF were predicted to bind to a total of 28 unique MHC class I alleles with a binding affinity of IC50 < 500 nM (Table 3). A total of 5 different 8-mer neoantigen peptides discovered from the LRRC37A2 Exon 2-14 truncation ORF were predicted to bind to a total of 7 unique MHC class I alleles with a binding affinity of IC50 < 500 nM. The unique set of MHC Class I alleles binding the immunogenic neoantigens from NSFP1 and LRRC37A2 truncations are shown in Supplementary Table 1.
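
The promiscuity ranking can be sketched as a count of alleles per peptide below the IC50 threshold. This is a minimal illustration, assuming predicted affinities are available as a mapping from (peptide, allele) to IC50 in nM; the function name, peptide labels, and affinity values are hypothetical and are not MHCnuggets output.

```python
from collections import defaultdict


def rank_by_promiscuity(ic50_nm, threshold_nm=500.0):
    """Rank peptides by the number of alleles bound with IC50 below threshold_nm.

    ic50_nm: dict mapping (peptide, allele) -> predicted IC50 in nM.
    Returns (peptide, allele_count) pairs, most promiscuous first.
    """
    alleles_by_peptide = defaultdict(set)
    for (peptide, allele), ic50 in ic50_nm.items():
        if ic50 < threshold_nm:
            alleles_by_peptide[peptide].add(allele)
    counts = [(peptide, len(alleles)) for peptide, alleles in alleles_by_peptide.items()]
    # Sort by descending allele count, then alphabetically for a stable order.
    return sorted(counts, key=lambda item: (-item[1], item[0]))


# Toy predictions for illustration only.
toy_predictions = {
    ("PEPTIDEA", "HLA-A*02:01"): 120.0,
    ("PEPTIDEA", "HLA-B*07:02"): 480.0,
    ("PEPTIDEA", "HLA-C*07:02"): 900.0,   # above threshold, not counted
    ("PEPTIDEB", "HLA-C*07:02"): 35.0,
    ("PEPTIDEC", "HLA-A*02:01"): 2500.0,  # no allele below threshold
}
ranked = rank_by_promiscuity(toy_predictions)
```

Peptides with no allele below the threshold drop out of the ranking entirely, which mirrors the selection of candidates at IC50 < 500 nM.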

Figure 3 NSFP1- LRRC37A2 fusion transcript predicted ORFs. The cDNA sequence generated from the NSFP1-LRRC37A2 fusion model was analyzed through the NCBI-Open Reading Frame (ORF) Finder. Two major ORFs consistent with two truncated proteins that are predicted from the NSFP1 [Exon 1-13] - LRRC37A2 [Exon 2-14] fusion transcript were uncovered. The NSFP1 [Exon 1-13] 3’-end truncation yielded an ORF of 500 amino acids. The LRRC37A2 [Exon 2-14] 5’-end truncation yielded an ORF of 835 amino acids.

Figure 4 NSFP1-LRRC37A2 Fusion Model and Immunogenic Neoantigen Peptide Fragments. (A) The distribution model shows the promiscuity of peptides binding to MHC Class I alleles. The X-axis is the number of MHC Class I alleles and the Y-axis is the number of total peptides found. While a majority of peptides bind fewer than 10 MHC Class I alleles, a small fraction bind to >20 and are considered to be highly promiscuous. (B) The unique peptide junction regions predicted from the NSFP1 [Exon 1-13] - LRRC37A2 [Exon 2-14] fusion transcript are shown here. The immunogenic peptides generated through the MHC Class I binding predictor (MHCnuggets) from the NSFP1 [Exon 1-13] C-terminal truncation are shown above the fusion transcript model and those from the LRRC37A2 [Exon 2-14] N-terminal truncation are shown below. Amino acid residues from NSFP1 and LRRC37A2 are shown in blue and red, respectively. The unique amino acids formed at the fusion junction are shown in black.

Table 2 Predicted immunogenic neo-antigen peptide fragments from the NSFP1 [Exon 1-13]-LRRC37A2 [Exon 2-14] Fusion with MHC Class I partners.

Table 3 Immunogenic neo-antigen peptide fragments from the NSFP1 [Exon 1-13]-LRRC37A2 [Exon 2-14] Fusion predicted to bind with MHC Class I alleles at IC50<500nM.

3.4 CD8+ T cell immune responses were elicited by 1 out of 15 candidate fusion neopeptides

To determine if the predicted neopeptides induced CD8+ T cell immune responses in vitro, IFN-γ secretion of PBMCs was evaluated through ELISpot. The IFN-γ secretion of the cells stimulated with the 15 neopeptides was compared to that of PBMCs stimulated with a CMV peptide pool as a positive control. The Negative (Unstimulated) Control is an essential component of an ELISpot assay as it helps determine the non-specific signal or background caused by cytokines necessary for the growth and proliferation of PBMCs. To account for this non-specific effect, the mean Spot Forming Units (SFUs) generated by the Negative control wells are subtracted from the SFUs generated by all the wells on the plate. This subtraction distinguishes the specific immune response induced by the antigen of interest from the background signal resulting from cytokines present in the unstimulated control wells. A Mann-Whitney U test was performed to compare the mean number of SFUs/10⁶ cells developed for each experimental peptide with that of the CMV positive control. The peptide ENDIKPKF (p=0.0417) was identified as the only neoantigenic peptide candidate that satisfied the set parameters for a positive response, including p<0.05. This peptide (ENDIKPKF) exhibits a response which is approximately 2-fold greater than the response shown by the CMV positive control and 5-fold greater than the response shown by the unrelated peptide stimulated cells (Figure 5).

Figure 5 Human IFN-γ ELISpot assay using predicted immunogenic peptides of NSFP1-LRRC37A2. PBMCs from an HLA-matched healthy donor were stimulated with the 15 predicted immunogenic peptides and analyzed via IFN-γ ELISpot. Data are represented as mean ± SEM. For the analysis, the significance threshold was set at *, P ≤ 0.05.

4 Discussion

Chimeric RNAs generated through chromosomal rearrangements (translocations, deletions, duplications and inversions), trans-splicing or read-through transcription have been proposed as reagents for developing tumor vaccines (16). Neoantigens generated from fusion transcripts have been reported to be better candidates for developing tumor vaccines because they are usually associated with significantly higher immunogenic potential than neoantigens based on point mutations, SNVs or indels (3). Unique junctions formed in translated chimeric RNAs can generate tumor-specific neoantigens, which can be exploited to design tumor vaccines for peptide-mediated T-cell activation and immunotherapies targeting cancer cells (3, 16). Our data suggest that chimeric RNAs are prevalent in breast tumors, provide a large number of novel fusions and generate immunogenic peptides that can elicit CD8+ T cell responses, thus providing an expanded repertoire for the development of breast cancer vaccines.

Breast cancer has a low mutational burden and therefore provides limited opportunities for peptide vaccine development. The chimeric RNAs that we uncovered, and the relatively large number of associated immunogenic peptides, open the door for cancer vaccines in these tumors with relatively few somatic mutations. The majority of the fusions discovered in our set of 75 cancer cases showed low frequency (present in 1-2 patients, ≤ 3% of the MDACC cohort). This is consistent with data from the TCGA Pan Cancer dataset, which similarly showed that the overwhelming majority of fusions were private (17). Using computational approaches, the TCGA Pan Cancer study also determined the relative immunogenicity of neoantigens generated from fusions and reported that neopeptides derived from private fusions appeared to be more immunogenic than candidate neoantigens derived from highly frequent fusion events. While intriguing, these data lack direct in vitro/in vivo validation, and thus the relationship between the frequency with which neoantigens are identified in the population and their ability to elicit a robust immune response remains unclear. Our data show that some chimeric RNAs, such as NSFP1-LRRC37A2, occur at a frequency in line with other therapeutic targets such as HER2/neu in breast cancer and EGFR in lung cancer, opening the door to an “off the shelf” peptide vaccine targeting tumors with these alterations, similar to targeted therapeutic strategies in breast and lung cancer.

In order to increase the sensitivity and specificity of fusion discovery, we employed a unique strategy that incorporated two filters to decrease the false positive and false negative rates of fusion detection. First, we reduced the false positive rate by focusing exclusively on split reads crossing fusion junctions that are associated with discordant paired-end reads bringing together two independent genes, extracting only chimeric RNAs that are not present in normal breast tissue. Second, we decreased the false negative rate by including fusions that are present in adjacent normal samples (typically excluded by other ‘fusion callers’) but absent in normal breast tissue from cancer-free patients. Additionally, this approach excludes chimeric RNAs that may be found in normal cells and have no impact on tumorigenesis or cancer progression (18). A number of fusion callers have been developed and published to extract fusion junctions from chimeric RNAs in RNA-seq data. Vu et al. and Haas et al. have each compared and benchmarked 15 gene fusion identification tools, which are contingent on the accuracy of the transcriptome mapping (19, 20). Read length, quality scores and the number of reads supporting each fusion were reported as the top limitations associated with fusion callers using short reads (21). De novo assembly-based approaches yielding longer contigs have been reported to reduce the limitations of short-read alignment but are computationally intensive (20, 22). SeekFusion, developed by Balan et al., is designed to leverage de novo assembly and alignment-based approaches to increase accuracy using PCR-UMI-based amplicon RNA-Seq (23). Taking into account the extensive body of prior work on fusion callers, we used a multi-layered strategy to minimize false positives and false negatives. The key elements include 1) de novo assembly of RNA-seq data using the CLC Genomics Workbench 20 (Qiagen) to reduce false positives from shared repeat sequences in the genome; 2) filters to remove false positives arising from mis-mapping of reads to shared sequences in gene family members and/or pseudogenes when they exist (3, 24); and 3) heavy reliance on fusions supported by split reads in multiple samples and reported by other fusion callers in independent datasets (i.e. TCGA).
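The two-filter logic described in this paragraph can be illustrated with a short Python sketch. It is a simplified illustration under stated assumptions, not the pipeline used in the study: the `FusionCall` record, its field names, the read-count thresholds and the example gene names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FusionCall:
    """One candidate fusion (hypothetical record layout)."""
    name: str
    split_reads: int             # reads crossing the fusion junction
    discordant_pairs: int        # read pairs mapping to the two partner genes
    in_adjacent_normal: bool     # seen in tumor-adjacent normal tissue
    in_cancer_free_normal: bool  # seen in breast tissue from cancer-free donors


def keep_fusion(call, min_split_reads=1, min_discordant_pairs=1):
    """Apply both filters; the thresholds are assumed, not from the paper."""
    # Filter 1 (fewer false positives): require junction-spanning split reads
    # that are backed by discordant paired-end reads.
    has_support = (call.split_reads >= min_split_reads
                   and call.discordant_pairs >= min_discordant_pairs)
    # Filter 2 (fewer false negatives): presence in adjacent normal tissue is
    # tolerated; only presence in cancer-free normal tissue excludes a call.
    return has_support and not call.in_cancer_free_normal


# Placeholder calls with invented gene names and counts.
calls = [
    FusionCall("GENE_A-GENE_B", 40, 12, True, False),
    FusionCall("GENE_C-GENE_D", 25, 0, False, False),
    FusionCall("GENE_E-GENE_F", 30, 9, False, True),
]
kept = [c.name for c in calls if keep_fusion(c)]
```

In this toy input only the first call survives: the second lacks discordant-pair support and the third is present in cancer-free normal tissue.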

With the ultimate goal of identifying immunogenic peptide antigens that are broadly shared among breast cancer patients, we selected the NSFP1-LRRC37A2 fusion transcript based on its frequency in our tumor samples (found in 12% of samples tested) and its presence in 5 samples in the TCGA breast cancer dataset. LRRC37A2 and NSFP1 were previously predicted by the ChimeRScope pipeline to generate a fusion transcript in the opposite orientation (LRRC37A2-NSFP1) in a natural killer cell line (25). However, that study did not report the fusion junction site or exon boundaries due to poor sequence quality of the amplified PCR product (25). Increased read depths made possible by decreased costs for RNA-seq applications have uncovered an increasing number of non-genetic gene fusions arising from intergenic cis- or trans-splicing that are emerging as new biomarkers and therapeutic targets for cancer (26). The NSFP1-LRRC37A2 fusion is consistent with transcriptional read-through of the NSFP1 pseudogene, truncated at Exon 13, into LRRC37A2 located immediately 3’, followed by a cis-splicing event between Exon 13 of NSFP1 and Exon 2 of LRRC37A2 (Figure 1; Supplemental Figure 1). The relatively high degree of recurrence (12% of 75 patients) across 3 subtypes of breast cancer in our study, together with 5 subjects in the TCGA breast tumor cohort, makes it a highly attractive candidate for targeted therapies. The relatively low read numbers supporting the LRRC37A2-NSFP1 fusion junction (average of 217-274 reads across the 75 samples), validated through PCR, suggest that the fusion is likely present in a small subpopulation of cells in the tumor samples. Carter et al. and Cai et al. (27, 28), using clonal mutation analysis, also report that tumor purity, heterogeneity and ploidy can result in variable cancer cell fractions in samples from cancer patients. However, non-genetic fusions such as the one reported here have no corresponding DNA changes, which are needed to compute the cancer cell fraction (CCF) for each mutation.

Gene fusions have been reported to function as tumorigenic events in 16.5% of cancers and appear to be druggable in 6% of cases. The recurrent fusions commonly associated with breast cancer, and their potential impact on the development of new cancer therapies, are discussed by Loo et al. and Gao et al. (29, 30). The most significant recurrent fusions reported in breast malignancies that represent therapeutic vulnerabilities and could benefit from targeted therapies include ESR1-CCDC170, ESR1 exon 6 fusions, BCL2L14-ETV6, ETV6-NTRK3 and MYB-NFIB. ESR1-CCDC170 and ESR1 exon 6 fusions have been reported to result in estrogen resistance and metastatic transformation in Luminal B breast cancer (31–33). BCL2L14-ETV6 is found in 6-12% of TNBC (34), and BCL2L14-ETV6 fusions reported in TNBC have been shown to result in EMT and paclitaxel resistance (35). The MYB-NFIB fusion is carried by 83% of adenoid cystic carcinomas (ACC) of the breast, a rare type of TNBC (36). ETV6-NTRK3 has been reported in secretory breast carcinoma (SBC). ETV6-NTRK3 and MYB-NFIB have been established to be cancer drivers (37, 38). Kinase fusions are currently being evaluated in breast cancer clinical trials, and ongoing mechanistic investigation is exposing therapeutic vulnerabilities in patients with fusion-positive disease.

The NSFP1-[Exon-1-13]-KFPRKLYFLH C-terminal truncation and the MISNQ-LRRC37A2-[Exon-2-14] N-terminal truncation together were found to generate 15 predicted immunogenic neoantigens with the potential to be processed and presented by 28 different MHC Class I alleles with a binding affinity of IC50<500nM. Of the 15 peptides predicted to be immunogenic from the fusion junction, 8 showed binding affinity (IC50<500nM) to the tested HLA class I allele, HLA-C*07:02. The peptide ENDIKPKF, which showed the highest binding affinity (IC50 = 39 nM) among all the peptides predicted to bind to HLA-C*07:02, was the only candidate that satisfied the p<0.05 cutoff in the ELISpot assay (39).
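The IC50 cutoff described above amounts to a simple threshold and ranking step, sketched below in Python. Only the ENDIKPKF entry (HLA-C*07:02, IC50 = 39, assumed here to be in nM) is taken from the text; the other two rows are labeled placeholders with invented affinities, and the function name and tuple layout are assumptions made for illustration.

```python
IC50_CUTOFF_NM = 500.0  # binding threshold quoted in the text (IC50 < 500 nM)

# (peptide, allele, predicted IC50 in nM)
predictions = [
    ("ENDIKPKF", "HLA-C*07:02", 39.0),         # value reported in the text
    ("PLACEHOLDER_1", "HLA-C*07:02", 320.0),   # hypothetical row
    ("PLACEHOLDER_2", "HLA-C*07:02", 1800.0),  # hypothetical row
]


def predicted_binders(preds, allele, cutoff_nm=IC50_CUTOFF_NM):
    """Return (peptide, IC50) pairs below the cutoff, strongest binder first."""
    hits = [(peptide, ic50) for peptide, hla, ic50 in preds
            if hla == allele and ic50 < cutoff_nm]
    # A lower IC50 means tighter predicted binding, so sort ascending.
    return sorted(hits, key=lambda hit: hit[1])


binders = predicted_binders(predictions, "HLA-C*07:02")
top_peptide = binders[0][0]
```

A lower IC50 corresponds to stronger predicted binding, which is why the list is sorted in ascending order and ENDIKPKF ranks first in this toy input.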

In summary, we describe an untapped framework for discovery of neoantigens in breast cancer, generated through novel ORFs created from intergenically spliced mRNA transcripts. This novel pool of neopeptides broadens the opportunities for development of vaccines in breast cancer.

Data availability statement

The original contributions presented in the study are publicly available. This data can be found here: NCBI, accession number PRJNA1004862.

Ethics statement

The studies involving humans were approved by Institutional Review Board at MD Anderson Cancer Center under the protocol PA-16-0112. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

BM, IB and PG conceptualized and designed the study. BM, SR, MC, AB, MR and HR contributed to the data acquisition and interpretation as well as in methodology and analysis. CA selected patient samples and oversaw the assembly of patient sample cores that were used for RNA extraction. BM, SR, MC, MR, AB, RK, IB and PG were major contributors in writing, review and editing the manuscript. All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.

Funding

This work was supported by funds from the Moores Professorship to PG; 1U01CA189240-01 grant to IB and RE-Z (M-PI). BM, SR, MC and HR were supported in part by a grant from the McCammon Foundation.

Acknowledgments

We would like to acknowledge bioinformatics and sequencing support from the UH-Sequencing & Gene Editing Core and the contributions from the USAEOP/REAP funded internship program participants for exon-boundary analysis of the RNA fusions led by mentors Dr. Kimberly Holloway, Sudhili Fernando, Abhinav Vadassery, and interns Tanya Roysam, Fernando Peraza and Aprameya Sudharsan.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1188831/full#supplementary-material

References

1. Li L, Goedegebuure SP, Gillanders WE. Preclinical and clinical development of neoantigen vaccines. Ann Oncol (2017) 28(suppl_12):xii11–xii7. doi: 10.1093/annonc/mdx681

2. Tran E, Robbins PF, Rosenberg SA. 'Final common pathway' of human cancer immunotherapy: targeting random somatic mutations. Nat Immunol (2017) 18(3):255–62. doi: 10.1038/ni.3682

3. Wei Z, Zhou C, Zhang Z, Guan M, Zhang C, Liu Z, et al. The landscape of tumor fusion neoantigens: A pan-cancer analysis. iScience (2019) 21:249–60. doi: 10.1016/j.isci.2019.10.028

4. Anagnostou V, Smith KN, Forde PM, Niknafs N, Bhattacharya R, White J, et al. Evolution of neoantigen landscape during immune checkpoint blockade in non-small cell lung cancer. Cancer Discov (2017) 7(3):264–76. doi: 10.1158/2159-8290.CD-16-0828

5. Gartner JJ, Parker SC, Prickett TD, Dutton-Regester K, Stitzel ML, Lin JC, et al. Whole-genome sequencing identifies a recurrent functional synonymous mutation in melanoma. Proc Natl Acad Sci USA (2013) 110(33):13481–6. doi: 10.1073/pnas.1304227110

6. Ott PA, Hu Z, Keskin DB, Shukla SA, Sun J, Bozym DJ, et al. An immunogenic personal neoantigen vaccine for patients with melanoma. Nature (2017) 547(7662):217–21. doi: 10.1038/nature22991

7. Aldous AR, Dong JZ. Personalized neoantigen vaccines: A new approach to cancer immunotherapy. Bioorg Med Chem (2018) 26(10):2842–9. doi: 10.1016/j.bmc.2017.10.021

8. Turajlic S, Litchfield K, Xu H, Rosenthal R, McGranahan N, Reading JL, et al. Insertion-and-deletion-derived tumour-specific neoantigens and the immunogenic phenotype: a pan-cancer analysis. Lancet Oncol (2017) 18(8):1009–21. doi: 10.1016/S1470-2045(17)30516-8

9. Disis ML, Wallace DR, Gooley TA, Dang Y, Slota M, Lu H, et al. Concurrent trastuzumab and HER2/neu-specific vaccination in patients with metastatic breast cancer. J Clin Oncol Off J Am Soc Clin Oncol (2009) 27(28):4685–92. doi: 10.1200/JCO.2008.20.6789

10. Lowenfeld L, Zaheer S, Oechsle C, Fracol M, Datta J, Xu S, et al. Addition of anti-estrogen therapy to anti-HER2 dendritic cell vaccination improves regional nodal immune response and pathologic complete response rate in patients with ER(pos)/HER2(pos) early breast cancer. Oncoimmunol (2017) 6(9):e1207032. doi: 10.1080/2162402X.2016.1207032

11. Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol (2010) 11(3):R25. doi: 10.1186/gb-2010-11-3-r25

12. Shao XM, Bhattacharya R, Huang J, Sivakumar IKA, Tokheim C, Zheng L, et al. High-throughput prediction of MHC class I and II neoantigens with MHCnuggets. Cancer Immunol Res (2020) 8(3):396–408. doi: 10.1158/2326-6066.CIR-19-0464

13. Hundal J, Kiwala S, McMichael J, Miller CA, Xia H, Wollam AT, et al. pVACtools: A computational toolkit to identify and visualize cancer neoantigens. Cancer Immunol Res (2020) 8(3):409–20. doi: 10.1158/2326-6066.CIR-19-0401

14. Almeida RR, Rosa DS, Ribeiro SP, Santana VC, Kallas EG, Sidney J, et al. Broad and cross-clade CD4+ T-cell responses elicited by a DNA vaccine encoding highly conserved and promiscuous HIV-1 M-group consensus peptides. PloS One (2012) 7(9):e45267. doi: 10.1371/journal.pone.0045267

15. Van der Auwera I, Bovie C, Svensson C, Trinh XB, Limame R, van Dam P, et al. Quantitative methylation profiling in tumor and matched morphologically normal tissues from breast cancer patients. BMC Cancer (2010) 10:97. doi: 10.1186/1471-2407-10-97

16. Yang W, Lee KW, Srivastava RM, Kuo F, Krishna C, Chowell D, et al. Immunogenic neoantigens derived from gene fusions stimulate T cell responses. Nat Med (2019) 25(5):767–75. doi: 10.1038/s41591-019-0434-2

17. Vellichirammal NN, Albahrani A, Banwait JK, Mishra NK, Li Y, Roychoudhury S, et al. Pan-cancer analysis reveals the diverse landscape of novel sense and antisense fusion transcripts. Mol Ther Nucleic Acids (2020) 19:1379–98. doi: 10.1016/j.omtn.2020.01.023

18. Singh S, Qin F, Kumar S, Elfman J, Lin E, Pham LP, et al. The landscape of chimeric RNAs in non-diseased tissues and cells. Nucleic Acids Res (2020) 48(4):1764–78. doi: 10.1093/nar/gkz1223

19. Vu TN, Deng W, Trac QT, Calza S, Hwang W, Pawitan Y. A fast detection of fusion genes from paired-end RNA-seq data. BMC Genomics (2018) 19(1):786. doi: 10.1186/s12864-018-5156-1

20. Haas BJ, Dobin A, Li B, Stransky N, Pochet N, Regev A. Accuracy assessment of fusion transcript detection via read-mapping and de novo fusion transcript assembly-based methods. Genome Biol (2019) 20:213. doi: 10.1186/s13059-019-1842-9

21. Carrara M, Beccuti M, Lazzarato F, Cavallo F, Cordero F, Donatelli S, et al. State-of-the-art fusion-finder algorithms sensitivity and specificity. Biomed Res Int (2013) 2013:340620. doi: 10.1155/2013/340620

22. Davidson NM, Majewski IJ, Oshlack A. JAFFA: high sensitivity transcriptome-focused fusion gene detection. Genome Med (2015) 7(1):43. doi: 10.1186/s13073-015-0167-x

23. Balan J, Jenkinson G, Nair A, Saha N, Koganti T, Voss J, et al. SeekFusion - A clinically validated fusion transcript detection pipeline for PCR-based next-generation sequencing of RNA. Front Genet (2021) 12:739054. doi: 10.3389/fgene.2021.739054

24. Kumar S, Razzaq SK, Vo AD, Gautam M, Li H. Identifying fusion transcripts using next generation sequencing. Wiley Interdiscip Rev RNA (2016) 7(6):811–23. doi: 10.1002/wrna.1382

25. Li Y, Heavican TB, Vellichirammal NN, Iqbal J, Guda C. ChimeRScope: a novel alignment-free algorithm for fusion transcript prediction using paired-end RNA-Seq data. Nucleic Acids Res (2017) 45(13):e120. doi: 10.1093/nar/gkx315

26. Jia Y, Xie Z, Li H. Intergenically spliced chimeric RNAs in cancer. Trends Cancer (2016) 2(9):475–84. doi: 10.1016/j.trecan.2016.07.006

27. Carter SL, Cibulskis K, Helman E, McKenna A, Shen H, Zack T, et al. Absolute quantification of somatic DNA alterations in human cancer. Nat Biotechnol (2012) 30(5):413–21. doi: 10.1038/nbt.2203

28. Cai W, Zhou D, Wu W, Tan WL, Wang J, Zhou C, et al. MHC class II restricted neoantigen peptides predicted by clonal mutation analysis in lung adenocarcinoma patients: implications on prognostic immunological biomarker and vaccine design. BMC Genomics (2018) 19(1):582. doi: 10.1186/s12864-018-4958-5

29. Loo SK, Yates ME, Yang S, Oesterreich S, Lee AV, Wang X. Fusion-associated carcinomas of the breast: Diagnostic, prognostic, and therapeutic significance. Genes Chromosomes Cancer (2022) 61(5):261–73. doi: 10.1002/gcc.23029

30. Gao Q, Liang WW, Foltz SM, Mutharasu G, Jayasinghe RG, Cao S, et al. Driver fusions and their implications in the development and treatment of human cancers. Cell Rep (2018) 23(1):227–238.e3. doi: 10.1016/j.celrep.2018.03.050

31. Veeraraghavan J, Tan Y, Cao XX, Kim JA, Wang X, Chamness GC, et al. Recurrent ESR1–CCDC170 rearrangements in an aggressive subset of oestrogen receptor-positive breast cancers. Nat Commun (2014) 5(1):4577. doi: 10.1038/ncomms5577

32. Liu CC, Veeraraghavan J, Tan Y, Kim JA, Wang X, Loo SK, et al. A novel neoplastic fusion transcript, RAD51AP1-DYRK4, confers sensitivity to the MEK inhibitor trametinib in aggressive breast cancers. Clin Cancer Res (2021) 27(3):785–98. doi: 10.1158/1078-0432.CCR-20-2769

33. Hartmaier RJ, Trabucco SE, Priedigkeit N, Chung JH, Parachoniak CA, Vanden Borre P, et al. Recurrent hyperactive ESR1 fusion proteins in endocrine therapy-resistant breast cancer. Ann Oncol (2018) 29(4):872–80. doi: 10.1093/annonc/mdy025

34. Guo B, Godzik A, Reed JC. Bcl-G, a novel pro-apoptotic member of the bcl-2 family. J Biol Chem (2001) 276(4):2780–5. doi: 10.1074/jbc.M005889200

35. Lee S, Hu Y, Loo SK, Tan Y, Bhargava R, Lewis MT, et al. Landscape analysis of adjacent gene rearrangements reveals BCL2L14–ETV6 gene fusions in more aggressive triple-negative breast cancer. Proc Natl Acad Sci (2020) 117(18):9912–21. doi: 10.1073/pnas.1921333117

36. Martelotto LG, De Filippo MR, Ng CK, Natrajan R, Fuhrmann L, Cyrta J, et al. Genomic landscape of adenoid cystic carcinoma of the breast. J Pathol (2015) 237(2):179–89. doi: 10.1002/path.4573

37. Persson M, Andrén Y, Mark J, Horlings HM, Persson F, Stenman G. Recurrent fusion of MYB and NFIB transcription factor genes in carcinomas of the breast and head and neck. Proc Natl Acad Sci USA (2009) 106(44):18740–4. doi: 10.1073/pnas.0909114106

38. Bishop JA, Yonescu R, Batista D, Begum S, Eisele DW, Westra WH. Utility of mammaglobin immunohistochemistry as a proxy marker for the ETV6-NTRK3 translocation in the diagnosis of salivary mammary analogue secretory carcinoma. Hum Pathol (2013) 44(10):1982–8. doi: 10.1016/j.humpath.2013.03.017

39. Currier JR, Kuta EG, Turk E, Earhart LB, Loomis-Price L, Janetzki S, et al. A panel of MHC class I restricted viral peptides for use as a quality control for vaccine trial ELISPOT assays. J Immunol Methods (2002) 260(1-2):157–72. doi: 10.1016/S0022-1759(01)00535-X

Keywords: RNA fusions, chimeric RNAs, neoantigens, immunopeptides, tumor peptide vaccines

Citation: Mistretta B, Rankothgedera S, Castillo M, Rao M, Holloway K, Bhardwaj A, El Noafal M, Albarracin C, El-Zein R, Rezaei H, Su X, Akbani R, Shao XM, Czerniecki BJ, Karchin R, Bedrosian I and Gunaratne PH (2023) Chimeric RNAs reveal putative neoantigen peptides for developing tumor vaccines for breast cancer. Front. Immunol. 14:1188831. doi: 10.3389/fimmu.2023.1188831

Received: 17 March 2023; Accepted: 27 July 2023;
Published: 06 September 2023.

Edited by:

Yeonseok Chung, Seoul National University, Republic of Korea

Reviewed by:

Kai Zhang, Zhengzhou University, China
Dapeng Zhou, Tongji University, China

Copyright © 2023 Mistretta, Rankothgedera, Castillo, Rao, Holloway, Bhardwaj, El Noafal, Albarracin, El-Zein, Rezaei, Su, Akbani, Shao, Czerniecki, Karchin, Bedrosian and Gunaratne. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Preethi H. Gunaratne; Isabelle Bedrosian

†These authors have contributed equally to this work and share first authorship
