- 1Rostock University Medical Center, Department of Neurology, Division of Neuroimmunology, Rostock, Germany
- 2Rostock University Medical Center, Institute of Immunology, Rostock, Germany
Background: Multiple sclerosis (MS) is a chronic immune-mediated disease of the central nervous system to which a genetic predisposition contributes. Over 200 genetic regions have been associated with increased disease risk, but the disease-causing variants and their functional impact at the molecular level are mostly poorly defined. We hypothesized that single-nucleotide polymorphisms (SNPs) have an impact on pre-mRNA splicing in MS.
Methods: Our study focused on 10 bioinformatically prioritized SNP–gene pairs, in which the SNP has a high potential to alter alternative splicing events (ASEs). We tested for differential gene expression and differential alternative splicing in B cells from MS patients and healthy controls. We further examined the impact of the SNP genotypes on ASEs and on splice isoform expression levels. Novel genotype-dependent effects on splicing were verified with splicing reporter minigene assays.
Results: We were able to confirm previously described findings regarding the relation of MS-associated SNPs with the ASEs of the pre-mRNAs from GSDMB and SP140. We also observed an increased IL7R exon 6 skipping when comparing relapsing and progressive MS patients to healthy subjects. Moreover, we found evidence that the MS risk alleles of the SNPs rs3851808 (EFCAB13), rs1131123 (HLA-C), rs10783847 (TSFM), and rs2014886 (TSFM) may contribute to a differential splicing pattern. Of particular interest is the genotype-dependent exon skipping of TSFM due to the SNP rs2014886. The minor allele T creates a donor splice site, resulting in the expression of the exon 3 and 4 of a short TSFM transcript isoform, whereas in the presence of the MS risk allele C, this donor site is absent, and thus the short transcript isoform is not expressed.
Conclusion: In summary, we found that genetic variants from MS risk loci affect pre-mRNA splicing. Our findings substantiate the role of ASEs with respect to the genetics of MS. Further studies on how disease-causing genetic variants may modify the interactions between splicing regulatory sequence elements and RNA-binding proteins can help to deepen our understanding of the genetic susceptibility to MS.
Introduction
Multiple sclerosis (MS) is a chronic immune-mediated and neurodegenerative disease of the central nervous system (CNS) (1, 2). Approximately 2.8 million people worldwide suffer from MS, with women being affected two to three times more often than men and with an average age at diagnosis of 32 years (3, 4). MS is classified into three different clinical courses: relapsing–remitting MS (RRMS) as the most common form (~85% of initial diagnoses), secondary progressive MS (SPMS), and primary progressive MS (PPMS) (~15% of initial diagnoses) (5–7). Clinically, RRMS is characterized by episodes of disease (relapses) followed by a partial recovery of symptoms (remissions). As the neurological deficits worsen with disease progression, approximately 80% of the RRMS cases convert to SPMS within 25 years after the diagnosis (6, 8, 9). PPMS and SPMS are characterized by a continuous worsening of symptoms without significant recovery. The symptoms of MS include, among others, limited mobility, impaired vision, and cognitive deficits (10). The severity of disability is usually determined by the Expanded Disability Status Scale (EDSS) (11).
The immune system plays a key role in the pathophysiology of MS. Immune cells infiltrate the CNS across the blood–brain barrier, leading to demyelination, neuroaxonal damage, the loss of synapses, and reactive gliosis (1, 8, 12). The disruption of neuronal signal transmission then results in clinical symptoms. Approaches to the management of MS include the treatment of acute relapses (13, 14), symptomatic therapies (15), and therapies to prevent relapses and slow the progression of disability (16–18). B cells are central players in the pathogenesis of MS as they mediate cytokine production, antigen presentation, intrathecal antibody synthesis, and the formation of ectopic follicles (19). As new research findings on MS immunopathology further underlined the functional role of B cells, disease-modifying drugs that mediate the depletion of B cells are now commonly used (20–24).
The etiology of MS is still unclear. However, environmental and lifestyle factors, such as smoking, adolescent obesity, and Epstein–Barr virus (EBV) infection, as well as genetic predisposition have been identified as risk factors contributing to the development of MS (25–29). Single-nucleotide polymorphisms (SNPs), the variations of single base pairs at specific genome locations, are the most common type of genetic risk factors (30, 31). Genome-wide association studies (GWASs) have been used to identify associations between SNP alleles and disease. In the most recent GWAS of MS, 233 SNPs have been associated with a significantly increased risk of developing MS [MS-associated lead SNPs (MS SNPs)] (32). However, considering the tendency of proximal SNPs to be inherited together (33), SNPs that are in linkage disequilibrium (LD) with an MS SNP are also associated with MS. Most disease-associated SNPs are considered to have regulatory implications, which means that they are colocalized with quantitative trait loci (QTLs) and thus can affect, e.g., gene expression (eQTL) or alternative splicing (sQTL) (34–38).
Precursor messenger RNA (pre-mRNA) splicing is a physiological process in the cell nucleus by which the introns (intragenic regions) of a pre-mRNA are cut out and the remaining exons (expressed regions) are joined together to form a mature mRNA molecule (39). The cotranscriptional splicing process is coordinated by a complex interplay of cis-elements, trans-acting factors, and the spliceosome complex, which consists of five small nuclear ribonucleoproteins (snRNPs) (40, 41). The important sequences within the pre-mRNA are 5’ and 3’ splice sites (donor and acceptor, respectively), the branch point, the polypyrimidine tract, and exonic or intronic motifs to enhance or silence splicing (42–45). The RNA-binding proteins (RBPs) that recognize these sequences are important for the recruitment of the spliceosome complex. The regulation of the splicing process enables the use of different splice sites, which, in turn, leads to alternative splicing and thus to an altered exon usage compared to the canonical splicing. This allows for the generation of various mRNAs from one pre-mRNA, resulting in a broad transcriptome diversity.
There are five basic types of alternative splicing events (ASEs). While during exon skipping, an exon is excised and not inserted into the mRNA, during intron retention, an intron is not removed and remains in the mRNA molecule. The use of different splice sites can also result in mutually exclusive exons, where only one of two possible exons occurs in the mRNA, or in exons with different lengths due to the use of different acceptor or donor splice sites (46). In addition to the physiological role of alternative splice sites, genetic variants, such as SNPs, can alter the splicing pattern and thereby contribute to the risk of developing diseases (47). As the majority of ASEs in the human EST database are not conserved in mice (48), investigations on the splicing pattern in the experimental autoimmune encephalomyelitis (EAE) mouse model for MS are limited, and thus studies with MS patients are needed. We previously reviewed studies in which ASEs in association with MS have been investigated and found that alternative splicing in MS has been little studied so far (49). The most prominent example is exon 6 skipping in the transcript for the interleukin-7 receptor (IL7R) dependent on SNP rs6897932 (50).
In this study, we investigated ASEs related to SNPs in genetic loci associated with the risk of MS. For this purpose, we used a bioinformatic approach to identify SNPs that potentially alter splicing in MS. We then measured the expression of genes and of individual exons and exon–exon junctions in B cells from MS patients and healthy individuals and analyzed whether the expression is related to MS and/or the SNP. We further used splicing reporter minigene assays to verify alternative pre-mRNA splicing dependent on the genotype of the SNPs. Our study provides new insights into the molecular pathomechanisms of MS by exploring the putative functional role of genetic variants associated with disease susceptibility.
Methods
This study is divided into in silico, ex vivo, and in vitro parts (Figure 1). A detailed description of all methods is provided in the supplement (Supplementary file).
Figure 1 Methodical overview of the study. An in silico approach (serial workflow) was employed to identify single-nucleotide polymorphism (SNP)–gene pairs, where the SNP has the potential to alter the splicing pattern of a gene. For the selected SNP–gene pairs, ex vivo and in vitro analyses were conducted. a Multiple sclerosis (MS)–associated lead SNP according to the genome-wide association study (GWAS) of MS from 2019 (32). ASE, alternative splicing event; bp, base pairs; GWAS, genome-wide association study; kb, kilobases; MS, multiple sclerosis; PBMC, peripheral blood mononuclear cells; qPCR, quantitative real-time PCR polymerase chain reaction; SNP, single-nucleotide polymorphism; V1/V2, minigene construct variants, which differ in only one base and thus represent the two allelic variants of a SNP (here, V1 shows constitutive splicing and V2 shows alternative splicing).
Selection of multiple sclerosis–associated genetic variants that may alter pre-mRNA splicing
Using publicly available microarray data sets and a literature-based screening, we identified differentially spliced candidate genes in MS that are encoded less than 250 kb away from an MS SNP (32). We then determined SNPs that are at least in mild LD (r2 > 0.1 and D’ > 0.7) with the MS SNPs and are located within exons or adjacent intronic regions (up to 400 bp from the exon) of the genes. By using the splice prediction tool Human Splicing Finder (51) and the POSTAR2 database (52), we finally prioritized 10 SNP–gene pairs, in which the SNP has the potential to alter the splicing pattern of the gene (hereafter referred to as splice SNP).
Study cohort
As part of the research projects of the Neuroimmunology research group at Rostock University Medical Center, a total of 121 blood samples were collected and DNA and B-cell RNA were extracted as described previously (53). The subjects were divided into the following three subgroups: healthy controls, PPMS patients, and RRMS patients. MS patients were diagnosed according to the 2017 revisions of the McDonald criteria (54). The diagnosis, treatment, and monitoring of the patients followed routine clinical practice. For further details on the study cohort and the B-cell samples, the reader is referred to our previously published study (53).
Single-nucleotide polymorphism genotyping
The genotyping of the 10 bioinformatically determined splice SNPs was performed with the DNA extracted from the blood samples. For the genotyping, we used custom TaqMan® Array Cards (Applied Biosystems). Data analysis was performed in an automated manner using the TaqMan Genotyper Software (version 1.6, Applied Biosystems). The genotype assignments were manually validated. In case of failed genotyping, the SNP was not considered for further analyses.
Transcriptome analysis
The isolated B-cell RNA samples were used to perform high-density microarray measurements. This was done with Clariom D arrays for human (Thermo Fisher Scientific), which allow to examine the expression of more than 130,000 protein-coding and non-protein-coding genes (transcript clusters, TC probe sets). The arrays are designed using six oligonucleotide probes for each probe selection region (PSR), mostly identical with an exon, and four probes for all presumptive exon–exon junctions (junction probe set, JUC), which enables the analysis of expression differences with respect to single exons or exon–exon junctions. Sample preparation and microarray hybridization were conducted as described in the supplementary methods (Supplementary file). Based on the transcriptome data for all 121 samples, we tested for differential gene expression and splicing pattern differences in MS patients vs. healthy controls as well as in the dependence of the splice SNP genotypes. The analysis of the microarray data was accomplished by using the Transcriptome Analysis Console (TAC) software (version 4.0.2, Applied Biosystems).
Verification of splice isoform expression via quantitative real-time PCR
After the transcriptome analysis, sufficient material was available for 109 of the 121 B-cell RNA samples to perform transcript isoform expression measurements by quantitative (real-time) PCR (qPCR) assays. Custom TaqMan® Gene Expression Array Cards (Thermo Fisher Scientific) were used for this analysis. For each of the 10 SNP–gene pairs, two qPCR assays were used to distinguish the different transcript isoforms resulting from the specific ASEs under scrutiny (e.g., exon skipping vs. exon inclusion). If a transcript isoform could not be detected within 45 PCR cycles, the missing CT values were imputed with the R package nondetects (55). Primary data analysis was conducted by using the ExpressionSuite software (version 1.3, Thermo Fisher Scientific). The data were normalized and converted to the linear scale (Supplementary file).
Splicing reporter minigene assay
Seven SNP–gene pairs were subjected to splicing reporter minigene assays. The minigene assay is based on the principle of the transient transfection of cells with a vector containing the genomic region of interest cloned between two constitutive exons. Our minigene constructs were generated using the pDESTsplice vector and synthesized genomic sequences cloned into the pDONR221 vector (BioCat). The pDESTsplice vector was kindly gifted by Stefan Stamm (56) (Addgene plasmid #32484; http://n2t.net/addgene:32484; RRID: Addgene_32484). For each SNP–gene pair, our minigene assays always consisted of two minigene constructs that differed in a single base and thus represented the two SNP allele variants. HeLa cells were transiently transfected with the minigene constructs. RNA from the HeLa cells was isolated 24 h after the transfection and used for RT-PCR. The PCR products were visualized by gel electrophoresis and validated by sequencing. The distribution of splice isoforms was evaluated by determining the intensity of the PCR product bands on the gel with the Image Studio Lite software version 5.2 (LI-COR Biosciences).
Statistics
Statistical analyses were performed in R (version 4.0) and the TAC software (version 4.0.2). For descriptive statistics, the (robust) means and standard deviations (SD) per group were either calculated in R or directly obtained from the TAC software. We computed linear models and performed pairwise comparisons with Tukey post-hoc tests by using either the limma (57) framework in TAC or the R packages car (58) and stats. For the evaluation of the minigene assay outcomes, we performed two-way analyses of variance (ANOVA) to test whether the relative transcript abundance can be explained by an interaction between the splice SNP allele and the splice isoform. For all analyses, a significance level of α = 0.01 was chosen to indicate significant differences in expression and splicing, respectively. This cutoff was chosen to provide a balance between multiple testing and exploratory investigations. The data were visualized with bar plots and beeswarm plots.
Results
Prioritization of splice single-nucleotide polymorphisms in multiple sclerosis–associated genetic loci
We identified a total of 10 SNP–gene pairs in which the splice SNP has the potential to influence pre-mRNA splicing and for which we sought an experimental validation of the determined ASEs in this work (Table 1; Figure S1). For three SNP–gene pairs (genes: GSDMB, IL7R, and SP140), an aberrant alternative splicing in MS has already been described in the literature (49).
Table 1 Prioritized SNPs from MS–associated genetic regions that potentially alter the splicing pattern of eight genes.
The splice SNPs are located within an exonic region (n = 5) or within 400 bp of the adjacent intronic regions (n = 5), with all but one of the intronic SNPs being located less than 100 bp from the exon. Two of the 10 splice SNPs are in complete LD (D’ = 1) with the MS SNP (32), implying that one SNP allele is always inherited together with one specific MS SNP allele.
In total, we determined four different types of ASEs for the 10 SNP–gene pairs. In most cases, exon skipping was found (n = 6). Moreover, we identified the ASEs intron retention (n = 2), alternative 5’ donor site (n = 1), and alternative last exon (n = 3). Note that in two cases (TSFM exon 6 and 7 skipping and NCAPH2 intron 19 retention), the ASE coincided with the usage of an alternative last exon.
Characteristics of the study cohort groups
A total of 121 blood samples were collected. We obtained 28 samples from healthy controls, 13 samples from PPMS patients, and 80 samples from RRMS patients. The PPMS patients were treated with glucocorticoids. The RRMS samples were taken from patients receiving alemtuzumab (n = 38), natalizumab (n = 29), cladribine (n = 6), fingolimod (n = 3), glatiramer acetate (n = 3), or interferon beta-1b (n = 1).
The sex ratio was relatively balanced in the PPMS group, whereas there was a non-significant preponderance of women in the healthy control group and the RRMS group (Table 2). In terms of age, the healthy controls, with an average age of 28.0 years, were significantly younger than the MS patients (mean age: PPMS: 58.7 years, RRMS: 36.1 years, p < 0.001). The mean disease duration was similar for PPMS patients and RRMS patients. RRMS patients had an average of 0.4 relapses in the year prior to the blood collection and a mean EDSS score of 2.7. PPMS patients had a considerably higher degree of disability, with an average EDSS score of 4.9 (p < 0.001). There were no major imbalances in the demographic and clinical data between the SNP genotype groups (Supplementary Table S8, Supplementary file).
Differential gene expression and alternative splicing in B cells
The transcriptome data for the 121 B-cell RNA samples were used to test the prioritized genes for differential gene expression and differential alternative splicing. Comparing the gene expression between the study groups, we found a significantly lower IL7R mRNA expression in MS patients as compared to healthy controls (Table 3). For two genes, we observed a significant association with the splice SNP genotype. The transcript levels of EFCAB13 were significantly higher when the MS risk allele C of splice SNP rs3851808 was present. For GSDMB, a significantly lower gene expression was observed in the homozygous carriers of the MS risk allele C of splice SNP rs11078928.
Next, we used the transcriptome data set to examine differences in the expression levels of individual exons and exon–exon junctions that distinguish certain alternative pre-mRNA splice variants. For this purpose, the data for PSR and JUC probe sets, which correspond to the ASEs of the 10 prioritized SNP–gene pairs, were compared between the study groups and the splice SNP genotypes. When the MS patients were compared with the healthy controls, an evidence of differential splicing was found for three genes (Table 4). For the probe set interrogating the exon 6 of IL7R, we found significantly higher levels in the healthy group, suggesting that in those individuals, the exon is frequently incorporated into the mRNA. Similarly, we measured significantly higher levels for the probe set corresponding to exon 4 of TSFM in healthy controls as compared to patients with MS. In addition, we found that the longer CLEC16A exon 11, which belongs to the ENST00000409790 transcript variant, was significantly more abundant in the B cells of MS patients (especially PPMS patients) than in those of healthy controls.
For six SNP–pairs, the levels of exons and junctions were significantly associated with the genotype of the respective splice SNP (Figure 2). In B cells from individuals that were homozygous for the MS risk alleles of the splice SNP, we detected lower levels of GSDMB exon 6 and higher levels of the SP140 exon 6 to exon 8 splice junction. The exons 9 and 10 of EFCAB13 and the intron 2 of HLA-C were found more likely to be included in the mRNA when the MS risk allele is present. Regarding HLA-C, we could only evaluate the ASE type intron retention as there are no PSR/JUC probe sets on Clariom D arrays that represent transcripts in which the intron is spliced out. We also found that the two splice SNPs located in the TSFM gene are associated with differential alternative splicing. These two SNPs are in the proximity of the same MS SNP, and the respective ASEs presumably account for a short and long transcript isoform of TSFM (ENST00000417094 and ENST00000550559). We observed that the levels of the exon 4 of the short transcript were significantly lower in the presence of the MS risk allele C of splice SNP rs2014886, and that the levels of the exon 7 of the long transcript were significantly higher when the MS risk allele G of splice SNP rs10783847 is present. Note that for all six SNP–gene pairs for which the splice SNP genotype was significantly associated with exon- or junction-specific expression levels, the data always correlated with the number of risk alleles carried, i.e., the average expression of the group of heterozygotes was always between that of the two homozygous groups (Figure 2). The full results of the transcriptome data analysis, including those for probe sets that capture the respective opposite events, are provided in Supplementary Tables S9 and S10 (Supplementary file). The transcriptome data are accessible through GEO Series accession number GSE190847.
Figure 2 Detection of ASEs in transcriptome data from the B cells of MS patients vs. healthy controls and in relation to splice SNP genotypes. For all 121 samples, the expression of individual exons and exon–exon junctions was interrogated using PSR and JUCs, respectively. Signal intensities (in log2 scale) and group means (black lines) are depicted for the ASEs of the six SNP–gene pairs for which we found significant associations with the genotype (Table 4). Shown are the comparisons of expression levels between the three study groups (on the left) and between the splice SNP genotypes (on the right). P-values from pairwise Tukey post-hoc analyses and the numbers of samples per group are given. The numbering of exons and introns is as specified in Table 1. a For technical reasons, the designated splice SNP was tagged by a proximal SNP (Supplementary file). ASE, alternative splicing event; JUC, junction probe set; MS, multiple sclerosis; PPMS, primary progressive MS; PSR, probe selection region; RA, risk allele; RRMS, relapsing–remitting MS; SNP, single-nucleotide polymorphism.
Validation of differential splice isoform expression
To confirm that the splice SNPs affect ASEs and consequently the expression of different splice isoforms, we performed qPCR measurements with 109 of the 121 B-cell RNA samples. Based on these data, we compared the expression of mRNA splice isoforms between MS patients and healthy controls and between the splice SNP genotypes (Table 5).
Overall, the qPCR data well reflected the transcriptome data. In line with the transcriptome data, we saw significantly higher levels of IL7R transcripts that contain exon 6 in the qPCR data of healthy controls compared to those of MS patients. In addition, in the presence of the MS risk allele, exons 9 and 10 of EFCAB13 were included more frequently, exon 7 of SP140 was skipped more frequently and exon 6 of GSDMB and exons 3 and 4 of TSFM were included at significantly lower rates (Figure 3). In the case of TSFM | rs2014886, the short transcript isoform (ENST00000417094) is only rarely expressed in B cells, which explains the high number of missing values. For splice SNP rs10783847 and TSFM exons 6 and 7 (ENST00000550559), a non-significant trend toward preferential exon inclusion has been observed for the carriers of the MS risk allele. In contrast to the transcriptome data, no genotype dependence of the ASE in HLA-C (intron 2 retention) was seen in the qPCR data. The detailed results of the qPCR analysis are available in Supplementary Tables S11 and S12 (Supplementary file).
Figure 3 Verification of ASEs in MS patients vs. healthy controls and in relation to splice SNP genotypes. Relative expression as measured in B cells by qPCR (n = 109 samples). The same ASEs as in Figure 2 are visualized (but for HLA-C related to the isoform with intron 2 spliced out). Means per group are shown as horizontal black lines. Shown are the comparisons of mRNA isoform expression levels between the three study groups (on the left) and between the splice SNP genotypes (on the right). P-values from pairwise Tukey post-hoc analyses and the number of samples for each group are given. The numbering of exons and introns is as specified in Table 1. a For technical reasons, the designated splice SNP was tagged by a proximal SNP (Supplementary file). ASE, alternative splicing event; MS, multiple sclerosis; PPMS, primary progressive MS; qPCR, quantitative real-time PCR; RA, risk allele; RRMS, relapsing-remitting MS; SNP, single-nucleotide polymorphism.
Multiple sclerosis–associated splice single-nucleotide polymorphism affects splicing pattern of TSFM
Since evidence of genotype-dependent splicing was found for 6 of the 10 SNP–gene pairs within the transcriptome and/or qPCR data for our study cohort, we used splicing reporter minigene assays to investigate whether the ASEs are causally related to the splice SNP allele variants. We focused on the seven ASEs that have not yet been previously studied in the samples of MS patients according to our recent systematic review (49), i.e., for CLEC16A, the alternative 5’ donor site and the alternative last exon, exon skipping for EFCAB13 and TSFM, and intron retention for HLA-C and NCAPH2 (Table 1).
When the MS risk allele of the splice SNP rs2014886 is present, there is a C two nucleotides downstream of TSFM exon 3 (variant V1). In this case, we observed TSFM exon 3 skipping (Figure 4). On the other hand, when the minigene construct carried the alternative allele T (variant V2), exon 3 was frequently included between the constitutively expressed rat insulin exons (Figures 4B, C). More precisely, the creation of the donor splice site due to the allele T resulted in a significant shift in the expression of the transcript isoforms: from a proportion of nearly 100% exon skipping to a proportion of 61% exon skipping and 39% exon inclusion (p = 2.8e-09). We verified that the ASE of TSFM depends on splice SNP rs2014886 by sequencing (Figure 4D). These findings are in line with the results from the analyses of B cells with microarrays and qPCR assays (Tables 4, 5).
Figure 4 Effect of MS-associated splice SNP on TSFM exon 3 skipping. (A) Simplified depiction of the minigene assay for TSFM. The sequences of TSFM exon 3 (gray box) as well as 400 bp of the up- and downstream introns were cloned between rat insulin exon 2 and 3 (burgundy boxes) of the pDESTsplice vector. The splice SNP rs2014886 is located in the intronic region 2 nucleotides downstream of exon 3. The matching sequences for the PCR primers are located in the sequences of the rat insulin exons (P1 and P2). If TSFM exon 3 is included in the resulting transcript, the PCR product has a size of 233 bp. If exon 3 is skipped, the resulting PCR product has a size of 195 bp. (B) The PCR products for the variants V1 and V2 (from triplicate measurements) were visualized by gel electrophoresis. V1 represents the MS risk allele C and V2 represents the alternative allele T of splice SNP rs2014886. (C) The relative proportions of splice isoforms that resulted due to TSFM exon 3 skipping or TSFM exon 3 inclusion for the allele variants V1 (green) and V2 (gray). The MS risk allele C of splice SNP rs2014886 favors TSFM exon 3 skipping. The two splice isoforms were verified by reverse direction sequencing (D, E). —, negative control; bp, base pairs; Ex, exon; M, size standard; MS, multiple sclerosis; NT, non-template control; P1, PCR_RatInsEx2; P2, PCR_RatInsEx3 (Supplementary Table S6, Supplementary file); SNP, single-nucleotide polymorphism.
We also observed a preferential intron 2 retention for HLA-C related to the MS risk allele T of SNP rs1131123 (Figure S3). In the presence of the allele T, we saw a shift of the relative proportion of intron 2 retention from 67% to 87% (p = 3.6e-08). This is consistent with the observations from the microarray data analysis (Figure 2). However, for the other five SNP–gene pairs (CLEC16A | rs11074944, CLEC16A | rs3214361, EFCAB13 | rs3851808, NCAPH2 | rs2782 and TSFM | rs10783847), similar relative proportions of the different transcription products were obtained independently of the allelic variant, and the tests for interactions did not reach the significance level. Thus, we could not confirm that these ASEs are causally related to the splice SNP genotypes in the minigene assays (Figure S3).
Discussion
In this study, we combined in silico evaluations to identify SNPs that may alter pre-mRNA splicing with expression analyses of B cells and with cell culture experiments. We demonstrate that the genotype of SNPs in LD with MS-associated genetic variants can affect pre-mRNA splicing and thus the expression of splice isoforms. We observed an association of the splice SNP genotype with the expression of exons and exon–exon junctions for six SNP–gene pairs (EFCAB13 | rs3851808, GSDMB | rs11078928, HLA-C | rs1131123, SP140 | rs28445040, TSFM | rs10783847, and TSFM | rs2014886) in the microarray data. The differential alternative splicing could be verified by qPCR analyses for EFCAB13, GSDMB, SP140, and TSFM. With our findings for GSDMB and SP140, we could support previous results in the literature showing that the MS-associated SNPs affect alternative splicing (49).
As a starting point, we used various bioinformatic tools to prioritize genetic variants that are likely to alter the pre-mRNA splicing of MS risk genes. We here focused on SNPs located in an exon or within 400 bp of the adjacent intronic regions of these genes. According to previous studies, most splicing factor motifs can be found within this selected 400 bp window (59–61). For the prediction of splicing events due to genetic variants, different tools and databases are available (62–64). We used the Human Splicing Finder to investigate whether a SNP may affect a cis-element such as a branch point, a splice site, or an exonic/intronic splicing enhancer or silencer, and we used the POSTAR2 database to identify SNPs in experimentally determined RBP-binding sites. Finally, we determined 10 SNP–gene pairs (10 SNPs and 8 different genes) for the further event-focused investigations. The reliability of our splice SNP selection procedure was supported by the identification of ASEs for GSDMB | rs11078928, IL7R | rs6897932, and SP140 | rs28445040 as an aberrant genotype-dependent splicing in MS has been previously described for these three SNP–gene pairs (49), and the SNPs are also reported as sQTL SNPs for whole-blood and EBV-transformed lymphocyte samples in the GTEx portal (34). Most of the eight prioritized genes are expressed with a low immune cell type specificity according to the Human Protein Atlas (65). However, two of the genes are expressed more specifically in certain immune cell types: IL7R is expressed mainly in the subsets of T cells and natural killer cells, and SP140 is expressed mainly in memory B cells (66).
Then, we examined the association between the genotype of splice SNPs with the expression of the genes as well as with the levels of individual exons/junctions of the distinct splice isoforms of these genes in B cells from MS patients and healthy controls. Apart from the fact that we did not include SPMS patients, the group of MS patients resembled the typical characteristics of MS patients in European MS registries in terms of age, disease status, and sex (67). In line with the literature (50, 68–70), we observed a significant differential expression of exon 6 of IL7R in MS patients as compared to healthy controls. In our data, the levels of transcripts containing exon 6 were lower in MS patients, but we could not find the previously described association to the MS risk allele C of the non-synonymous splice SNP rs6897932 (T244I). However, the latter might result from the fact that we studied the expression in B cells and not in T cells, in which IL7R is more strongly expressed (66). IL7R encodes for a cell surface receptor for interleukin-7, which plays an essential role for the development and survival of T cells (71). Gregory et al. reported that the C allele of SNP rs6897932 augments an exonic splicing silencer and thus promotes exon 6 skipping, leading to a splice isoform that encodes a soluble form of the protein (50). This is of relevance as increased levels of soluble interleukin-7 receptor have been shown to exacerbate the disease severity in an EAE mouse model, presumably by increasing the activity or bioavailability of interleukin-7 (72). Our analyses of B-cell RNA samples by microarrays and qPCR indicated a genotype-dependent skipping of GSDMB exon 6 and SP140 exon 7. Consistent with our findings, Cardamone et al. (68), Garrido-Martín et al. (73), and Morrison et al. (74) found that the MS risk allele C of SNP rs11078928 affects the acceptor splice site of GSDMB exon 6, resulting in increased exon 6 skipping. The encoded protein Gasdermin-B mediates pyroptosis (75) and, in addition to MS, genetic variants in the GSDMB gene have also been associated with susceptibility to other multifactorial autoimmune diseases like rheumatoid arthritis (76) and ulcerative colitis (77). With regard to the genotype-dependent splicing of SP140, Cardamone et al. (78) and Matesanz et al. (79) could demonstrate via minigene assays that the MS risk allele T of SNP rs28445040 leads to the skipping of exon 7. The function of the protein encoded by SP140 is only partially known. However, the presence of chromatin-related protein domains indicates a role in the chromatin-mediated regulation of gene expression (80). In addition, Karaky et al. reported that SP140 regulates the expression of immune-related genes that are associated with MS (81).
For four other SNP-gene pairs (EFCAB13 | rs3851808, HLA-C | rs1131123, TSFM | rs2014886, and TSFM | rs10783847), we could detect differential alternative splicing in B cells in relation to the MS risk allele. We observed increased EFCAB13 expression and preferential inclusion of exons 9 and 10 in the presence of the MS risk allele C of splice SNP rs3851808. The genotype dependency of this ASE is supported by an sQTL association that is reported for EFCAB13 | rs3851808 for EBV-transformed lymphocytes and other cell types and tissues in the GTEx portal (34). The protein encoded by EFCAB13 contains a calcium-binding domain that is shared by a variety of calcium sensor proteins, which play a role in neuronal function and plasticity (82, 83). Diseases implicated with calcium sensor proteins are, for instance, Alzheimer’s disease (84) and various cancer types (85, 86).
For HLA-C | rs1131123, we observed a trend toward preferential HLA-C intron 2 retention in the presence of the MS risk allele T of the non-synonymous splice SNP rs1131123 (D114A). This genotype dependency was also observed with the minigene assay. HLA-C encodes a class I major histocompatibility complex antigen. Class I molecules play a central role in the immune system and have repeatedly been demonstrated to contribute to the genetic susceptibility to MS (87–89). However, there were challenges in examining the ASE of pre-mRNA from HLA-C: first, in the microarray data, we could only evaluate the expression of the transcript variant in which intron 2 is retained in the mRNA because there are no PSR/JUC probe sets for HLA-C intron 2 exclusion on the employed chip model. Second, only one of the two qPCR assays used to measure transcript splice isoforms of HLA-C provided valid data, which might be due to a sensitivity of the primer pair toward HLA-C subtypes. Since the SNP rs1131123 is not recorded in the GTEx portal (34), further investigations, e.g., with RNA sequencing, could be helpful to ascertain the presumed genotype-dependent splicing of HLA-C intron 2.
We found that exons 3 and 4 of the short transcript variant ENST00000417094 are more frequently skipped in the presence of the MS risk allele C of splice SNP rs2014886 and that this short transcript is only rarely expressed in B cells. In line with our B-cell transcriptome data and minigene assay data, an association of the C allele of SNP rs2014886 with TSFM exon 3 skipping was previously reported by Morrison et al. (74). However, they only studied a small study cohort of eight individuals per genotype. In a recently published report, which focused exclusively on the identification of potential cryptic exons based on literature reports and the dbSNP database, a genotype-dependent splicing of TSFM exon 3 was also postulated (90). Moreover, an sQTL that links the skipping of TSFM exon 3 and 4 with SNP rs2014886 is listed for EBV-transformed lymphocytes in the GTEx portal (34). For the second SNP–gene pair with TSFM, we found that the MS risk allele G of splice SNP rs10783847 showed a strong trend toward TSFM exon 6 and 7 inclusion of the transcript isoform ENST00000550559. TSFM encodes for a mitochondrial translation elongation factor, which catalyzes the exchange of GDP to GTP (91, 92). As the respiratory chain function relies on proper mitochondrial gene expression, differential TSFM expression is associated with various diseases such as encephalomyopathy, hypertrophic cardiomyopathy, and MS (93–96). Noteworthy, Alcina et al. (96) described that SNP rs10877013 affects TSFM expression in MS by altering the enhancer activity of a regulatory element. This SNP is in almost perfect LD with the two splice SNPs rs10783847 and rs2014886 in the European population (33). Further studies are needed to better understand the functional role of the different splice isoforms of TSFM in relation to the pathogenesis of the multifactorial disease MS. According to the Ensembl database, ENST00000417094 codes for an 89 amino acid long protein sequence (UniProt F8WCK2) but it is likely a target of nonsense-mediated decay. However, experimental evidence remains to be established.
The following limitations should be considered when interpreting the data of this study. First, due to the stringent restrictions on the selection of experimentally screenable SNP–gene pairs, it is possible that we have missed some MS-specific ASEs. For instance, we did not include rare variants (minor allele frequency < 1%) because the sample size would be insufficient to study such variants. In addition, we focused only on SNPs in or near exons and thus did not capture the potential influence of deep intronic SNPs on splicing. Such variants have been described for other diseases (61, 97–100). Second, some genomic regions are characterized by long-range LD. Hence, the observed effects on splicing may not represent the only effects underlying the genetic associations with MS. Third, in the analyses of differences in gene expression and alternative splicing, we cannot exclude the possibility of confounding variables, e.g., medical treatment and comorbidities. Specifically, we observed a shift in the proportions of B-cell subsets in patients treated with alemtuzumab or cladribine (53). This contributed to the variance in the gene expression data. Fourth, we conducted our measurements in B cells and therefore may have missed or underestimated the differential alternative splicing of genes that are more abundantly expressed in other cell types (101). Even though genetic effects on splicing are usually highly shared across tissues and cell types (34), further insights into the effects of genetic risk variants could be obtained by studying other cell types, e.g., other peripheral immune cells such as T cells. Fifth, our analysis of the microarray data relied on transcript isoforms as annotated in the reference genome. Thus, we studied known splice isoforms and may have missed novel splicing patterns, which can potentially be identified by using RNA sequencing (102, 103). Fifth, as we have previously described (46), there are issues regarding the use of the minigene assay system, such as a possible interference by the Gateway cloning attachment sites, an insufficient amount of an important splicing factor in the used cell line, or the fact that only a small and specific part of the gene is examined. The latter may lead to the misinterpretation of ASEs as the splicing of exons can depend on the correct splicing of other exons of the gene that are not included in the minigene construct.
In conclusion, in this study, we focused on SNPs located in genetic risk loci for MS that presumably affect pre-mRNA splicing and thus may have an influential role in the pathogenesis of the disease. We were able to support findings from previous studies on MS-related ASEs for the pre-mRNAs of GSDMB, IL7R, and SP140. For four novel SNP–gene pairs, we found an association of the splice SNP genotypes with differential alternative splicing in the B-cell transcriptome data. Except for two SNP–gene pairs, we were able to validate the findings of the microarray data analysis with the qPCR assays. In addition, we were able to further substantiate our observations from the B-cell expression data on TSFM exon 3 skipping by using minigene assays. The MS risk allele C of the SNP rs2014886 almost always led to TSFM exon 3 skipping, whereas the alternative allele led to a low expression of ENST00000417094 transcripts. However, the potential functional impact of this ASE remains unclear. Further functional studies are needed to identify the disease-causing genetic variants and to explore their effects on splicing and the resulting consequences of an aberrant expression of splice isoforms to improve our understanding of the molecular pathomechanisms of MS.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Ethics statement
The studies involving human participants were reviewed and approved by the Ethics committee of the University of Rostock. The patients/participants provided their written informed consent to participate in this study.
Author contributions
EP performed the bioinformatic prioritization of SNP-gene pairs as well as the majority of the experiments. EP evaluated and interpreted the obtained data. Furthermore, EP drafted the manuscript and prepared the figures and tables. MH provided support for the bioinformatic and statistical analyses of the data and critically revised the manuscript. MS coordinated the sample collection and gathered the clinical-demographic data. EP, NB and BF processed the blood samples. NB performed the SNP genotyping. DK was responsible for performing the microarray measurements. PL provided valuable advice concerning the splicing reporter minigene assays. MH, BF and UKZ conceptualized the study. MH, EP, NB and UKZ secured research funding. UKZ provided important intellectual insights and supervised the research. All authors contributed to the article and approved the submitted version.
Funding
EP received a scholarship from the Landesgraduiertenförderung Mecklenburg-Vorpommern. NB was funded by the Stiftung der Deutschen Wirtschaft (sdw). Research funding for this work was provided by the Rostock University Medical Center (FORUN program, grant 889034). In addition, this study was partially funded by Sanofi Genzyme (grant GZ-2016-11560) and Merck Serono GmbH (Darmstadt, Germany, an affiliate of Merck KGaA, CrossRef Funder ID: 10.13039/100009945, grant 4501860307). The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.
Acknowledgments
We thank Alexander Winkelmann, Stefanie Meister, and Ales Dudesek for patient care and Antje Bombor and Ina Schröder for coordinating the clinical visits and blood draws. We wish to thank Nele Retzlaff and Deborah Sonnenberg for reviewing the medical records. Our thanks go to Hans-Jürgen Thiesen, and Robert Jaster for providing access to technical equipment. We thank the Core Facility for Cell Sorting and Cell Analysis headed by Brigitte Müller-Hilke for providing the FACS equipment and technical support. We are grateful to Ildikó Tóth for laboratory support and to Felix Steinbeck for his input on computational tools for analyzing genetic data.
Conflict of interest
EP received travel funds from Novartis. MH received speaking fees and travel funds from Bayer HealthCare, Biogen, Merck, Novartis, and Teva. UKZ received research support as well as speaking fees and travel funds from Alexion, Almirall, Bayer HealthCare, Biogen, Bristol Myers Squibb, Janssen, Merck Serono, Novartis, Roche, Sanofi Genzyme, and Teva as well as EU, BMBF, BMWi, and DFG.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2022.931831/full#supplementary-material
Glossary
References
1. Filippi M, Bar-Or A, Piehl F, Preziosa P, Solari A, Vukusic S, et al. Multiple sclerosis. Nat Rev Dis Primers (2018) 4:43. doi: 10.1038/s41572-018-0041-4
2. Lassmann H. Multiple sclerosis pathology. Cold Spring Harb Perspect Med (2018) 8:a028936. doi: 10.1101/cshperspect.a028936
3. Walton C, King R, Rechtman L, Kaye W, Leray E, Marrie RA, et al. Rising prevalence of multiple sclerosis worldwide: Insights from the atlas of MS, third edition. Mult Scler (2020) 26:1816–21. doi: 10.1177/1352458520970841
4. Blaschke SJ, Ellenberger D, Flachenecker P, Hellwig K, Paul F, Pöhlau D, et al. Time to diagnosis in multiple sclerosis: Epidemiological data from the German multiple sclerosis registry. Mult Scler (2022) 28:865–71. doi: 10.1177/13524585211039753
5. Lublin FD, Reingold SC, Cohen JA, Cutter GR, Sørensen PS, Thompson AJ, et al. Defining the clinical course of multiple sclerosis: the 2013 revisions. Neurology. (2014) 83:278–86. doi: 10.1212/WNL.0000000000000560
6. Zettl UK, Stüve O, Patejdl R. Immune-mediated CNS diseases: a review on nosological classification and clinical features. Autoimmun Rev (2012) 11:167–73. doi: 10.1016/j.autrev.2011.05.008
7. Reich DS, Lucchinetti CF, Calabresi PA. Multiple sclerosis. N Engl J Med (2018) 378:169–80. doi: 10.1056/NEJMra1401483
8. Dendrou CA, Fugger L, Friese MA. Immunopathology of multiple sclerosis. Nat Rev Immunol (2015) 15:545–58. doi: 10.1038/nri3871
9. Frahm N, Ellenberger D, Fneish F, Christoph K, Warnke C, Zettl UK, et al. Characteristics of secondary progressive multiple sclerosis: Disease activity and provision of care in Germany - a registry-based/multicentric cohort study. Mult Scler Relat Disord (2021) 56:103281. doi: 10.1016/j.msard.2021.103281
10. Rommer PS, Eichstädt K, Ellenberger D, Flachenecker P, Friede T, Haas J, et al. Symptomatology and symptomatic treatment in multiple sclerosis: Results from a nationwide MS registry. Mult Scler (2019) 25:1641–52. doi: 10.1177/1352458518799580
11. Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. (1983) 33:1444–52. doi: 10.1212/wnl.33.11.1444
12. Frischer JM, Bramow S, Dal-Bianco A, Lucchinetti CF, Rauschka H, Schmidbauer M, et al. The relation between inflammation and neurodegeneration in multiple sclerosis brains. Brain. (2009) 132:1175–89. doi: 10.1093/brain/awp070
13. Galea I, Ward-Abel N, Heesen C. Relapse in multiple sclerosis. BMJ (2015) 350:h1765. doi: 10.1136/bmj.h1765
14. Ellenberger D, Eichstädt K, Flachenecker P, Friede T, Haas J, Kleinschnitz C, et al. Decreasing longitudinal use of glucocorticosteroids in multiple sclerosis. Mult Scler Relat Disord (2018) 25:173–4. doi: 10.1016/j.msard.2018.07.040
15. Skierlo S, Rommer PS, Zettl UK. Symptomatic treatment in multiple sclerosis-interim analysis of a nationwide registry. Acta Neurol Scand (2017) 135:394–9. doi: 10.1111/ane.12612
16. Giovannoni G. Disease-modifying treatments for early and advanced multiple sclerosis: a new treatment paradigm. Curr Opin Neurol (2018) 31:233–43. doi: 10.1097/WCO.0000000000000561
17. Lünemann JD, Ruck T, Muraro PA, Bar-Or A, Wiendl H. Immune reconstitution therapies: concepts for durable remission in multiple sclerosis. Nat Rev Neurol (2020) 16:56–62. doi: 10.1038/s41582-019-0268-z
18. Rommer PS, Milo R, Han MH, Satyanarayan S, Sellner J, Hauer L, et al. Immunological aspects of approved MS therapeutics. Front Immunol (2019) 10:1564. doi: 10.3389/fimmu.2019.01564
19. Comi G, Bar-Or A, Lassmann H, Uccelli A, Hartung H-P, Montalban X, et al. Role of b cells in multiple sclerosis and related disorders. Ann Neurol (2021) 89:13–23. doi: 10.1002/ana.25927
20. Milo R. Therapies for multiple sclerosis targeting b cells. Croat Med J (2019) 60:87–98. doi: 10.3325/cmj.2019.60.87
21. Baker D, Herrod SS, Alvarez-Gonzalez C, Zalewski L, Albor C, Schmierer K. Both cladribine and alemtuzumab may effect MS via b-cell depletion. Neurol Neuroimmunol Neuroinflamm (2017) 4:e360. doi: 10.1212/NXI.0000000000000360
22. Hauser SL, Cree BAC. Treatment of multiple sclerosis: A review. Am J Med (2020) 133:1380–1390.e2. doi: 10.1016/j.amjmed.2020.05.049
23. Rommer P, Zettl UK. Treatment options in multiple sclerosis and neuromyelitis optica spectrum disorders. Curr Pharm Des (2022) 28:428–36. doi: 10.2174/1381612827666210920151231
24. Wiendl H, Gold R, Berger T, Derfuss T, Linker R, Mäurer M, et al. Multiple sclerosis therapy consensus group (MSTCG): position statement on disease-modifying therapies for multiple sclerosis (white paper). Ther Adv Neurol Disord (2021) 14:17562864211039648. doi: 10.1177/17562864211039648
25. Belbasis L, Bellou V, Evangelou E, Ioannidis JPA, Tzoulaki I. Environmental risk factors and multiple sclerosis: an umbrella review of systematic reviews and meta-analyses. Lancet Neurology (2015) 14:263–73. doi: 10.1016/S1474-4422(14)70267-4
26. Olsson T, Barcellos LF, Alfredsson L. Interactions between genetic, lifestyle and environmental risk factors for multiple sclerosis. Nat Rev Neurol (2017) 13:25–36. doi: 10.1038/nrneurol.2016.187
27. Bjornevik K, Cortese M, Healy BC, Kuhle J, Mina MJ, Leng Y, et al. Longitudinal analysis reveals high prevalence of Epstein-Barr virus associated with multiple sclerosis. Science (2022) 375:296–301. doi: 10.1126/science.abj8222
28. Hecker M, Bühring J, Fitzner B, Rommer PS, Zettl UK. Genetic, environmental and lifestyle determinants of accelerated telomere attrition as contributors to risk and severity of multiple sclerosis. Biomolecules. (2021) 11:1510. doi: 10.3390/biom11101510
29. Rommer PS, Ellenberger D, Hellwig K, Haas J, Pöhlau D, Stahmann A, et al. Relapsing and progressive MS: the sex-specific perspective. Ther Adv Neurol Disord (2020) 13:1756286420956495. doi: 10.1177/1756286420956495
30. Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, et al. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res (2001) 29:308–11. doi: 10.1093/nar/29.1.308
32. International Multiple Sclerosis Genetics Consortium. Multiple sclerosis genomic map implicates peripheral immune cells and microglia in susceptibility. Science (2019) 365:eaav7188. doi: 10.1126/science.aav7188
33. Machiela MJ, Chanock SJ. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics. (2015) 31:3555–7. doi: 10.1093/bioinformatics/btv402
34. GTEx Consortium. The GTEx consortium atlas of genetic regulatory effects across human tissues. Science (2020) 369:1318–30. doi: 10.1126/science.aaz1776
35. James T, Lindén M, Morikawa H, Fernandes SJ, Ruhrmann S, Huss M, et al. Impact of genetic risk loci for multiple sclerosis on expression of proximal genes in patients. Hum Mol Genet (2018) 27:912–28. doi: 10.1093/hmg/ddy001
36. Vandiedonck C. Genetic association of molecular traits: A help to identify causative variants in complex diseases. Clin Genet (2018) 93:520–32. doi: 10.1111/cge.13187
37. Zhernakova DV, Deelen P, Vermaat M, van Iterson M, van Galen M, Arindrarto W, et al. Identification of context-dependent expression quantitative trait loci in whole blood. Nat Genet (2017) 49:139–45. doi: 10.1038/ng.3737
38. Chen L, Ge B, Casale FP, Vasquez L, Kwan T, Garrido-Martín D, et al. Genetic drivers of epigenetic and transcriptional variation in human immune cells. Cell. (2016) 167:1398–1414.e24. doi: 10.1016/j.cell.2016.10.026
39. Shi Y. Mechanistic insights into precursor messenger RNA splicing by the spliceosome. Nat Rev Mol Cell Biol (2017) 18:655–70. doi: 10.1038/nrm.2017.86
40. Lee Y, Rio DC. Mechanisms and regulation of alternative pre-mRNA splicing. Annu Rev Biochem (2015) 84:291–323. doi: 10.1146/annurev-biochem-060614-034316
41. Fu X-D, Ares M. Context-dependent control of alternative splicing by RNA-binding proteins. Nat Rev Genet (2014) 15:689–701. doi: 10.1038/nrg3778
42. Manning KS, Cooper TA. The roles of RNA processing in translating genotype to phenotype. Nat Rev Mol Cell Biol (2017) 18:102–14. doi: 10.1038/nrm.2016.139
43. Anna A, Monika G. Splicing mutations in human genetic disorders: examples, detection, and confirmation. J Appl Genet (2018) 59:253–68. doi: 10.1007/s13353-018-0444-7
44. Pineda JMB, Bradley RK. Most human introns are recognized via multiple and tissue-specific branchpoints. Genes Dev (2018) 32:577–91. doi: 10.1101/gad.312058.118
45. Yabas M, Elliott H, Hoyne GF. The role of alternative splicing in the control of immune homeostasis and cellular differentiation. Int J Mol Sci (2015) 17:3. doi: 10.3390/ijms17010003
46. Putscher E, Hecker M, Fitzner B, Lorenz P, Zettl UK. Principles and practical considerations for the analysis of disease-associated alternative splicing events using the gateway cloning-based minigene vectors pDESTsplice and pSpliceExpress. Int J Mol Sci (2021) 22:5154. doi: 10.3390/ijms22105154
47. Scotti MM, Swanson MS. RNA Mis-splicing in disease. Nat Rev Genet (2016) 17:19–32. doi: 10.1038/nrg.2015.3
48. Yeo GW, van Nostrand E, Holste D, Poggio T, Burge CB. Identification and analysis of alternative splicing events conserved in human and mouse. Proc Natl Acad Sci U S A. (2005) 102:2850–5. doi: 10.1073/pnas.0409742102
49. Hecker M, Rüge A, Putscher E, Boxberger N, Rommer PS, Fitzner B, et al. Aberrant expression of alternative splicing variants in multiple sclerosis - a systematic review. Autoimmun Rev (2019) 18:721–32. doi: 10.1016/j.autrev.2019.05.010
50. Gregory SG, Schmidt S, Seth P, Oksenberg JR, Hart J, Prokop A, et al. Interleukin 7 receptor alpha chain (IL7R) shows allelic and functional association with multiple sclerosis. Nat Genet (2007) 39:1083–91. doi: 10.1038/ng2103
51. Desmet F-O, Hamroun D, Lalande M, Collod-Béroud G, Claustres M, Béroud C. Human splicing finder: an online bioinformatics tool to predict splicing signals. Nucleic Acids Res (2009) 37:e67. doi: 10.1093/nar/gkp215
52. Zhu Y, Xu G, Yang YT, Xu Z, Chen X, Shi B, et al. POSTAR2: deciphering the post-transcriptional regulatory logics. Nucleic Acids Res (2019) 47:D203–11. doi: 10.1093/nar/gky830
53. Hecker M, Fitzner B, Putscher E, Schwartz M, Winkelmann A, Meister S, et al. Implication of genetic variants in primary microRNA processing sites in the risk of multiple sclerosis. EBioMedicine. (2022) 80:104052. doi: 10.1016/j.ebiom.2022.104052
54. Thompson AJ, Banwell BL, Barkhof F, Carroll WM, Coetzee T, Comi G, et al. Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. Lancet Neurol (2018) 17:162–73. doi: 10.1016/S1474-4422(17)30470-2
55. McCall MN, McMurray HR, Land H, Almudevar A. On non-detects in qPCR data. Bioinformatics. (2014) 30:2310–6. doi: 10.1093/bioinformatics/btu239
56. Kishore S, Khanna A, Stamm S. Rapid generation of splicing reporters with pSpliceExpress. Gene. (2008) 427:104–10. doi: 10.1016/j.gene.2008.09.021
57. Ritchie ME, Phipson B, Wu Di, Hu Y, Law CW, Shi W, et al. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res (2015) 43:e47. doi: 10.1093/nar/gkv007
59. Barash Y, Calarco JA, Gao W, Pan Q, Wang X, Shai O, et al. Deciphering the splicing code. Nature. (2010) 465:53–9. doi: 10.1038/nature09000
60. Carazo F, Romero JP, Rubio A. Upstream analysis of alternative splicing: a review of computational approaches to predict context-dependent splicing factors. Brief Bioinform (2019) 20:1358–75. doi: 10.1093/bib/bby005
61. Vaz-Drago R, Custódio N, Carmo-Fonseca M. Deep intronic mutations and human disease. Hum Genet (2017) 136:1093–111. doi: 10.1007/s00439-017-1809-4
62. Yi Y, Zhao Y, Huang Y, Wang D. A brief review of RNA-protein interaction database resources. Noncoding RNA. (2017) 3:6. doi: 10.3390/ncrna3010006
63. Ohno K, Takeda J-I, Masuda A. Rules and tools to predict the splicing effects of exonic and intronic mutations. Wiley Interdiscip Rev RNA. (2018) 9:e1451. doi: 10.1002/wrna.1451
64. Ptok J, Müller L, Theiss S, Schaal H. Context matters: Regulation of splice donor usage. Biochim Biophys Acta Gene Regul Mech (2019) 1862:194391. doi: 10.1016/j.bbagrm.2019.06.002
65. Uhlen M, Karlsson MJ, Zhong W, Tebani A, Pou C, Mikes J, et al. A genome-wide transcriptomic analysis of protein-coding genes in human blood cells. Science (2019) 366:eaax9198. doi: 10.1126/science.aax9198
66. Monaco G, Lee B, Xu W, Mustafah S, Hwang YY, Carré C, et al. RNA-Seq signatures normalized by mRNA abundance allow absolute deconvolution of human immune cell types. Cell Rep (2019) 26:1627–1640.e7. doi: 10.1016/j.celrep.2019.01.041
67. Ohle L-M, Ellenberger D, Flachenecker P, Friede T, Haas J, Hellwig K, et al. Chances and challenges of a long-term data repository in multiple sclerosis: 20th birthday of the German MS registry. Sci Rep (2021) 11:13340. doi: 10.1038/s41598-021-92722-x
68. Cardamone G, Paraboschi EM, Rimoldi V, Duga S, Soldà G, Asselta R. The characterization of GSDMB splicing and backsplicing profiles identifies novel isoforms and a circular RNA that are dysregulated in multiple sclerosis. Int J Mol Sci (2017) 18:576. doi: 10.3390/ijms18030576
69. Galarza-Muñoz G, Briggs FBS, Evsyukova I, Schott-Lerner G, Kennedy EM, Nyanhete T, et al. Human epistatic interaction controls IL7R splicing and increases multiple sclerosis risk. Cell. (2017) 169:72–84.e13. doi: 10.1016/j.cell.2017.03.007
70. Hoe E, McKay FC, Schibeci SD, Gandhi K, Heard RN, Stewart GJ, et al. Functionally significant differences in expression of disease-associated IL-7 receptor alpha haplotypes in CD4 T cells and dendritic cells. J Immunol (2010) 184:2512–7. doi: 10.4049/jimmunol.0902900
71. Mazzucchelli R, Durum SK. Interleukin-7 receptor expression: intelligent design. Nat Rev Immunol (2007) 7:144–54. doi: 10.1038/nri2023
72. Lundström W, Highfill S, Walsh STR, Beq S, Morse E, Kockum I, et al. Soluble IL7Rα potentiates IL-7 bioactivity and promotes autoimmunity. Proc Natl Acad Sci U S A. (2013) 110:E1761–70. doi: 10.1073/pnas.1222303110
73. Garrido-Martín D, Borsari B, Calvo M, Reverter F, Guigó R. Identification and analysis of splicing quantitative trait loci across multiple tissues in the human genome. Nat Commun (2021) 12:727. doi: 10.1038/s41467-020-20578-2
74. Morrison FS, Locke JM, Wood AR, Tuke M, Pasko D, Murray A, et al. The splice site variant rs11078928 may be associated with a genotype-dependent alteration in expression of GSDMB transcripts. BMC Genomics (2013) 14:627. doi: 10.1186/1471-2164-14-627
75. Zhou Z, He H, Wang K, Shi X, Wang Y, Su Y, et al. A from cytotoxic lymphocytes cleaves GSDMB to trigger pyroptosis in target cells. Science (2020) 368:aaz7548. doi: 10.1126/science.aaz7548
76. Eyre S, Bowes J, Diogo D, Lee A, Barton A, Martin P, et al. High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis. Nat Genet (2012) 44:1336–40. doi: 10.1038/ng.2462
77. McGovern DPB, Gardet A, Törkvist L, Goyette P, Essers J, Taylor KD, et al. Genome-wide association identifies multiple ulcerative colitis susceptibility loci. Nat Genet (2010) 42:332–7. doi: 10.1038/ng.549
78. Cardamone G, Paraboschi EM, Soldà G, Cantoni C, Supino D, Piccio L, et al. Not only cancer: the long non-coding RNA MALAT1 affects the repertoire of alternatively spliced transcripts and circular RNAs in multiple sclerosis. Hum Mol Genet (2019) 28:1414–28. doi: 10.1093/hmg/ddy438
79. Matesanz F, Potenciano V, Fedetz M, Ramos-Mozo P, Del Abad-Grau MM, Karaky M, et al. A functional variant that affects exon-skipping and protein expression of SP140 as genetic mechanism predisposing to multiple sclerosis. Hum Mol Genet (2015) 24:5619–27. doi: 10.1093/hmg/ddv256
80. Zucchelli C, Tamburri S, Quilici G, Palagano E, Berardi A, Saare M, et al. Structure of human Sp140 PHD finger: an atypical fold interacting with Pin1. FEBS J (2014) 281:216–31. doi: 10.1111/febs.12588
81. Karaky M, Fedetz M, Potenciano V, Andrés-León E, Codina AE, Barrionuevo C, et al. SP140 regulates the expression of immune-related genes associated with multiple sclerosis and other autoimmune diseases by NF-κB inhibition. Hum Mol Genet (2018) 27:4012–23. doi: 10.1093/hmg/ddy284
82. Burgoyne RD, Haynes LP. Understanding the physiological roles of the neuronal calcium sensor proteins. Mol Brain (2012) 5:2. doi: 10.1186/1756-6606-5-2
83. Heizmann CW. Ca2+-binding proteins of the EF-hand superfamily: Diagnostic and prognostic biomarkers and novel therapeutic targets. Methods Mol Biol (2019) 1929:157–86. doi: 10.1007/978-1-4939-9030-6_11
84. Braunewell K-H. The darker side of Ca2+ signaling by neuronal Ca2+-sensor proteins: from alzheimer's disease to cancer. Trends Pharmacol Sci (2005) 26:345–51. doi: 10.1016/j.tips.2005.04.008
85. Laitinen VH, Rantapero T, Fischer D, Vuorinen EM, Tammela TLJ, Wahlfors T, et al. Fine-mapping the 2q37 and 17q11.2-q22 loci for novel genes and sequence variants associated with a genetic predisposition to prostate cancer. Int J Cancer (2015) 136:2316–27. doi: 10.1002/ijc.29276
86. Natanzon Y, Earp M, Cunningham JM, Kalli KR, Wang C, Armasu SM, et al. Genomic analysis using regularized regression in high-grade serous ovarian cancer. Cancer Inform (2018) 17:1176935118755341. doi: 10.1177/1176935118755341
87. Hollenbach JA, Oksenberg JR. The immunogenetics of multiple sclerosis: A comprehensive review. J Autoimmun (2015) 64:13–25. doi: 10.1016/j.jaut.2015.06.010
88. Link J, Lorentzen AR, Kockum I, Duvefelt K, Lie BA, Celius EG, et al. Two HLA class I genes independently associated with multiple sclerosis. J Neuroimmunol (2010) 226:172–6. doi: 10.1016/j.jneuroim.2010.07.006
89. Yeo TW, de JPL, SG G, LF B, Walton A, Goris An, et al. A second major histocompatibility complex susceptibility locus for multiple sclerosis. Ann Neurol (2007) 61:228–36. doi: 10.1002/ana.21063
90. Keegan NP, Fletcher S. A spotter's guide to SNPtic exons: The common splice variants underlying some SNP-phenotype correlations. Mol Genet Genomic Med (2022) 10:e1840. doi: 10.1002/mgg3.1840
91. Cai YC, Bullard JM, Thompson NL, Spremulli LL. Interaction of mitochondrial elongation factor tu with aminoacyl-tRNA and elongation factor ts. J Biol Chem (2000) 275:20308–14. doi: 10.1074/jbc.M001899200
92. D'Souza AR, Minczuk M. Mitochondrial transcription and translation: overview. Essays Biochem (2018) 62:309–20. doi: 10.1042/EBC20170102
93. Boczonadi V, Ricci G, Horvath R. Mitochondrial DNA transcription and translation: clinical syndromes. Essays Biochem (2018) 62:321–40. doi: 10.1042/EBC20170103
94. Gil-Varea E, Fedetz M, Eixarch H, Spataro N, Villar LM, Urcelay E, et al. A new risk variant for multiple sclerosis at 11q23.3 locus is associated with expansion of CXCR5+ circulating regulatory T cells. J Clin Med (2020) 9:625. doi: 10.3390/jcm9030625
95. Handel AE, Handunnetthi L, Berlanga AJ, Watson CT, Morahan JM, Ramagopalan SV. The effect of single nucleotide polymorphisms from genome wide association studies in multiple sclerosis on gene expression. PloS One (2010) 5:e10142. doi: 10.1371/journal.pone.0010142
96. Alcina A, Fedetz M, Fernández O, Saiz A, Izquierdo G, Lucas M, et al. Identification of a functional variant in the KIF5A-CYP27B1-METTL1-FAM119B locus associated with multiple sclerosis. J Med Genet (2013) 50:25–33. doi: 10.1136/jmedgenet-2012-101085
97. Bergougnoux A, Délétang K, Pommier A, Varilh J, Houriez F, Altieri JP, et al. Functional characterization and phenotypic spectrum of three recurrent disease-causing deep intronic variants of the CFTR gene. J Cyst Fibros (2019) 18:468–75. doi: 10.1016/j.jcf.2018.10.012
98. Bauwens M, Garanto A, Sangermano R, Naessens S, Weisschuh N, de ZJ, et al. ABCA4-associated disease as a model for missing heritability in autosomal recessive disorders: novel noncoding splice, cis-regulatory, structural, and recurrent hypomorphic variants. Genet Med (2019) 21:1761–71. doi: 10.1038/s41436-018-0420-y
99. Jung H, Lee KS, Choi JK. Comprehensive characterisation of intronic mis-splicing mutations in human cancers. Oncogene. (2021) 40:1347–61. doi: 10.1038/s41388-020-01614-3
100. Moles-Fernández A, Domènech-Vivó J, Tenés A, Balmaña J, Diez O, Gutiérrez-Enríquez S. Role of splicing regulatory elements and in silico tools usage in the identification of deep intronic splicing variants in hereditary Breast/Ovarian cancer genes. Cancers (Basel) (2021) 13:3341. doi: 10.3390/cancers13133341
101. Kim-Hellmuth S, Aguet F, Oliva M, Muñoz-Aguirre M, Kasela S, Wucher V, et al. Cell type-specific genetic regulation of gene expression across human tissues. Science (2020) 369:eaaz8528. doi: 10.1126/science.aaz8528
102. Zhao S, Fung-Leung W-P, Bittner A, Ngo K, Liu X. Comparison of RNA-seq and microarray in transcriptome profiling of activated T cells. PloS One (2014) 9:e78644. doi: 10.1371/journal.pone.0078644
Keywords: B cells, genetic disease risk, splicing reporter minigene assay, multiple sclerosis, single-nucleotide polymorphisms, TSFM, alternative splicing
Citation: Putscher E, Hecker M, Fitzner B, Boxberger N, Schwartz M, Koczan D, Lorenz P and Zettl UK (2022) Genetic risk variants for multiple sclerosis are linked to differences in alternative pre-mRNA splicing. Front. Immunol. 13:931831. doi: 10.3389/fimmu.2022.931831
Received: 29 April 2022; Accepted: 12 October 2022;
Published: 28 October 2022.
Edited by:
Hans-Peter Hartung, Heinrich Heine University of Düsseldorf, GermanyReviewed by:
David Otaegui, Biodonostia Health Research Institute (IIS Biodonostia), SpainJinming Han, Capital Medical University, China
Copyright © 2022 Putscher, Hecker, Fitzner, Boxberger, Schwartz, Koczan, Lorenz and Zettl. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Michael Hecker, michael.hecker@rocketmail.com