- 1Genomic Medicine Laboratory UILDM, IRCCS Santa Lucia Foundation, Rome, Italy
- 2Unità Operativa Complessa di Neurologia, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy
- 3Department of Clinical and Behavioral Neurology, IRCCS Fondazione Santa Lucia, Rome, Italy
- 4Istituto di Neurologia, Università Cattolica del Sacro Cuore, Rome, Italy
- 5John Walton Muscular Dystrophy Research Centre, Newcastle University and Newcastle Hospitals NHS Foundation Trusts, Newcastle UponTyne, United Kingdom
- 6Department of Biomedical Sciences, Catholic University Our Lady of Good Counsel, Tirana, Albania
- 7Medical Genetics Laboratory, Department of Biomedicine and Prevention, Tor Vergata University, Rome, Italy
Introduction: Despite the progress made in the study of Facioscapulohumeral Dystrophy (FSHD), the wide heterogeneity of disease complicates its diagnosis and the genotype-phenotype correlation among patients and within families. In this context, the present work employed Whole Exome Sequencing (WES) to investigate known and unknown genetic contributors that may be involved in FSHD and may represent potential disease modifiers, even in presence of a D4Z4 Reduced Allele (DRA).
Methods: A cohort of 126 patients with clinical signs of FSHD were included in the study, which were characterized by D4Z4 sizing, methylation analysis and WES. Specific protocols were employed for D4Z4 sizing and methylation analysis, whereas the Illumina® Next-Seq 550 system was utilized for WES. The study included both patients with a DRA compatible with FSHD diagnosis and patients with longer D4Z4 alleles. In case of patients harboring relevant variants from WES, the molecular analysis was extended to the family members.
Results: The WES data analysis highlighted 20 relevant variants, among which 14 were located in known genetic modifiers (SMCHD1, DNMT3B and LRIF1) and 6 in candidate genes (CTCF, DNMT1, DNMT3A, EZH2 and SUV39H1). Most of them were found together with a permissive short (4–7 RU) or borderline/long DRA (8–20 RU), supporting the possibility that different genes can contribute to disease heterogeneity in presence of a FSHD permissive background. The segregation and methylation analysis among family members, together with clinical findings, provided a more comprehensive picture of patients.
Discussion: Our results support FSHD pathomechanism being complex with a multigenic contribution by several known (SMCHD1, DNMT3B, LRIF1) and possibly other candidate genes (CTCF, DNMT1, DNMT3A, EZH2, SUV39H1) to disease penetrance and expressivity. Our results further emphasize the importance of extending the analysis of molecular findings within the proband’s family, with the purpose of providing a broader framework for understanding single cases and allowing finer genotype-phenotype correlations in FSHD-affected families.
1 Introduction
The introduction of Next-Generation Sequencing (NGS) into the clinical practice has revolutionized the genetic diagnosis and counseling approach of many Neuromuscular Diseases (NMDs).
In particular, NGS allows detecting a wide range of known genetic alterations associated with NMDs as well as identifying novel genetic variations that can expand the genetic heterogeneity of NMDs (Barp et al., 2021). However, reduced penetrance, variable onset, and expressivity as well as the presence of extra-muscular symptoms in many patients still make the genotype-phenotype correlation of NMDs challenging. Among them, FacioScapuloHumeral Dystrophy (FSHD) represents an excellent example of such level of complexity (Caputo et al., 2022a). FSHD (OMIM #158900, #158901, #619477, #619478) is a skeletal muscle disorder with an estimated prevalence of 1:8000–20.000 (Mostacciuolo et al., 2009; Deenen et al., 2014). A progressive and often asymmetric weakness of facial, shoulder and upper arm muscles are typical features of disease, although abdominal, hip girdle and lower leg muscles are also frequently involved (Preston et al., 2020). Typically, FSHD is inherited as an autosomal dominant disorder, although reduced penetrance and variable expressivity can occur among patients and within families (De Simone et al., 2017; Ricci et al., 2020). FSHD can be distinguished in two forms, namely, FSHD1 and FSHD2, although it can also occur as a compound form of disease (FSHD1+FSHD2). From a genetic perspective, FSHD is associated with the contraction of a macrosatellite repeat array on chromosome 4q35 that is referred to as D4Z4. This region normally consists of 11 to >100 Repeated Units (RU) of D4Z4 elements, whereas it is reduced to 1–10 RU in FSHD1 subjects (Wijmenga et al., 1992). In addition to the D4Z4 contraction, FSHD has been associated with the presence of pathogenic variants within the SMCHD1 (18p11.32), DNMT3B (20q11.21) and LRIF1 (1p13.3) genes. These genes have been described as disease modifiers in FSHD1 cases (i.e., with D4Z4 size of 8–10 RU) or as causative genes in FSHD2 (with a D4Z4 of 11–20 RU) (Sacconi et al., 2009; Lemmers et al., 2012; van der Boogard et al., 2016; Cascella et al., 2018; Strafella et al., 2019; Hamanaka et al., 2020). In addition, two subtelomeric variants have been identified at chromosome 4, namely, the 4qA and 4qB alleles. Although both are present in the general population, only the 4qA allele is associated with FSHD and it is thereby referred to as “permissive” allele (Lemmers et al., 2010). Moreover, the DNA methylation status of the D4Z4 locus has been shown to significantly contribute to FSHD severity and penetrance (Lemmers et al., 2015; Himeda et al., 2019). Altogether, these events lead to the relaxation of chromatin conformation, which, in turn, results in the derepression of DUX4 gene, which is stably transcribed in the presence of the 4qA haplotype containing a polyadenylation signal. In muscle cells, the aberrant expression of DUX4 has been associated with the induction of cell death, oxidative stress and inflammatory pathways, which are thought to be responsible for the progression of muscle damage also in vivo (Greco et al., 2020; Cohen et al., 2021).
Despite the progress made in the field, the wide range of mild to severe phenotypes, the occurrence of extra-muscular features, the variable age of onset and progression of disease advocate for FSHD being a complex disorder (Sacconi et al., 2019; Greco et al., 2020). In our practice, approximately 60% of patients with a clinical suspicion of disease are found to be carriers of a reduced D4Z4 allele compatible with an FSHD1 diagnosis (Zampatti et al., 2019), a percentage that is highly variable and dependent on the experience of the different neurological centers referring the patients. In addition, a reduced D4Z4 allele in combination with a permissive haplotype has been observed in approximately 3% of the healthy population (Scionti et al., 2012; Ricci et al., 2020). Furthermore, the disease severity has been shown to account for approximately 40% by familial factors and 10% by the D4Z4 repeat array size (Mul et al., 2018). Given these premises, it is plausible that other (epi)genetic factors contribute to the clinical variability and heterogeneity of FSHD, and the knowledge of these could be important for increasing the accuracy of diagnosis and therefore genetic counseling of patients and families. To this purpose, the present work employed Whole Exome Sequencing (WES) to investigate known and unknown genetic contributors that may be involved in FSHD and may represent potential disease modifiers, even in presence of a D4Z4 Reduced Allele (DRA). The study included both patients with a D4Z4 Reduced Allele (DRA) compatible with FSHD diagnosis (≤10RU) and patients with longer D4Z4 sizes. The study was performed on a large cohort of patients characterized by D4Z4 sizing, methylation analysis and WES. In case of patients reporting variants of interest from WES analysis, the study was extended to the family members in order to provide a more comprehensive picture of the cases.
2 Methods
2.1 Study cohort
The study involved 126 Italian patients with clinical signs of FSHD, which accessed to the Genomic Medicine Laboratory-UILDM at the Santa Lucia Foundation IRCCS for the standard molecular diagnosis. The presence of D4Z4 Reduced Allele (DRA) was evaluated during the diagnostic routine and was utilized to select the study cohort with the purpose of including patients with variable D4Z4 size. In particular, the molecular assessment of DRA was performed using PFGE and Southern blotting followed by hybridization with specific probes P13-E11 as previously described (Zampatti et al., 2019). The patient’s cohort displayed a variable number of RUs including 15 patients with 1–3 RUs, 80 patients with 4–7 RUs, 7 patients with 8–10 RUs, 2 patients with 11–20 RUs, 6 patients with RUs>20 RUs). Moreover, 16 patients carried two permissive (i.e., both 4qA) DRAs, in the size range between 3 and 20 RU (Supplementary Table S1). The patient’s cohort presented a Female:Male (F:M) ratio of 45:55 and an average age of 52.5 ± 17.7 years. In addition, a cohort of 100 Italian subjects matched for age and sex were included in the study as reference group (Supplementary Table S1).
The clinical evaluation of patients was performed by expert neurologists from Fondazione Policlinico Gemelli IRCCS, using the Clinical Severity Scale (CSS) (Ricci et al., 1999) and the FSHD Clinical Score scale (Lamperti et al., 2010), scores specifically designed and validated to assess disease severity in FSHD patients. Muscle MRI was performed on a 1.5 T scanner (Siemens Magnetom Espree), according to published protocols (Tasca et al., 2014; Tasca et al., 2016; Giacomucci et al., 2020). Upper girdle and lower limb muscle MRI scans were evaluated for the presence of imaging patterns supporting the diagnosis of FSHD (Monforte et al., 2022).
Informed consent was obtained from all the subjects included in the present study.
2.2 DNA extraction and methylation analysis
The genomic DNA of patients was extracted from 400 μL of peripheral blood using MagPurix Blood DNA Extraction Kit and MagPurix Automatic Extraction System (Zinexts) according to the manufacturer’s instructions.
Concerning the analysis of methylation, two regions of the D4Z4 locus were evaluated, namely, the DR1 (located 1 Kb upstream of the DUX4 ORF and harboring 29 CpG sites) and the DUX4-PAS (containing 10 CpG sites located within the most distal part of the array and including the Polyadenylation Signal, PAS). In particular, the DUX4-PAS is specific for the 4q distal region; it is ampliefied only in presence of a 4qA allele and it provides information concerning the presence of a DRA (i.e., FSHD1). The DR1 region is located within each D4Z4 RU on both chromosomes 4 and 10 and it is highly useful to identify FSHD2 subjects. The DNA from each patient underwent methylation analysis using a protocol based on Bisulfite Sequencing (BSS), Amplified Fragment Length Polymorphism (AFLP) and Machine Learning (ML) described in our previous work (Caputo et al., 2022b). The ML model employs the methylation levels of four CpG sites (DUX4-PAS_CpG6, DUX4-PAS_CpG3, DR1_CpG1 and DR1_Cpg22) to classify FSHD subjects from non-FSHD ones. Following a specific decision tree (available in Caputo et al., 2022b), the model classify subjects on the basis of specific thresholds of methylation of each CpG site and following a specific order of relevance, that is DUX4-PAS_CpG6; DUX4-PAS_CpG3; DR1_CpG1; DR1_CpG22.
The characterization of 4q subtelomeric variant was assessed for each converted DNA by means of traditional PCR and electrophoresis. In particular, this PCR employs specific primers for DUX4-PAS region, whose amplification is indicative of the presence of at least 4qA allele. The 4qB allele is detected by means of specific primers as well (Caputo et al., 2022b). Three possible 4q configurations were thus distinguished, namely, 4qA/4qA, 4qA/4qB and 4qB/4qB.
2.3 Whole Exome sequencing (WES)
Concerning WES analysis, the Illumina® Next-Seq 550 system was utilized. In particular, 30–50 ng/μL of DNA was employed for library preparation by means of Illumina® DNA Prep with Enrichment and Tagmentation kit according to manufacturer’s instructions. The obtained libraries were sequenced at 2 × 100 bp and the sequencing quality of the resulting data was expected to reach a Quality score >30 (Q30) for ∼80% of total called bases. The analysis of variants was performed, focusing on the variants located in genes (SMCHD1, DNMT3B and LRIF1) known to be associated with FSHD as well as genes that may represent candidate novel genes for the disease. The selection of these genes was performed considering their function as epigenetic regulators of D4Z4, their location near the D4Z4 array or genes being targeted by DUX4. The list of selected genes was reported in Supplementary Table S2.
The resulting variants were visualized by Integrated Genome Viewer (v.2.7.2) and functionally annotated by means of BaseSpace Variant Interpreter (Illumina, v. 2.15.0.110), using GRCh37 as genome build. Only the variants reporting a minimum coverage of 20X were considered for subsequent analysis. Annotated variants were prioritized considering the type of variants (nonsense, frameshift, missense, splicing); the Minor Allele Frequency (MAF<0.001) in publicly available database (gnomAD) and in the internal reference group; their localization in regulatory regions or protein domains (by consultation of Uniprot and Decipher databases); their pathogenicity scores retrieved by interrogation of bioinformatics prediction tools. In particular, REVEL (Rare Exome Variant Ensemble Learner) is a meta-predictor tool for missense variants that integrates different scores (MutPred, FATHMM v2.3, VEST 3.0, PolyPhen-2, SIFT, PROVEAN, MutationAssessor, MutationTaster, LRT, GERP++, SiPhy, phyloP, phastCons) (Ioannidis et al., 2016). Moreover, Varsite (Laskowski et al., 2020) and Missense3D (Ittisoponpisan et al., 2019) were used to predict the potential effect of missense variants on protein structure and function, whereas variants within the splicing region were analyzed by Human Splicing Finder (v.3.1, https://www.genomnis.com/access-hsf).
The variants with a clinical significance were also confirmed by direct sequencing. To this purpose, the DNA was amplified by PCR, using the AmpliTaq Gold DNA Polymerase (Applied Biosystems) reagents in a total volume of 25 μL, following the manufacturer’s instructions. Successively, direct sequencing of the amplified samples was performed by BigDye Terminator v3.1 Cycle Sequencing Kit (Thermo Fisher Scientific) and run on ABI3130xl (Applied Biosystems). The results were finally analyzed with Sequencing Analysis Software v.6 (Applied Biosystems). In addition, the variants of interest were also subjected to segregation analysis among family members, if available.
Finally, the variants were classified according to the ACMG Standards and Guidelines, which provide a clinical interpretation of variants, discriminating among benign, likely benign, with uncertain significance (VUS), likely pathogenic and pathogenic variants (Richards et al., 2015; Ellard et al., 2020). Bioinformatic online platform (Varsome) and public database collecting data concerning DNA genetic variations (Clinvar, LOVD, Decipher) were also employed as supporting tools for the clinical interpretation of variants. In particular, the PP3 rule was applied following the ClinGen recommendations (Evidence-based calibration of computational tools for missense variant pathogenicity classification and ClinGen recommendations for clinical use of PP3/BP4 criteria) (Pejaver et al., 2022). Concerning the application of PM1 and PP2 rules, Decipher was utilized as supporting tool since it provides helpful and user-friendly tools for assigning such criteria. In particular, Decipher was consulted to visualize the gene and regional constraint (for PP2 application) to missense and loss-of-function variants and the localization of functional domains and regulatory regions (for PM1 rule) of the protein corresponding to the genes of interest. In addition, a “benign cut-off frequency” derived by Varsome was also utilized to compare the frequency of the identified variants in the specific genes and assess the BS1 rule in case of variants with a frequency exceeding the cut-off fixed for each gene.
3 Results
The patient’s cohort displayed a variable number of RUs including 15 patients with 1–3 RUs, 80 patients with 4–7 RUs, 7 patients with 8–10 RUs, 2 patients with 11–20 RUs and six patients with RUs>20 RUs). Moreover, 16 patients carried two permissive (i.e., both 4qA) DRAs, in the size range between 3 and 20 RU. All of the patients carried at least a 4qA allele (Supplementary Table S1). The analysis of WES focused the attention on a set of genes selected by their function as epigenetic regulators of D4Z4, their location near the D4Z4 array or genes being targeted by DUX4 (Supplementary Table S2). Successively, the variants were prioritized according to their frequency, localization into regulatory or protein domains, and bioinformatics prediction.
Twenty variants, which were detected in 19 patients with clinical signs of FSHD, emerged from the analysis of WES data. In particular, 14 variants were located in known genes (SMCHD1, DNMT3B and LRIF1), whereas six variants were found in other genes of interest (namely, CTCF, DNMT1, DNMT3A, EZH2, SUV39H1). None of the variants was observed in the reference group. Interestingly, most of the variants were localized within a specific domain or region of interaction with other factors, thereby suggesting a potential functional effect (Figure 1).
FIGURE 1. Illustration of the localization of the identified variants at level of the protein sequence. Missense variants are reported with their protein coding nomenclature whereas variants in the splicing regions are indicated with their nucleotide coding. The figure was built by retrieving the domain localization and visualization from Uniprot and Decipher, respectively.
Among the identified variants, 14 were detected in known FSHD2 causing genes (Table 1), namely, SMCHD1 (n = 12), DNMT3B (n = 1) and LRIF1 (n = 1). These variants were further investigated considering the type (3 frameshift, 4 spicing and 7 missense), frequency in gnomAD and bioinformatics prediction (Table 1). As a result, 12 out of 14 identified variants in known genes were found to be absent or extremely rare in gnomAD database, whereas two variants (SMCHD1:c.3841A>G and DNMT3B:c.1144C>T) reported a frequency higher than the fixed “benign cut-off frequency.” The use of bioinformatic prediction tools predicted a deleterious or uncertain effect for most variants except for three, which were not reported to have a significant impact on protein function/structure or splicing. Moreover, patients harboring variants in known FSHD genes displayed variable D4Z4 sizes, including six individuals displaying 4–7 RU, two subjects carrying 9–10 RU, one patient with a D4Z4>20RU and four patients with two permissive DRAs (namely, 8 + 6 RU, 5 + 10RU, 8 + 20RU, 13 + 20 RU). Segregation analysis was possible for six cases and showed two patients (FSHD13A and FSHD15A) with de novo variants, whereas the remaining ones were inherited or undetermined. Supporting these findings, methylation analysis revealed hypomethylation status consistent with FSHD phenotype in patients harboring such variants, especially in the DR1 region (Table 3). Interestingly, one patient (FSHD1A) harbored a variant in SMCHD1, a variant in LRIF1 and a short DRA (4 RU). Segregation analysis showed that the SMCHD1 variant was inherited from the unaffected father (FSHD1D), whereas the variant in LRIF1 was inherited from the affected mother (FSHD1B), together with the short DRA. Methylation analysis in the patient and the family members revealed the lowest methylation levels in the FSHD1A patient with the three events compared to the other family members (Table 3).
Furthermore, the analysis of WES highlighted the presence of six variants in five genes, namely, CTCF, DNMT3A, DNMT1, EZH2 and SUV39H1 (Table 2). The variants were absent or very rare in gnomAD database. All of them were missense, except for one in-frame insertion located in EZH2. All of the genes harboring such variations presented a significant constraint (Z score ≥ 3.12) to missense variants, at level of the gene and regions including the identified variants. The application of prediction tools allowed assessing a potential effect of such variants on protein function, structure or splicing (Table 2). Moreover, all of the 6 variants were found in combination with a DRA ranging from 4 to 8 RU and, in two cases (FSHD2A and FSHD5A), with an additional permissive (i.e., 4qA) DRA < 20 RU. Segregation analysis among the available family members showed that the identified variants were found in affected individuals together with a DRA. Moreover, the assessment of methylation levels in families harboring such variants revealed hypomethylated profiles mostly consistent with the clinical status of family members (Table 3).
All molecular and clinical data concerning the families harboring variants in known and candidate genes for FSHD have been reported in Tables 3, 4, respectively.
TABLE 3. Molecular characterization of FSHD families harboring the variants identified by WES. The column reporting the D4Z4 size shows the shortest permissive (4qA) allele compatible with the disease for patients 4qA/4qB considering that it is the only one permissive for FSHD. Concerning patients with 4qA/4qA, it has been reported the shortest allele compatible with FSHD, although those ones carrying both D4Z4 alleles with a size <20 RU have been reported since they both could contribute to disease in these cases. The family link has been reported taking the proband as reference subject. The methylation data refer to the method described in Caputo et al. (2022b) that combines the methylation levels of four CpG sites (DUX4-PAS_CpG6, DUX4-PAS_CpG3, DR1_CpG1 and DR1_Cpg22) with machine-learning pipeline to classify FSHD subjects from non-FSHD ones (herein referred as to CTRL). Following a specific decision tree (available in Caputo et al., 2022b), the model classify subjects on the basis of specific thresholds of methylation of each CpG site and following a specific order of relevance, that is DUX4-PAS_CpG6; DUX4-PAS_CpG3; DR1_CpG1; DR1_CpG22. For more details concerning the method employed for methylation analysis, please refer to the article (Caputo et al., 2022b).The utilized thresholds are reported in brackets in each CpG site column. The decision nodes indicate the step of the decision tree utilized by the ML model. RU: Repeated Unit. ML: Machine Learning. CTRL: non-FSHD.
Finally, all the variants identified in known and candidate genes were subjected to ACMG classification, which allowed identifying five Pathogenic variants, 7 Likely pathogenic, 7 VUS and one Likely Benign variant (Supplementary Table S3).
Altogether, these results showed variants in SMCHD1 as one of the most frequent genetic alterations in this study together with D4Z4 contraction, whereas variants in LRIF1 and DNMT3B appeared as rarer events, although they may co-occur together with short D4Z4 contraction and potentially contribute to phenotype variability. Importantly, the WES analysis identified variants in CTCF, DNMT3A, DNMT1, EZH2 and SUV39H1, which have not been described in patients and families with FSHD before and, thus, they may represent novel candidate genetic modifiers for the disease. These variants were found in combination with a DRA, supporting the possibility that different genes can contribute to disease heterogeneity in presence of a FSHD permissive background.
4 Discussion
The comprehension of the mechanisms underlying the complex molecular background of FSHD is an area of active research. On this subject, recent studies described transcriptomic and proteomic markers associated with FSHD clinical severity and progression in muscle and blood (Banerji et al., 2019; Corasolla Carregari et al., 2020; Wong et al., 2020; Banerji et al., 2022). In addition, several studies highlighted DNA hypomethylation as a hallmark of disease (Hartwerk et al., 2013; Gaillard et al., 2014; Huichalaf et al., 2014; Calandra et al., 2016; Haynes et al., 2018; Lemmers et al., 2019; Roche et al., 2019; Gould et al., 2021; Banerji et al., 2022; Caputo et al., 2022b; Erdmann et al., 2022; Hiramuki et al., 2022). In this scenario, the identification of SMCHD1, DNMT3B and LRIF1 as causative or modifier genes in FSHD1 and FSHD2 laid the foundations for considering FSHD as a complex disease, in which multiple genes are likely to contribute to the disease heterogeneity and variability (Caputo et al., 2022a) To this regard, NGS approaches are the ideal tool to allow the simultaneous investigation of known FSHD causing (SMCHD1, DNMT3B and LRIF1) and other potential candidate gene modifiers. Given these premises, the present study employed WES to investigate known and unknown genetic contributors that may be involved in FSHD, even in presence of a DRA. As a result, the analysis of WES data highlighted 20 variants in 19 patients with clinical signs of FSHD (15% of the total patients’ cohort), including five Pathogenic variants, 7 Likely pathogenic, 7 VUS and one Likely Benign variant (Supplementary Table S3). Among them, 14 variants were detected in known FSHD genes (namely, SMCHD1, LRIF1 and DNMT3B). In this case, SMCHD1 appeared as the most frequently altered gene harbouring 12 variants. As expected, the variants were located throughout the entire gene (Figure 1) and were found to impact protein structure/functioning (in the case of missense variants mostly located in the ATPase domain of SMCHD1) or alter splicing and create Premature Termination Codon and truncated proteins (in the case of frameshift, stop-gained or splicing variants) (Table 1). These findings were in line with previous studies (Lemmers et al., 2019; Strafella et al., 2019) highlighting the ATPase domain as one of the most frequently affected domain by FSHD variants, especially by missense ones. Concerning Loss of function (LoF) variants identified in this study (namely, frameshift and intronic variants located in ±1-2 of splice site), they were scattered throughout the gene (Figure 1), consistently with other studies (Lemmers et al., 2019; Sacconi et al., 2019; Strafella et al., 2019; Giacomucci et al., 2020).
Segregation analysis was possible only for six patients carrying SMCHD1 variants, among which two cases displayed de novo variants, whereas the remaining ones were inherited or undetermined (Table 3). Concerning the association with D4Z4 size, 6 individuals carrying SMCHD1 variants displayed 4–7 RU (that is clearly in the FSHD1 range); Two subjects showed a 9–10 RU DRA (borderline/short FSHD1 fragments); 1 patients had a D4Z4>20 RU (that is in the normal range) and 4 patients revealed two permissive (i.e., 4qA/4qA) DRAs (namely, 5 + 10 RU, 8 + 6 RU, 8 + 20 RU and 13 + 20RU). This result is consistent with the fact that SMCHD1 can act as causative or modifier gene for FSHD (Lemmers et al., 2012; Sacconi et al., 2019; Strafella et al., 2019). In addition to previous studies, the present work highlighted six cases harbouring both short DRA (4–7 RU) and genetic variants in known FSHD genes (SMCHD1 and LRIF1) and 1 case carrying detrimental variants in SMCHD1 and D4Z4 size>20RU, suggesting that these events may contribute to the disease variability among patients and families. In this regard, the FSHD1 family represented a very peculiar case, with the segregation of a short DRA (4 RU), a likely pathogenic variant in LRIF1 and a VUS in SMCHD1. In this family, methylation analysis showed that the patient (FSHD1A) harbouring the three events displayed the lowest methylation levels compared to the affected mother (FSHD1B) and unaffected relatives (Table 3). Moreover, the clinical evaluation revealed a different degree of severity degree between the proband FSHD1A and the affected mother (FSHD1B), supporting a potential combined effect of the LRIF1 and SMCHD1 variants in worsening the phenotype (Table 4). Importantly, the LRIF1 variant was detected at heterozygous state in FSHD1 family, which is in contrast with the other family described in literature (Hamanaka et al., 2020), in which biallelic LRIF1 variants have been reported together with a permissive (4qA) and a D4Z4 array of 13 RUs in a patient born from a consanguineous marriage.
The methylation analysis in patients carrying SMCHD1 variants revealed a marked hypomethylation consistent with FSHD, especially at the level of the DR1 region (Table 3). This finding is consistent with previous studies (Hartwerk et al., 2013; Huichalaf et al., 2014; Caputo et al., 2022b; Hiramuki et al., 2022; Zheng et al., 2023). The striking DR1 hypomethylation supported a functional effect for the identified SMCHD1 variants, even for those detected in patients carrying a short DRA (in the FSHD1 range). In fact, these patients displayed lower methylation levels compared to their family members carrying the short DRA only. Patient FSHD19A provided a valuable example of such condition, displaying a short DRA (6RU) combined with the SMCHD1:c.3802-8C>A variant, for which pathogenicity scores and prediction analysis supported a benign effect. In this case, the methylation analysis in FSHD19A patient revealed a marked reduction of methylation levels in DR1 compared to the other family member (FSHD19B), who displayed the same DRA and 4q subtype but was negative for the SMCHD1 variant (Table 3). Moreover, other two SMCHD1 variants were predicted as benign. However, the SMCHD1:c.3841A>G displayed a higher frequency than expected (Table 1) and the low methylation levels detected in the patient (FSHD17A) may be due to the presence of two permissive DRAs (8+6 RU). For the SMCHD1:c.3649A>G variant (detected in the FSHD18A patient), the methylation analysis did not reveal a marked reduction of DR1 methylation levels. Therefore, the prediction analysis was consistent with additional findings, such as frequency and/or methylation analysis, which equally supported a non-significant effect for both variants in these cases. Overall, the observation of differential methylation profiles in patients harbouring SMCHD1 variants supports the hypothesis that the methylation analysis is more accurate for assessing the pathogenicity of SMCHD1 variants compared to bioinformatics prediction tools. In addition, methylation analysis emerges as a useful tool to prioritize subjects in whom the research of variants in FSHD genes should be performed in parallel with D4Z4 sizing.
Concerning the DNMT3B variant, the reported frequency did not support a deleterious effect, and the methylation analysis was lower in the DUX4-PAS region, consistently with the presence of a short DRA in the patient (FSHD8A).
In addition to the variants detected in known FSHD-causing genes, the present work highlighted the presence of six variants in five genes (CTCF, DNMT1, DNMT3A, EZH2 and SUV39H1), which have been involved in the context of FSHD pathogenesis, although no variant has been described in any of them in FSHD patients. Interestingly, all of these genes have been described as epigenetic regulators of the D4Z4 locus in the context of FSHD (Zeng et al., 2009; Neguembor et al., 2010; Huichalaf et al., 2014; Himeda et al., 2015). All of them have been found to participate in the maintenance of DUX4-repressive machinery, by regulating chromatin modifications (namely, the H3K27me3 and H3K9me3 repressive markers) or DNA methylation (Figure 2). The former are mainly mediated by the activity of EZH2 (which is a member of the Polycomb Repressor Complex 2, PRC2) and SUV39H1, whereas the latter are exerted by DNA Methyltransferases (DNMTs), including DNMT1, DNMT3A and DNMT3B, which are enriched to the FSHD locus and display a redundant role (Huichalaf et al., 2014; Haynes et al., 2018). Moreover, CTCF acts as a multifunctional protein that can mediate transcriptional silencing or activation by creating accessible or inaccessible loops of chromatin at specific sites (Ottaviani et al., 2009; Caputo et al., 2022a). Interestingly, the consultation of Uniprot database revealed that the variants identified in EZH2 gene were located in the PRC2 complex domain, which interacts with DNMT1, DNMT3A and DNMT3B; the DNMT1 variant was located in the catalytic domain interacting with PRC2; the variant of DNMT3A is located in PWWP domain interacting with DNMT1 and DNMT3B. Moreover, the variant identified in CTCF has been found to be located in the Zinc Finger 9 (ZF9, C2H2-type 9), which has been involved in the formation and directionality of base-specific interactions between CTCF and its binding sites. Interestingly, the function of ZF9 (together with ZF10 and ZF11) enables CTCF to recognize different DNA sequences across the genome and to promote transcriptional insulation that has been previously described in the pathophysiology of FSHD (Ottaviani et al., 2009; Yin et al., 2017; Xu et al., 2018; Huang et al., 2021). Furthermore, the variant identified in SUV39H1 was found upstream the pre-SET domain, which plays a structural function in stabilizing the SET domain of the protein. SUV39H1 has been previously proposed as a candidate gene for FSHD, because of its role in mediating the methylation of H3K9, which is critical for HP1γ/cohesion binding (both involved in DUX4 suppression), and for SMCHD1 recruitment, which in turn mediates DNA methylation at D4Z4 (Zeng et al., 2009; Zeng et al., 2014; Sacconi et al., 2019). The consultation of public databases (Clinvar and LOVD) reported the EZH2:c.604G>A and DNMT1:c.4001C>T variants as VUS (Clinvar), whereas the other were not described either in Clinvar or in LOVD. In particular, the EZH2:c.604G>A was described as VUS for Weaver Smith Syndrome (OMIM #277590), an overgrowth syndrome characterized by accelerated skeletal maturation, characteristic facial appearance and camptodactyly. The DNMT1:c.4001C>T was reported as VUS for Hereditary Sensory Neuropathy-Deafness-Dementia Syndrome (OMIM #614116), a degenerative disorder of the central and peripheral nervous systems characterized by sensorineural hearing loss, cerebellar ataxia, narcolepsy and dementia.
FIGURE 2. Overview of the known and candidate genes harboring the variants identified in FSHD patients and their role in the maintenance of the DUX4-repressive machinery. This figure has been created with Biorender.com.
Further evidence including functional studies will be needed to finally confirm the modulatory effect of the newly identified variants in FSHD. However, the above-discussed findings and the role of the genes in maintaining the repressive pressure on the D4Z4 locus suggest that CTCF, DNMT1, DNMT3A, EZH2 and SUV39H1 might be further investigated as genes whose alteration contributes to the permissive (epi)genetic environment required to develop FSHD. Interestingly, the patients carrying a variant in one of the candidate genes showed variable D4Z4 sizes, including permissive DRAs clearly falling in the FSHD1 range (n = 3), borderline/short fragments (n = 1) and 2 cases with two permissive (i.e., 4qA/4qA) alleles (namely, 5 + 12 RU and 5 + 16 RU) in the FSHD2 range. The integration of these findings with segregation analysis, methylation status, and clinical findings, provided a more comprehensive picture of the probands’ and family phenotype (Table 4).
Methylation analysis in patients and family members harbouring variants in known and candidate genes was mostly consistent with affected/unaffected subjects, although family studies highlighted reduced methylation profiles in five unaffected subjects (FSHD1C, FSHD2B, FSHD4C, FSHD6C and FSHD10C). Of note, all of them were positive to the variants segregating in the families and displayed variable D4Z4 size, which may affect the penetrance of disease together with unknown mechanisms.
Moreover, we observed higher clinical scores together with lower methylation in DUX4-PAS and/or DR1 regions only in FSHD1, FSHD14 and FSHD19 families, whereas the other cases were de novo or we did not have enough clinical information or family member to test such a correlation. In general, methylation data appeared to be mostly associated with affected/unaffected status in this study, rather than with FSHD severity as proposed in other studies (Lemmers et al., 2015; Erdmann et al., 2022).
Overall, the variability in methylation profiles and disease severity observed in the families described in this study, could depend on several factors and patients’ characteristics, including 4q configuration (D4Z4 size, 4q genotype); the variable penetrance of DRA; age; other epigenetic modifications (such as X chromosome inactivation for the FSHD6 family harboring the variant in SUV39H1) and still unknown factors that altogether could contribute to the disease severity and clinical variability.
Nevertheless, the assessment of methylation status within the families proved to be a valuable tool not only for discriminating affected subjects, but also for highlighting possible preclinical/asymptomatic conditions among members of the same family who may benefit from a clinical monitoring over time. Although the variants in candidate genes did not show a clear correlation with D4Z4 size, methylation levels and clinical signs in the investigated patients (FSHD1C, FSHD2C, FSHD3B, FSHD3C, FSHD4C, FSHD6C), this study adds knowledge concerning the possible role of genetic features in the modulation of disease phenotype.
In conclusion, our results further support that FSHD is a complex disease, in which the presence of variations in several known (SMCHD1, DNMT3B, LRIF1) and possibly other genes (CTCF, DNMT1, DNMT3A, EZH2, SUV39H1) could influence the phenotype, penetrance and severity of disease among patients as well as within the same family. Our results further emphasize the importance of extending the analysis of molecular findings within the proband’s family, with the purpose of providing a broader framework for understanding single cases and allow more accurate genotype-phenotype correlations in FSHD-affected families.
Data availability statement
The datasets presented in this article are not readily available because data obtained from whole exome sequencing are sensitive and. our ethics committee does not authorize the sharing of these data. Requests to access the datasets should be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by Ethics Committee of Santa Lucia Foundation IRCCS, via Ardeatina 306, 00179, Rome. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin. No potentially identifiable images or data are presented in this study.
Author contributions
Conceptualization: CS, VC, RC, EG, Methodology: CS, VC Formal analysis: CS, VC, DM, GT., LC Investigation: CS, VC, SB, ET, DM, GT, MM, LC Resources: CS, SB, ET, CC, ER, GT, EG, Writing-Original Draft: CS Writing-Review and Editing: CS, VC, MM, ER, GT, RC, EG, Supervision: CS, EG, Project administration: CS, EG, Funding acquisition. CS, CC, EG, All authors contributed to the article and approved the submitted version.
Funding
This research was supported by FSHD Society Research Grant #Winter 2021-0992658837 (COD:FSHD) to CS and by Ministry of Health (Ricerca Corrente) to CC.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2023.1235589/full#supplementary-material
Supplementary Table S1 | List of subjects included in the study. In the First Sheet (“Patient’s cohort”), demographic (sex, anagraphic age), D4Z4 size and methylation data of patients with clinical signs of FSHD have been reported. The column reporting the D4Z4 size shows the shortest allele compatible with the disease, although patients carrying an additional permissive (i.e., 4qA) allele <20RU has also been reported. The Second sheet (“reference group”) displays demographic data for the individuals employed as reference population.
Supplementary Table S2 | List of genes investigated by analysis of WES data.
Supplementary Table S3 | ACMG classification of identified variants in known and candidate genes for FSHD.
References
Banerji, C. R. S., Greco, A., Joosten, L. A. B., van Engelen, B., and Zammit, P. S. (2022). A circulating biomarker of facioscapulohumeral muscular dystrophy clinical severity, valid in skeletal muscle and blood. bioRxiv.
Banerji, C. R. S., and Zammit, P. S. (2019). PAX7 target gene repression is a superior FSHD biomarker than DUX4 target gene activation, associating with pathological severity and identifying FSHD at the single-cell level. Hum. Mol. Genet. 28, 2224–2236. doi:10.1093/hmg/ddz043
Barp, A., Mosca, L., and Sansone, V. A. (2021). Facilitations and hurdles of genetic testing in neuromuscular disorders. Diagn. (Basel) 11, 701. doi:10.3390/diagnostics11040701
Calandra, P., Cascino, I., Lemmers, R. J., Galluzzi, G., Teveroni, E., Monforte, M., et al. (2016). Allele-specific DNA hypomethylation characterises FSHD1 and FSHD2. J. Med. Genet. 53, 348–355. doi:10.1136/jmedgenet-2015-103436
Caputo, V., Megalizzi, D., Fabrizio, C., Termine, A., Colantoni, L., Bax, C., et al. (2022b). D4Z4 methylation levels combined with a machine learning pipeline highlight single CpG sites as discriminating biomarkers for FSHD patients. Cells 11, 4114. doi:10.3390/cells11244114
Caputo, V., Megalizzi, D., Fabrizio, C., Termine, A., Colantoni, L., Caltagirone, C., et al. (2022a). Update on the molecular aspects and methods underlying the complex architecture of FSHD. Cells 11, 2687. doi:10.3390/cells11172687
Cascella, R., Strafella, C., Caputo, V., Galota, R. M., Errichiello, V., Scutifero, M., et al. (2018). Digenic inheritance of shortened repeat units of the D4Z4 region and a loss-of-function variant in SMCHD1 in a family with FSHD. Front. Neurol. 9, 1027. doi:10.3389/fneur.2018.01027
Cohen, J., DeSimone, A., Lek, M., and Lek, A. (2021). Therapeutic approaches in facioscapulohumeral muscular dystrophy. Trends Mol. Med. 27, 123–137. doi:10.1016/j.molmed.2020.09.008
Corasolla Carregari, V., Monforte, M., Di Maio, G., Pieroni, L., Urbani, A., Ricci, E., et al. (2020). Proteomics of muscle microdialysates identifies potential circulating biomarkers in facioscapulohumeral muscular dystrophy. Int. J. Mol. Sci. 22, 290. doi:10.3390/ijms22010290
Deenen, J. C., Arnts, H., van der Maarel, S. M., Padberg, G. W., Verschuuren, J. J., Bakker, E., et al. (2014). Population-based incidence and prevalence of facioscapulohumeral dystrophy. Neurology 83, 1056–1059. doi:10.1212/WNL.0000000000000797
DeSimone, A. M., Pakula, A., Lek, A., and Emerson, C. P. (2017). Facioscapulohumeral muscular dystrophy. Compr. Physiol. 7, 1229–1279. doi:10.1002/cphy.c160039
Ellard, S., Baple, E. L., Callaway, A., Berry, I., Forrester, N., Turnbull, C., et al. (2020). ACGS best practice guidelines for variant classification in rare disease 2020. Avaialble at: https://www.acgs.uk.com/media/11631/uk-practice-guidelines-for-variant-classification-v4-01-2020.pdf.
Erdmann, H., Scharf, F., Gehling, S., Benet-Pagès, A., Jakubiczka, S., Becker, K., et al. (2022). Methylation of the 4q35 D4Z4 repeat defines disease status in facioscapulohumeral muscular dystrophy. Brain awac336 146, 1388–1402. doi:10.1093/brain/awac336
Gaillard, M. C., Roche, S., Dion, C., Tasmadjian, A., Bouget, G., Salort-Campana, E., et al. (2014). Differential DNA methylation of the D4Z4 repeat in patients with FSHD and asymptomatic carriers. Neurology 83, 733–742. doi:10.1212/WNL.0000000000000708
Giacomucci, G., Monforte, M., Diaz-Manera, J., Mul, K., Fernandez Torrón, R., Maggi, L., et al. (2020). Deep phenotyping of facioscapulohumeral muscular dystrophy type 2 by magnetic resonance imaging. Eur. J. Neurol. 27, 2604–2615. doi:10.1111/ene.14446
Gould, T., Jones, T. I., and Jones, P. L. (2021). Precise epigenetic analysis using targeted bisulfite genomic sequencing distinguishes FSHD1, FSHD2, and healthy subjects. Diagn. (Basel) 11, 1469. doi:10.3390/diagnostics11081469
Greco, A., Goossens, R., van Engelen, B., and van der Maarel, S. M. (2020). Consequences of epigenetic derepression in facioscapulohumeral muscular dystrophy. Clin. Genet. 97, 799–814. doi:10.1111/cge.13726
Hamanaka, K., Šikrová, D., Mitsuhashi, S., Masuda, H., Sekiguchi, Y., Sugiyama, A., et al. (2020). Homozygous nonsense variant in LRIF1 associated with facioscapulohumeral muscular dystrophy. Neurology 94, e2441–e2447. doi:10.1212/WNL.0000000000009617
Hartweck, L. M., Anderson, L. J., Lemmers, R. J., Dandapat, A., Toso, E. A., Dalton, J. C., et al. (2013). A focal domain of extreme demethylation within D4Z4 in FSHD2. Neurology 80, 392–399. doi:10.1212/WNL.0b013e31827f075c
Haynes, P., Bomsztyk, K., and Miller, D. G. (2018). Sporadic DUX4 expression in FSHD myocytes is associated with incomplete repression by the PRC2 complex and gain of H3K9 acetylation on the contracted D4Z4 allele. Epigenetics Chromatin 11, 47. doi:10.1186/s13072-018-0215-z
Himeda, C. L., and Jones, P. L. (2019). The genetics and epigenetics of facioscapulohumeral muscular dystrophy. Annu. Rev. Genomics Hum. Genet. 20, 265–291. doi:10.1146/annurev-genom-083118-014933
Himeda, C. L., Jones, T. I., and Jones, P. L. (2015). Facioscapulohumeral muscular dystrophy as a model for epigenetic regulation and disease. Antioxid. Redox Signal 22, 1463–1482. doi:10.1089/ars.2014.6090
Hiramuki, Y., Kure, Y., Saito, Y., Ogawa, M., Ishikawa, K., Mori-Yoshimura, M., et al. (2022). Simultaneous measurement of the size and methylation of chromosome 4qA-D4Z4 repeats in facioscapulohumeral muscular dystrophy by long-read sequencing. J. Transl. Med. 20, 517. doi:10.1186/s12967-022-03743-7
Huang, H., Zhu, Q., Jussila, A., Han, Y., Bintu, B., Kern, C., et al. (2021). CTCF mediates dosage- and sequence-context-dependent transcriptional insulation by forming local chromatin domains. Nat. Genet. 53, 1064–1074. doi:10.1038/s41588-021-00863-6
Huichalaf, C., Micheloni, S., Ferri, G., Caccia, R., and Gabellini, D. (2014). DNA methylation analysis of the macrosatellite repeat associated with FSHD muscular dystrophy at single nucleotide level. PLoS One 9, e115278. doi:10.1371/journal.pone.0115278
Ioannidis, N. M., Rothstein, J. H., Pejaver, V., Middha, S., McDonnell, S. K., Baheti, S., et al. (2016). Revel: an Ensemble method for predicting the pathogenicity of rare missense variants. Am. J. Hum. Genet. 99, 877–885. doi:10.1016/j.ajhg.2016.08.016
Ittisoponpisan, S., Islam, S. A., Khanna, T., Alhuzimi, E., David, A., and Sternberg, M. J. E. (2019). Can predicted protein 3D structures provide reliable insights into whether missense variants are disease associated? J. Mol. Biol. 431, 2197–2212. doi:10.1016/j.jmb.2019.04.009
Lamperti, C., Fabbri, G., Vercelli, L., D'Amico, R., Frusciante, R., Bonifazi, E., et al. (2010). A standardized clinical evaluation of patients affected by facioscapulohumeral muscular dystrophy: the FSHD clinical score. Muscle Nerve 42, 213–217. doi:10.1002/mus.21671
Laskowski, R. A., Stephenson, J. D., Sillitoe, I., Orengo, C. A., and Thornton, J. M. (2020). VarSite: disease variants and protein structure. Protein Sci. 29, 111–119. doi:10.1002/pro.3746
Lemmers, R. J., Goeman, J. J., van der Vliet, P. J., van Nieuwenhuizen, M. P., Balog, J., Vos-Versteeg, M., et al. (2015). Inter-individual differences in CpG methylation at D4Z4 correlate with clinical variability in FSHD1 and FSHD2. Hum. Mol. Genet. 24, 659–669. doi:10.1093/hmg/ddu486
Lemmers, R. J., Tawil, R., Petek, L. M., Balog, J., Block, G. J., Santen, G. W., et al. (2012). Digenic inheritance of an SMCHD1 mutation and an FSHD-permissive D4Z4 allele causes facioscapulohumeral muscular dystrophy type 2. Nat. Genet. 44 (12), 1370–1374. doi:10.1038/ng.2454
Lemmers, R. J., van der Vliet, P. J., van der Gaag, K. J., Zuniga, S., Frants, R. R., de Knijff, P., et al. (2010). Worldwide population analysis of the 4q and 10q subtelomeres identifies only four discrete interchromosomal sequence transfers in human evolution. Am. J. Hum. Genet. 86, 364–377. doi:10.1016/j.ajhg.2010.01.035
Lemmers, R. J. L. F., van der Stoep, N., Vliet, P. J. V., Moore, S. A., San Leon Granado, D., Johnson, K., et al. (2019). SMCHD1 mutation spectrum for facioscapulohumeral muscular dystrophy type 2 (FSHD2) and Bosma arhinia microphthalmia syndrome (BAMS) reveals disease-specific localisation of variants in the ATPase domain. J. Med. Genet. 56, 693–700. doi:10.1136/jmedgenet-2019-106168
Monforte, M., Bortolani, S., Torchia, E., Cristiano, L., Laschena, F., Tartaglione, T., et al. (2022). Diagnostic magnetic resonance imaging biomarkers for facioscapulohumeral muscular dystrophy identified by machine learning. J. Neurol. 269, 2055–2063. doi:10.1007/s00415-021-10786-1
Mostacciuolo, M. L., Pastorello, E., Vazza, G., Miorin, M., Angelini, C., Tomelleri, G., et al. (2009). Facioscapulohumeral muscular dystrophy: epidemiological and molecular study in a north-east Italian population sample. Clin. Genet. 75, 550–555. doi:10.1111/j.1399-0004.2009.01158.x
Mul, K., Voermans, N. C., Lemmers, R. J. L. F., Jonker, M. A., van der Vliet, P. J., Padberg, G. W., et al. (2018). Phenotype-genotype relations in facioscapulohumeral muscular dystrophy type 1. Clin. Genet. 94, 521–527. doi:10.1111/cge.13446
Neguembor, M. V., and Gabellini, D. (2010). In junk we trust: repetitive DNA, epigenetics and facioscapulohumeral muscular dystrophy. Epigenomics 2, 271–287. doi:10.2217/epi.10.8
Ottaviani, A., Rival-Gervier, S., Boussouar, A., Foerster, A. M., Rondier, D., Sacconi, S., et al. (2009). The D4Z4 macrosatellite repeat acts as a CTCF and A-type lamins-dependent insulator in facio-scapulo-humeral dystrophy. PLoS Genet. 5, e1000394. doi:10.1371/journal.pgen.1000394
Pejaver, V., Byrne, A. B., Feng, B. J., Pagel, K. A., Mooney, S. D., Karchin, R., et al. (2022). Calibration of computational tools for missense variant pathogenicity classification and ClinGen recommendations for PP3/BP4 criteria. Am. J. Hum. Genet. 109, 2163–2177. doi:10.1016/j.ajhg.2022.10.013
Preston, M. K., Tawil, R., and Wang, L. H. (2020). “Facioscapulohumeral muscular dystrophy,” in GeneReviews® [internet]. Editors M. P. Adam, D. B. Everman, G. M. Mirzaa, R. A. Pagon, S. E. Wallace, L. J. H. Beanet al. (Seattle (WA): University of Washington, Seattle), 1993–2022.
Ricci, E., Galluzzi, G., Deidda, G., Cacurri, S., Colantoni, L., Merico, B., et al. (1999). Progress in the molecular diagnosis of facioscapulohumeral muscular dystrophy and correlation between the number of KpnI repeats at the 4q35 locus and clinical phenotype. Ann. Neurol. 45, 751–757. doi:10.1002/1531-8249(199906)45:6<751::aid-ana9>3.0.co;2-m
Ricci, G., Mele, F., Govi, M., Ruggiero, L., Sera, F., Vercelli, L., et al. (2020). Large genotype-phenotype study in carriers of D4Z4 borderline alleles provides guidance for facioscapulohumeral muscular dystrophy diagnosis. Sci. Rep. 10, 21648. doi:10.1038/s41598-020-78578-7
Richards, S., Aziz, N., Bale, S., Bick, D., Das, S., Gastier-Foster, J., et al. (2015). Standards and guidelines for the interpretation of sequence variants: A joint consensus recommendation of the American college of medical genetics and genomics and the association for molecular pathology. Genet. Med. 17, 405–424. doi:10.1038/gim.2015.30
Roche, S., Dion, C., Broucqsault, N., Laberthonnière, C., Gaillard, M. C., Robin, J. D., et al. (2019). Methylation hotspots evidenced by deep sequencing in patients with facioscapulohumeral dystrophy and mosaicism. Neurol. Genet. 5, e372. doi:10.1212/NXG.0000000000000372
Sacconi, S., Briand-Suleau, A., Gros, M., Baudoin, C., Lemmers, R. J. L. F., Rondeau, S., et al. (2019). FSHD1 and FSHD2 form a disease continuum. Neurology 92, e2273–e2285. doi:10.1212/WNL.0000000000007456
Scionti, I., Greco, F., Ricci, G., Govi, M., Arashiro, P., Vercelli, L., et al. (2012). Large-scale population analysis challenges the current criteria for the molecular diagnosis of fascioscapulohumeral muscular dystrophy. Am. J. Hum. Genet. 90, 628–635. doi:10.1016/j.ajhg.2012.02.019
Strafella, C., Caputo, V., Galota, R. M., Campoli, G., Bax, C., Colantoni, L., et al. (2019). The variability of SMCHD1 gene in FSHD patients: evidence of new mutations. Hum. Mol. Genet. 28, 3912–3920. doi:10.1093/hmg/ddz239
Tasca, G., Monforte, M., Iannaccone, E., Laschena, F., Ottaviani, P., Leoncini, E., et al. (2014). Upper girdle imaging in facioscapulohumeral muscular dystrophy. PLoS One 9, e100292. doi:10.1371/journal.pone.0100292
Tasca, G., Monforte, M., Ottaviani, P., Pelliccioni, M., Frusciante, R., Laschena, F., et al. (2016). Magnetic resonance imaging in a large cohort of facioscapulohumeral muscular dystrophy patients: pattern refinement and implications for clinical trials. Ann. Neurol. 79, 854–864. doi:10.1002/ana.24640
van den Boogaard, M. L., Lemmers, R. J. L. F., Balog, J., Wohlgemuth, M., Auranen, M., Mitsuhashi, S., et al. (2016). Mutations in DNMT3B modify epigenetic repression of the D4Z4 repeat and the penetrance of facioscapulohumeral dystrophy. Am. J. Hum. Genet. 98, 1020–1029. doi:10.1016/j.ajhg.2016.03.013
Wijmenga, C., Hewitt, J. E., Sandkuijl, L. A., Clark, L. N., Wright, T. J., Dauwerse, H. G., et al. (1992). Chromosome 4q DNA rearrangements associated with facioscapulohumeral muscular dystrophy. Nat. Genet. 2, 26–30. doi:10.1038/ng0992-26
Wong, C. J., Wang, L. H., Friedman, S. D., Shaw, D., Campbell, A. E., Budech, C. B., et al. (2020). Longitudinal measures of RNA expression and disease activity in FSHD muscle biopsies. Hum. Mol. Genet. 29, 1030–1043. doi:10.1093/hmg/ddaa031
Xu, D., Ma, R., Zhang, J., Liu, Z., Wu, B., Peng, J., et al. (2018). Dynamic nature of CTCF tandem 11 Zinc fingers in multivalent recognition of DNA as revealed by NMR spectroscopy. J. Phys. Chem. Lett. 9, 4020–4028. doi:10.1021/acs.jpclett.8b01440
Yin, M., Wang, J., Wang, M., Li, X., Zhang, M., Wu, Q., et al. (2017). Molecular mechanism of directional CTCF recognition of a diverse range of genomic sites. Cell. Res. 27, 1365–1377. doi:10.1038/cr.2017.131
Zampatti, S., Colantoni, L., Strafella, C., Galota, R. M., Caputo, V., Campoli, G., et al. (2019). Facioscapulohumeral muscular dystrophy (FSHD) molecular diagnosis: from traditional technology to the NGS era. Neurogenetics 20, 57–64. doi:10.1007/s10048-019-00575-4
Zeng, W., Chen, Y. Y., Newkirk, D. A., Wu, B., Balog, J., Kong, X., et al. (2014). Genetic and epigenetic characteristics of FSHD-associated 4q and 10q D4Z4 that are distinct from non-4q/10q D4Z4 homologs. Hum. Mutat. 35, 998–1010. doi:10.1002/humu.22593
Zeng, W., de Greef, J. C., Chen, Y. Y., Chien, R., Kong, X., Gregson, H. C., et al. (2009). Specific loss of histone H3 lysine 9 trimethylation and HP1gamma/cohesin binding at D4Z4 repeats is associated with facioscapulohumeral dystrophy (FSHD). PLoS Genet. 5, e1000559. doi:10.1371/journal.pgen.1000559
Keywords: FSHD, exome, D4Z4, genetics, muscular dystrophy
Citation: Strafella C, Caputo V, Bortolani S, Torchia E, Megalizzi D, Trastulli G, Monforte M, Colantoni L, Caltagirone C, Ricci E, Tasca G, Cascella R and Giardina E (2023) Whole exome sequencing highlights rare variants in CTCF, DNMT1, DNMT3A, EZH2 and SUV39H1 as associated with FSHD. Front. Genet. 14:1235589. doi: 10.3389/fgene.2023.1235589
Received: 06 June 2023; Accepted: 09 August 2023;
Published: 22 August 2023.
Edited by:
Jordi Pérez-Tur, Spanish National Research Council (CSIC), SpainReviewed by:
Frederique Magdinier, Institut National de la Santé et de la Recherche Médicale (INSERM), FranceEdoardo Malfatti, Hôpitaux Universitaires Henri Mondor, France
Copyright © 2023 Strafella, Caputo, Bortolani, Torchia, Megalizzi, Trastulli, Monforte, Colantoni, Caltagirone, Ricci, Tasca, Cascella and Giardina. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Emiliano Giardina, emiliano.giardina@uniroma2.it