- 1State Key Laboratory of Oral Diseases and National Clinical Research Center for Oral Diseases, West China Hospital of Stomatology, Sichuan University, Chengdu, China
- 2Department of Cleft Lip and Palate, West China Hospital of Stomatology, Sichuan University, Chengdu, China
- 3Division of Growth and Development and Section of Orthodontics, School of Dentistry, University of California, Los Angeles, Los Angeles, CA, United States
Rs560426 at 1p22 was proved to be associated with NSCL/P (non-syndromic cleft lip with or without the palate) in several populations, including Han Chinese population. Here, we conducted a deep sequencing around rs560426 to locate more susceptibility variants in this region. In total, 2,293 NSCL/P cases and 3,235 normal controls were recruited. After sequencing, association analysis was performed. Western blot, RT-qPCR, HE, immunofluorescence staining, and RNA sequencing were conducted for functional analyses of the selected variants. Association analysis indicated that rs77179923 was the only SNP associated with NSCLP specifically (p = 4.70E-04, OR = 1.84), and rs12071152 was uniquely associated with LCLO (p = 4.00E-04, OR = 1.30, 95%CI: 1.12–1.51). Moreover, de novo harmful rare variant NM_004815.3, NP_004806.3; c.1652G>C, p.R551T in ARHGAP29 resulted in a decreased expression level of ARHGAP29, which in turn affected NSCL/P-related biological processes; however, no overt cleft palate (CP) phenotype was observed. In conclusion, rs12071152 was a new susceptible variant, which is specifically associated with LCLO among the Han Chinese population. Allele A of it could increase the risk of having a cleft baby. Rs77179923 and rare variant NM_004815.3, NP_004806.3; c.1652G>C, p.R551T at 1p22 were both associated with NSCLP among the Han Chinese population. However, this missense variation contributes to no overt CP phenotype due to dosage insufficiency or compensation from other genes.
Introduction
Non-syndromic cleft lip with or without the palate (NSCL/P), one of the most common orofacial clefts, has an average prevalence of 1/1,000 live births worldwide, with a relatively high prevalence among Asians (Croen et al., 1998; Tolarova and Cervenka, 1998; Mossey and Modell, 2012). The affected kids usually suffer from a number of problems related to clefts, such as speech, hearing, and psychological disorders (Lewis et al., 2017). It is necessary for them to receive coordinated multidisciplinary care that lasts from the stage of infant to adulthood, which imposes a heavy financial burden on their families.
NSCL/P is a complex disorder, with genetic and environmental factors and their interplay involved (Dixon et al., 2011; Rahimov et al., 2012; Worley et al., 2018). However, genes play a dominant role (Grosen et al., 2010; Dixon et al., 2011; Rahimov et al., 2012; Baldacci et al., 2018). Thus, lots of studies have been designed to shed light on the susceptibility genes or loci for NSCL/P, among which genome-wide association studies (GWASs) have identified an unprecedented number of genetic variants associated with it, and to date, over 40 risk loci for NSCL/P have been identified (Leslie and Marazita, 2013; Lin-Shiao et al., 2019). However, those findings only account for about 20% estimated heritability of NSCL/P (Beaty et al., 2016; Lin-Shiao et al., 2019); the missing heritability is partially attributed to the strict significance threshold of GWAS, which leads to the failed detection of that single-nucleotide polymorphism (SNP) with modest effect (Manolio et al., 2009; Tam et al., 2019); in addition, those risk loci identified by GWAS are usually driven by associated genetic variants due to linkage disequilibrium (Altshuler et al., 2008; Dickson et al., 2010), thus making it difficult to pinpoint the casual variants. Based on this, high-depth sequencing targeted at those risk loci is a cost-effective method to identify variants with larger effect sizes that are missed by GWAS, and this would also facilitate the discernment of casual variants (Manolio et al., 2009; Sazonovs and Barrett, 2018).
1p22, which contains rs560426, was initially identified as one of the risk loci for NSCL/P because of the statistically significant association between rs560426 and NSCL/P via GWAS (Beaty et al., 2010). Our previous study indicated that rs560426 was significantly associated with NSCL/P among the Han Chinese population, which further conferred susceptibility to 1p22. Rs560426 is located in ABCA4 gene, which is surely excluded from the candidate susceptibility genes in 1p22 due to its expression restricted to the retina (Beaty et al., 2010). Leslie et al. (2012) identified several rare variants that were associated with NSCL/P in ARHGAP29, which is adjacent to ABCA4 and expressed in the developing face. Therefore, ARHGAP29 was highly suspected as a susceptibility gene of NSCL/P in 1p22. From then on, a surge of studies focused on 1p22, and plenty of rare variants in ARHGAP29 were identified in multiple ethnicities (Leslie et al., 2012; Butali et al., 2014; Chandrasekharan and Ramanathan, 2014; Letra et al., 2014; Gowans et al., 2016; Savastano et al., 2017).
In this study, we aim to conduct a deep screening targeting the 1p22 locus to fully dig into susceptibility SNPs or indels through bioinformatics, statistics analysis, and functional experiments, hoping to identify more susceptibility variants at this locus for NSCL/P among the Han Chinese population.
Materials and methods
Sample collection and ethics statement
In total, 159 NSCL/P cases were included in the deep sequencing phase of our study, whereas 542 controls’ WGS data with an average coverage of 39.89 was downloaded from the Novogene internal database (http://www.novogene.com/); 2,134 NSCL/P (1047 NSCLO and 1087 NSCLP) and 2,693 normal controls from West China Second University Hospital, Sichuan University, were recruited in the replication phase. Cases were collected between 2016 and 2018 from the Cleft Lip and Palate Surgery Department of West China Hospital of Stomatology, Sichuan University. All the participants were self-recognized as the Han Chinese and denied family history as well as other congenital diseases, therein, the phenotype of the patients was assessed by both physicians and geneticists. More details of samples are shown in Supplementary Table S1.
Our study abides by the STOBE (Strengthening the Reporting of Observational Studies in Epidemiology) guidelines and was approved by HEC (the Hospital Ethics Committee) of West China Hospital of Stomatology. All individuals voluntarily joined this study with informed consent (WCHSIRB-D-2016-012R1).
Targeted region deep sequencing
DNA was extracted from peripheral blood of each sample by the salting-out method. After quality control, 1.0 μg of each DNA sample was enriched by using Agilent SureSelectXT Custom kit. Then, sequencing was conducted on the Illumina Hiseq X Ten platform to get paired-end 150bp reads by Novogene (China). The sequenced region was selected around rs560426 (GRCh37/hg19, chr1:94,453,779 to 94,739,314) based on the LD structure in CHB/JPT HapMap project.
Bioinformatics analysis
After removing adapter-related reads, N-containing reads, and low-quality reads, the clean sequence data were mapped to the human genome GRCh37/hg19 by Burrows–Wheeler Aligner (BWA) software (Li and Durbin, 2009). Then, 943 single nucleotide variants (SNPs) and 390 insertion/deletions (In/Dels) were identified by the Sequence Alignment Map (SAM tools) (Li et al., 2009) and merged by VCF (variant call format) tools (version 0.1.13) (Danecek et al., 2011). Later, variants were annotated by ANNOVAR (version 201707) (Wang et al., 2010), followed by function prediction via SIFT (Ng and Henikoff, 2003), v1.3 CADD (Kircher et al., 2014), Polyphen-2 (http://genetics.bwh.harvard.edu/pph2/) (Adzhubei et al., 2013), and MutationTaster (http://www.mutationtaster.org/) (Schwarz et al., 2010).
Statistical analysis
In the discovery phase, variants were categorized as either common or rare. Variants with MAF (minor allele frequency) ≥1% were referred to as common variants (they were Single nucleotide polymorphisms or SNPs), and case–control association analysis was performed after excluding SNPs that deviated from Hardy–Weinberg equilibrium (HWE). Three rare variants selected by three conditions were enrolled into burden analysis calculated by the R package SKAT: ① MAF <1% in the CHB population (Beijing Han Chinese population) and CHS population (Southern Han Chinese population) from 1000 Genomes Project database and Novogene internal database; ② MAF <0.001 in the Genome Aggregation Database (GnomAD); ③ at least two prediction tools suggested its harmfulness (SIFT, v1.3 CADD, Polyphen-2, and MutationTaster).
In the replication phase, SNP genotyping data were retrieved from two GWASs we have ever participated in (Sun et al., 2015; Huang et al., 2019). PLINK software (version 1.9) was used to perform the HWE test, calculate MAF, and perform a case–control association analysis for each SNP (Purcell et al., 2007). The threshold of P-value is 0.05/99 = 5.05E-04.
Sanger sequencing
Three novel harmful rare variants, which were not reported in a public database, such as dbSNP (https://www.ncbi.nlm.nih.gov/SNP/) (Smigielski et al., 2000), 1000 Genome (https://www.internationalgenome.org/), ExAC (http://exac.broadinstitute.org) (version 0.3.1) (Lek et al., 2016), CADD (http://cadd.gs.washington.edu/snv) (Rentzsch et al., 2018), and HGMD (http://www.hgmd.org) (Stenson et al., 2009), were further validated in carriers and their parents by Sanger sequencing, and PCR primers for genomic sequence were designed using Primer 3 (https://bioinfo.ut.ee/primer3-0.4.0/) (Supplementary Table S2). Then, for amplification, a mixture of Taq polymerase enzyme, PCR primers, water, and DNA sample was prepared. The amplified DNA products were then sequenced using the ABI 3730 Sequencer and analyzed with Sequence Scanner v1.0.
Cell culture and transient transfection
HEK-293T cell line was cultured in Dulbecco’s modified Eagle medium (DMEM) with 10% fetal bovine serum (PAN Biotech, Germany) and 1% Penicillin–Streptomycin Solution (Gibco, Foster City, CA, United States).
Full-length cDNA of ARHGAP29 (NM_004815.3) was synthesized and sub-cloned into pcDNA3.1 plasmid, to which site-directed mutagenesis was applied and thus obtained pcDNA3.1-ARHGAP29R551T plasmid (GeneChem, China). Then, they were transfected into HEK-293T cells by using Lipofectamine 3000 (Invitrogen, Carlsbad, CA) according to the manufacturer’s instructions, respectively.
Construction of the Arhgap29R553T mutant mouse model
The homology analysis of the amino acid sequences of human and mouse ARHGAP29 revealed that the 553rd amino acid of mouse ARHGAP29 was identical to the 551st amino acid of humans. Therefore, the CRISPR/Cas9 system was used to engineer a single base substitution mutation from G to C at the 1658th nucleotide of the cDNA of the Arhgap29 gene in the C57BL/6J mouse, resulting in a change from arginine (R) to threonine (T) at the 553rd amino acid. This part of the experiment was conducted by Gempharmtech Biotechnology Company (China), from whom we acquired F1 heterozygous Arhgap29R553T/+ mice for the subsequent experiments.
Due to the limited number of F1 heterozygous Arhgap29R553T/+ mice, they were crossed to C57BL/6J wild-type mice to generate a sufficient number of heterozygous mice. After genotyping the offsprings, heterozygous Arhgap29R553T/+ mice were chosen to be maintained. To be specific, 1–2 mm tail tissue was cut off from each mouse, from which DNA was extracted and amplified by PCR (Forward primer: CCACCACTTCTGTGGTGTCCTTG, reverse primer: CTACCCATGTTCTGCCTGTTGAG), both of which were completed using One Step Mouse Genotyping Kit (Vazyme, China). Sanger sequencing was then performed on those PCR products to confirm the genotype of each mouse.
After that, heterozygous Arhgap29R553T/+ mice were crossed overnight, females were examined for the presence of a vaginal plug the next morning, and the day when the vaginal plug was observed was designated as embryonic day 0.5 (E0.5).
RNA extraction
RNA was extracted from each group of HEK-293T cells using RNA-easy Isolation Reagent (Vazyme, China) 48 h after transfection. At E13.5, RNA was extracted from the secondary palate of homozygous Arhgap29R553T/R553T and wild-type mice. A total of 500 ng RNA was undertaken reverse transcription PCR to form cDNA by Takara PrimeScript kit.
RNA sequencing
Using the BGISEQ-500 platform, RNA sequencing was performed on the cDNA library of Arhgap29R553T/R553T and wild-type mice (BGI, China). In each group, two biological replications were included. Using DEseq2 and the Gene Ontology (GO) database, differential gene expression analysis and annotation for the biological process of DEGs (differential expression genes) were conducted.
Quantitative real-time qPCR
RT-qPCR was performed by using Takara TB Green Premix ExTaq. GAPDH was chosen as a reference gene, and primers are shown in Supplementary Table S3. Results were analyzed using the 2−ΔΔCt method. Each of the three biological replications was accompanied by three technical replications. Statistical analysis was calculated by the unpaired two-tailed t-test in GraphPad Prism 8 software.
Western blot
Furthermore, 48 h after transfection, after discarding the culture medium and washing with PBS, 250 μl of lysate was added to each well (containing 10 dsμl of PMSF per 1000 μl of RIPA) (Beyotime, China), carefully pipetted, and placed on ice for 10 min. Then, the lysate was collected and centrifuged at 10,000g for 3 min, the supernatant was diluted with ×5 loading buffer (Beyotime, China) and boiled for 10 min.
Subsequently, protein samples were separated by electrophoresis in agarose gels and transferred onto PVDF membranes, which were then blocked by 5% milk for 1 h and incubated with rabbit anti-human Arhgap29 antibody (Novus Biologicals, United States) at 4°C overnight, followed by incubation with anti-rabbit antibody (Proteintech, China) at room temperature for 1 h. At last, proteins were visualized by ECL substrate (Epizyme, China).
Micro-CT scanning
Three homozygous Arhgap29R553T/R553T and wild-type mice were selected and their entire body bone tissues were scanned by Micro-CT. The X-ray tube voltage was set to 70 kV, and the current was 114 A. The reconstruction was performed with Mimics 21.0.
HE and immunofluorescence staining
Embryos from E13.5 to E15.5 were fixed overnight in 4% paraformaldehyde and then fixed in paraffin. Serial paraffin sections of 7 μm were collected and deparaffinized in xylene and rehydrated with a range of ethanol concentrations. For regular histology, hematoxylin and eosin were used to stain tissue sections. For immunofluorescence, after heat-induced antigen retrieval, samples were blocked for 1 h with 5% bovine serum albumin in phosphate-buffered saline. The rabbit anti-human Arhgap29 antibody (Novus Biologicals, United States) was incubated overnight at 4°C. Following a PBST wash, rabbit IgG Alexa 488 (Abcam, United States) was applied for 1 h at room temperature, followed by another wash. Images were captured after mounting samples with DAPI.
Results
Rs77179923 was specifically associated with NSCLP
By targeted region sequencing, we detected a total of 943 single nucleotide variations (SNVs) and 390 In/Dels. Of them, 656 SNVs were recognized as common variants and recruited into case–control association analysis, whereas 3 rare variants were enrolled in burden analysis (data did not show any significance).
In the discovery phase, 99 of the 656 SNVs in our targeted region were identified to be potential susceptibility variants of NSCL/P with a P-value less than 0.05 (Supplementary Figure S1 and Supplementary Table S4). Subsequently, all the 99 SNPs were replicated among 1,626 NSCL/P cases and 2,255 controls. According to the significance threshold after multiple corrections, the SNPs with a P-value less than 5.05E-04 are associated with the replication phase.
MAF and HWE (Supplementary Table S5) of the replicated SNPs were calculated, and those SNPs with MAF above 1% and P-value of HWE above 0.05 were recruited into the association analysis. Interestingly, we found that rs77179923 was specifically associated with NSCLP (p = 4.70E-04, OR = 1.84, 95%CI: 1.31–2.58), and its T allele was at risk for NSCLP, which indicated that the carries could have a higher risk to give birth a cleft baby. Rs12071152 was marginally associated with NSCLO (p = 9.40E-04, OR = 1.27, 95%CI:1.10–1.46). None of SNPs was identified to be associated with NSCL/P (Table1 and Figure 1A).
FIGURE 1. Radar chat for the replication of the association analysis. Log P value with base 10 were shown in the chat, while blue, red and green line indicate the result of rs77179923, rs12071152 and significance threshold respectively. Significance threshold in the replication phase is 5.05E-04, which is adjusted by multiple correction.
Rs12071152 was uniquely associated with LCLO
To further test if the 99 SNPs associated with sub-phenotypes of NSCLO, we divided NSCLO into BCLO (bilateral cleft lip only), UCLO (unilateral cleft lip only), RCLO (right cleft lip only), and LCLO (left cleft lip only). Intriguingly, we noticed that rs12071152 showed specific association with LCLO (p = 4.00E-04, OR = 1.30, 95%CI: 1.12–1.51); although the association between rs12071152 and NSCLO (p = 9.40E-04, OR = 1.27, 95%CI: 1.10–1.46) did not survive after multiple corrections, its association with BCLO (p = 0.002, OR = 1.28, 95%CI:1.10–1.49), RCLO (p = 0.005, OR = 1.24, 95%CI:1.07–1.45), and UCLO (p = 0.001, OR = 1.27, 95%CI:1.10–1.47) were all not reached the significance threshold of 5.05E-04 (Table 2; Figure 1B). Our data indicated that there existed genetic heterogeneity among BCLO, UCLO, RCLO, and LCLO.
De novo harmful rare variant ARHGAP29R551T was identified to be associated with NSCLP
Three harmful rare variants were identified to be novel (NM_000350.2: c.979C>T in ABCA4 and NM_004815.3: c.1652G>C and NM_004815.3: c.559G>A in ARHGAP29), which have not been reported in public databases such as 1000 Genome, Esp6500, ExAC, and GnomAD.
We validated all of them by Sanger sequencing on carriers and their parents, through which NM_000350.2: c.979C>T in ABCA4 and NM_004815.3: c.559G>A in ARHGAP29 were shown to be inherited from the parents of carriers, whereas NM_004815.3: c.1652G>C in ARHGAP29 was proved to be de novo, and it resulted in a missense mutation of 551 amino acids (p.R551T) of ARHGAP29 that is highly conserved across several species (Figure 2A). Then, NM_004815.3: c.1652G>C was further screened among 508 NSCLP cases and 438 normal controls, but it did not appear. Based on the conservation and harmfulness, we speculated that NM_004815.3: c.1652G>C in ARHGAP29, a de novo harmful rare variant, would be a risk factor for NSCLP.
FIGURE 2. (A) Sanger sequencing results of the de novo harmful rare variant in ARHGAP29. Sequence chromatograms indicate the heterozygous variant (NM_004815.3, NP_004806.3; c.1652G>C, p.R551T). The red letter and box emphasize the cross-species conservation of the altered amino acid. (B, C) Western blot and RT-qPCR analysis of the ARHGAP29 expression in HEK-293T cells 48 h after plasmid transfection. The results are presented as mean values with standard deviation (SD) normalized to GAPDH, and there were three biological replicates, ***p < 0.001.
ARHGAP29R551T results in a decreased expression level of ARHGAP29 in vitro
Expression of fluorescence demonstrated that both pcDNA3.1-ARHGAP29 and pcDNA3.1-ARHGAP29R551T were efficiently expressed in HEK-293T cells. Western Blot revealed that the expression levels of ARHGAP29 in homozygous Arhgap29R553T/R553T and wild-type group were comparable (Figure 2B). However, compared to the wild-type group, RT-qPCR revealed that homozygous Arhgap29R553T/R553T led to the lower mRNA expression level of ARHGAP29 (Figure 2C).
Additionally, we examined its effect in vivo. At E18.5, there were no significant differences in body length, craniofacial morphology, or bone growth between Arhgap29R553T/R553T and wild-type mice embryos, and no overt cleft palate phenotype was observed (Figure 3A). From E13.5 to E15.5, HE images of coronal sections showed normal elevation and fusion of palate shelves in both Arhgap29R553T/R553T and wild-type mice embryos (Figure 3B). Furthermore, ARHGAP29 was expressed similarly in the palatal epithelium of Arhgap29R553T/R553T and wild-type mice embryos (Figure 3C).
FIGURE 3. (A). Macroscopic and palatal phenotypes of E18.5 embryos; (B,C) HE and immunofluorescence staining of palatal coronal sections of E13.5–E15.5 embryos.
ARHGAP29R551T affects NSCL/P-related biological processes
Even though considering the decreased expression mRNA level of ARHGAP29 in vitro, we decided to further explore the influence of Arhgap29R553T/R553T on the transcriptome in vivo by RNA sequencing on the secondary palate tissue of E13.5 homozygous Arhgap29R553T/R553T and wild-type mice embryos. As predicted, the expression level of the Arhgap29 gene transcript NM_172525.2, which is identical to the transcript, where the de novo harmful rare variant NM_004815.3: c.1652G>C located at the human genome, was also significantly downregulated when compared to the expression level in wild-type mice.
In addition, decreased Arhgap29 led to significant changes in 174 genes, 121 of which were upregulated and 53 of which were downregulated (Figure 4A). The conditions for differential gene expression analysis include FPKM (wild-type)> 1, |log2|≥ 0.8, and p < 0.05.
FIGURE 4. Results of RNA sequencing on Arhgap29R553T/R553T and wild-type mice. (A) Volcanic maps of differential expression genes. (B) GO analysis of DEGs. All the shown GO terms were significantly enriched with Q-value less than 0.05.
Gene Ontology (GO) analysis for DEGs revealed that 15 biological processes were significantly enriched in upregulated genes, of which “epithelial cell differentiation” was the most relevant term to NSCL/P. In addition, most downregulated genes were significantly enriched in biological processes related to transcription, such as “regulation of transcription, DNA-templated” and “negative regulation of transcription by RNA polymerase II” (Figure 4B).
Discussion
Since GWAS indicated that 1p22 was associated with NSCL/P (Beaty et al., 2010), a large number of common and rare variants have been identified in this region (Leslie et al., 2012; Butali et al., 2014; Chandrasekharan and Ramanathan, 2014; Letra et al., 2014; Gowans et al., 2016; Savastano et al., 2017). In this study, we aim to identify additional susceptibility variants for NSCL/P in the 1p22 region among the Han Chinese population using deep sequencing.
For common variants, we performed an initial association analysis and additional replications to investigate their associations. We found that rs77179923 was specifically associated with NSCLP (p = 4.70E-04, OR = 1.84, 95%CI: 1.31–2.58) (Table 1); rs77179923 is located in the introns of ABCA4 gene, it was once reported to be significantly associated with NSCL/P among Asian trios by Leslie et al. (2015), but subsequent research indicated that it may not be functional (Liu et al., 2017a). In addition, rs12071152 was marginally associated with NSCLO (p = 9.40E-04, OR = 1.27, 95%CI: 1.10–1.46) (Table 1), and we first identified its unique association with LCLO (p = 4.00E-04, OR = 1.30, 95%CI: 1.12–1.51) (Table 2); since it is located in the intergenic non-coding region, we used HaploReg (Version v4.1) and RegulomeDB (Version 2.0.3) to annotate rs12071152, and its A allele was observed to have altered seven motifs and showed four eQTL signals (Supplementary Tables S6, S7, and Supplementary Figure S2); these results suggested that rs12071152 is a regulatory SNP, it may function by affecting the expression of ARHGAP29 in the etiology of LCLO, whereas further validation is required.
So far, we have identified two SNPs that are specifically associated with NSCLP or LCLO. However, none were associated with the other sub-phenotypes of NSCLO, indicating genetic heterogeneity among sub-phenotypes of NSCLO. In fact, numerous studies support this viewpoint. Carlson et al. (2017) found that different regions on chromosome 13 were specifically associated with UCLO and BCLO among Asian populations; our previous study also indicated that rs1345186 in the TP63 gene was significantly associated with RCLO rather than LCLO and BCLO among the Han Chinese population (Yin et al., 2021). Moreover, these results remind us that the risk variants are likely to be masked by other signals when the association analysis was performed between controls and cases containing several subtypes. Consequently, a more detailed classification of the phenotype will be required in future genetic research to unearth more susceptibility variants.
Rare variants, with MAF less than 0.05% (Wagner, 2013), have a higher contribution to complex traits, meaning they confer a larger effect size than common variants; a portion of the common variants that show significant association with diseases are likely to be driven by rare variant (Bodmer and Bonilla, 2008; Nelson et al., 2012; Tennessen et al., 2012; Tada et al., 2016). So far, it has been reported that rare variants of ARHGAP29 play crucial roles in the etiology of NSCL/P (Chandrasekharan and Ramanathan, 2014; Liu et al., 2017b; Paul et al., 2017; Savastano et al., 2017). Liu et al. (2017b) once identified a rare variant (NM_004815.3, NP_004806.3; c.1654T>C, p.Ser552Pro) adjacent to ours in a European CPO-defected family. This mutation decreased the stability of ARHGAP29, which inhibits cell migration in immortalized human keratinocytes (iNHKs); the mutant zebrafish failed to delay epiboly, and it was thus suggested to be a loss-of-function variant (Liu et al., 2017b). In addition, Paul et al. (2017) identified a point mutation p.K326X at ARHGAP29 in the NSCL/P case, and the heterozygous Arhgap29K326X mutant mouse had abnormal adhesion prior to the formation of the palate.
Here, we identified a heterozygous missense variant NM_004815.3, NP_004806.3; c.1652G>C, p.R551T (ARHGAP29) that was de novo, highly conserved across species; it is predicted to be harmful by all in silico tools, and it is a pathogenic variant by the ACMG guideline (PS2, PS3, and PM2) (Richards et al., 2015). Based on this evidence and its adjacent missense mutation p.Ser552Pro functions as a loss-of-function variant (Liu et al., 2017b), we constructed a mouse model harboring Arhgap29R553T, which is identical to ARHGAP29R551T, to clarify its function. However, neither overt cleft palate phenotype nor abnormal palate shelves elevation or fusion was observed (Figure 3A, B). We inferred that this attributed to the insufficient dose effect of Arhgap29R553T/R553T, because Western blot and immunofluorescence assay demonstrated that both ARHGAP29R551T and Arhgap29R553T did not affect the expression of ARHGAP29 protein (Figure 2B and Figure 3C). In addition, NSCL/P is a polygenic disease involving multiple genes; thus, the effect of ARHGAP29R551T may be compensated by other genes or its functions in the etiology of NSCL/P by interacting with other genes in the signaling cascade of craniofacial embryology. Even though we cannot ignore the change of mRNA expression level of ARHGAP29 result from ARHGAP29R551T, this mouse model is valuable for identifying covariates of Arhgap29R553T/R553T at the transcriptome level, as well as discovering more NSCL/P-associated biological processes that ARHGAP29 might participate in (Paul et al., 2017). In this study, we found that the genes affected by Arhgap29R553T/R553T were enriched in the biological processes of epithelial cell differentiation and transcriptional regulation that may be related to NSCL/P, but whether they are truly involved in the occurrence of NSCL/P needs further in-depth research.
In conclusion, via targeted sequencing on 1p22 among the Han Chinese population, we found that rs77179923 was specifically associated with NSCLP; rs12071152 was significantly and specifically associated with LCLO. In addition, de novo harmful rare variants NM_004815.3, NP_004806.3; c.1652G>C, p.R551T (ARHGAP29), which decreased ARHGAP29 expression, were identified to be a risk factor for NSCLP. We generated a mouse model harboring variant identical to the de novo harmful variants; although no overt phenotype was observed, several susceptibility NSCL/P-related biological processes that are affected by Arhgap29R553T/R553T were observed after RNA-sequencing of the E13.5 secondary palate; however, the mechanism requires further investigation.
Data availability statement
All the datasets generated in this article were shown in the main text and the Supplementary Material. Any questions about the data, please contact to zhonglinjia@sina.com.
Ethics statement
The studies involving human participants were reviewed and approved by the HEC (the Hospital Ethics Committee) of West China Hospital of Stomatology. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin. The animal study was reviewed and approved by HEC (the Hospital Ethics Committee) of West China Hospital of Stomatology.
Author contributions
M-JL contributed to data acquisition, analysis, and interpretation and drafted and critically revised the manuscript. J-YS and B-HZ contributed to data collection from the database and analysis. Q-MC and BS conceived and reviewed the manuscript. Z-LJ contributed to the conception, design, data acquisition, analysis, and interpretation, and critically revised the manuscript. All of them gave final approval and agreed to be accountable for all aspects of the work.
Funding
This work was supported by the National Science Funds of China (No. 82170919 and No. 81600849) and the Major frontier issues of the application foundation project of Sichuan Science and Technology Department (2020YJ0211).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.947126/full#supplementary-material
References
Adzhubei, I., Jordan, D. M., and Sunyaev, S. R. (2013). Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 7, Unit7.20. doi:10.1002/0471142905.hg0720s76
Altshuler, D., Daly, M. J., and Lander, E. S. (2008). Genetic mapping in human disease. Science 322, 881–888. doi:10.1126/science.1156409
Baldacci, S., Gorini, F., Santoro, M., Pierini, A., Minichilli, F., and Bianchi, F. (2018). Environmental and individual exposure and the risk of congenital anomalies: a review of recent epidemiological evidence. Epidemiol. Prev. 42, 1–34. doi:10.19191/ep18.3-4.S1.P001.057
Beaty, T. H., Marazita, M. L., and Leslie, E. J. (2016). Genetic factors influencing risk to orofacial clefts: today's challenges and tomorrow's opportunities. F1000Res 5, 2800. doi:10.12688/f1000research.9503.1
Beaty, T. H., Murray, J. C., Marazita, M. L., Munger, R. G., Ruczinski, I., Hetmanski, J. B., et al. (2010). A genome-wide association study of cleft lip with and without cleft palate identifies risk variants near MAFB and ABCA4. Nat. Genet. 42, 525–529. doi:10.1038/ng.580
Bodmer, W., and Bonilla, C. (2008). Common and rare variants in multifactorial susceptibility to common diseases. Nat. Genet. 40, 695–701. doi:10.1038/ng.f.136
Butali, A., Mossey, P., Adeyemo, W., Eshete, M., Gaines, L., Braimah, R., et al. (2014). Rare functional variants in genome-wide association identified candidate genes for nonsyndromic clefts in the African population. Am. J. Med. Genet. A 164A, 2567–2571. doi:10.1002/ajmg.a.36691
Carlson, J. C., Taub, M. A., Feingold, E., Beaty, T. H., Murray, J. C., Marazita, M. L., et al. (2017). Identifying genetic sources of phenotypic heterogeneity in orofacial clefts by targeted sequencing. Birth Defects Res. 109, 1030–1038. doi:10.1002/bdr2.23605
Chandrasekharan, D., and Ramanathan, A. (2014). Identification of a novel heterozygous truncation mutation in exon 1 of ARHGAP29 in an Indian subject with nonsyndromic cleft lip with cleft palate. Eur. J. Dent. 8, 528–532. doi:10.4103/1305-7456.143637
Croen, L. A., Shaw, G. M., Wasserman, C. R., and Tolarova, M. M. (1998). Racial and ethnic variations in the prevalence of orofacial clefts in California, 1983-1992. Am. J. Med. Genet. 79, 42–47. doi:10.1002/(sici)1096-8628(19980827)79:1<42::aid-ajmg11>3.0.co;2-m
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., et al. (2011). The variant call format and VCFtools. Bioinformatics 27, 2156–2158. doi:10.1093/bioinformatics/btr330
Dickson, S. P., Wang, K., Krantz, I., Hakonarson, H., and Goldstein, D. B. (2010). Rare variants create synthetic genome-wide associations. PLoS Biol. 8, e1000294. doi:10.1371/journal.pbio.1000294
Dixon, M. J., Marazita, M. L., Beaty, T. H., and Murray, J. C. (2011). Cleft lip and palate: understanding genetic and environmental influences. Nat. Rev. Genet. 12, 167–178. doi:10.1038/nrg2933
Gowans, L. J., Adeyemo, W. L., Eshete, M., Mossey, P. A., Busch, T., Aregbesola, B., et al. (2016). Association studies and direct DNA sequencing implicate genetic susceptibility loci in the etiology of nonsyndromic orofacial clefts in sub-saharan african populations. J. Dent. Res. 95, 1245–1256. doi:10.1177/0022034516657003
Grosen, D., Chevrier, C., Skytthe, A., Bille, C., Mølsted, K., Sivertsen, A., et al. (2010). A cohort study of recurrence patterns among more than 54, 000 relatives of oral cleft cases in denmark: support for the multifactorial threshold model of inheritance. J. Med. Genet. 47, 162–168. doi:10.1136/jmg.2009.069385
Huang, L., Jia, Z., Shi, Y., Du, Q., Shi, J., Wang, Z., et al. (2019). Genetic factors define CPO and CLO subtypes of nonsyndromicorofacial cleft. PLoS Genet. 15, e1008357. doi:10.1371/journal.pgen.1008357
Kircher, M., Witten, D. M., Jain, P., O'Roak, B. J., Cooper, G. M., Shendure, J., et al. (2014). A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315. doi:10.1038/ng.2892
Lek, M., Karczewski, K. J., Minikel, E. V., Samocha, K. E., Banks, E., Fennell, T., et al. (2016). Analysis of protein-coding genetic variation in 60, 706 humans. Nature 536, 285–291. doi:10.1038/nature19057
Leslie, E. J., Mansilla, M. A., Biggs, L. C., Schuette, K., Bullard, S., Cooper, M., et al. (2012). Expression and mutation analyses implicate ARHGAP29 as the etiologic gene for the cleft lip with or without cleft palate locus identified by genome-wide association on chromosome 1p22. Birth Defects Res. A Clin. Mol. Teratol. 94, 934–942. doi:10.1002/bdra.23076
Leslie, E. J., and Marazita, M. L. (2013). Genetics of cleft lip and cleft palate. Am. J. Med. Genet. C Semin. Med. Genet. 163C, 246–258. doi:10.1002/ajmg.c.31381
Leslie, E. J., Taub, M. A., Liu, H., Steinberg, K. M., Koboldt, D. C., Zhang, Q., et al. (2015). Identification of functional variants for cleft lip with or without cleft palate in or near PAX7, FGFR2, and NOG by targeted sequencing of GWAS loci. Am. J. Hum. Genet. 96, 397–411. doi:10.1016/j.ajhg.2015.01.004
Letra, A., Maili, L., Mulliken, J. B., Buchanan, E., Blanton, S. H., Hecht, J. T., et al. (2014). Further evidence suggesting a role for variation in ARHGAP29 variants in nonsyndromic cleft lip/palate. Birth Defects Res. A Clin. Mol. Teratol. 100, 679–685. doi:10.1002/bdra.23286
Lewis, C. W., Jacob, L. S., and Lehmann, C. U. (2017). The primary care pediatrician and the care of children with cleft lip and/or cleft palate. Pediatrics 139, e20170628. doi:10.1542/peds.2017-0628
Li, H., and Durbin, R. (2009). Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics 25, 1754–1760. doi:10.1093/bioinformatics/btp324
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. doi:10.1093/bioinformatics/btp352
Lin-Shiao, E., Lan, Y., Welzenbach, J., Alexander, K. A., Zhang, Z., Knapp, M., et al. (2019). p63 establishes epithelial enhancers at critical craniofacial development genes. Sci. Adv. 5, eaaw0946. doi:10.1126/sciadv.aaw0946
Liu, H., Busch, T., Eliason, S., Anand, D., Bullard, S., Gowans, L. J. J., et al. (2017b). Exome sequencing provides additional evidence for the involvement of ARHGAP29 in Mendelian orofacial clefting and extends the phenotypic spectrum to isolated cleft palate. Birth Defects Res. 109, 27–37. doi:10.1002/bdra.23596
Liu, H., Leslie, E. J., Carlson, J. C., Beaty, T. H., Marazita, M. L., Lidral, A. C., et al. (2017a). Identification of common non-coding variants at 1p22 that are functional for non-syndromic orofacial clefting. Nat. Commun. 8, 14759. doi:10.1038/ncomms14759
Manolio, T. A., Collins, F. S., Cox, N. J., Goldstein, D. B., Hindorff, L. A., Hunter, D. J., et al. (2009). Finding the missing heritability of complex diseases. Nature 461, 747–753. doi:10.1038/nature08494
Mossey, P. A., and Modell, B. (2012). Epidemiology of oral clefts 2012: an international perspective. Front. Oral Biol. 16, 1–18. doi:10.1159/000337464
Nelson, M. R., Wegmann, D., Ehm, M. G., Kessner, D., St Jean, P., Verzilli, C., et al. (2012). An abundance of rare functional variants in 202 drug target genes sequenced in 14, 002 people. Science 337, 100–104. doi:10.1126/science.1217876
Ng, P. C., and Henikoff, S. (2003). SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 31, 3812–3814. doi:10.1093/nar/gkg509
Paul, B. J., Palmer, K., Sharp, J. C., Pratt, C. H., Murray, S. A., Dunnwald, M., et al. (2017). ARHGAP29 mutation is associated with abnormal oral epithelial adhesions. J. Dent. Res. 96, 1298–1305. doi:10.1177/0022034517726079
Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M. A., Bender, D., et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575. doi:10.1086/519795
Rahimov, F., Jugessur, A., and Murray, J. C. (2012). Genetics of nonsyndromic orofacial clefts. Cleft Palate. Craniofac. J. 49, 73–91. doi:10.1597/10-178
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J., and Kircher, M. (2018). CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47, D886–D894. doi:10.1093/nar/gky1016
Richards, S., Aziz, N., Bale, S., Bick, D., Das, S., Gastier-Foster, J., et al. (2015). Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American college of medical genetics and genomics and the association for molecular pathology. Genet. Med. 17, 405–424. doi:10.1038/gim.2015.30
Savastano, C. P., Brito, L. A., Faria Á, C., Setó-Salvia, N., Peskett, E., Musso, C. M., et al. (2017). Impact of rare variants in ARHGAP29 to the etiology of oral clefts: role of loss-of-function vs missense variants. Clin. Genet. 91, 683–689. doi:10.1111/cge.12823
Sazonovs, A., and Barrett, J. C. (2018). Rare-variant studies to complement genome-wide association studies. Annu. Rev. Genomics Hum. Genet. 19, 97–112. doi:10.1146/annurev-genom-083117-021641
Schwarz, J. M., Rödelsperger, C., Schuelke, M., and Seelow, D. (2010). MutationTaster evaluates disease-causing potential of sequence alterations. Nat. Methods 7, 575–576. doi:10.1038/nmeth0810-575
Smigielski, E. M., Sirotkin, K., Ward, M., and Sherry, S. T. (2000). dbSNP: a database of single nucleotide polymorphisms. Nucleic Acids Res. 28, 352–355. doi:10.1093/nar/28.1.352
Stenson, P. D., Mort, M., Ball, E. V., Howells, K., Phillips, A. D., Thomas, N. S., et al. (2009). The human gene mutation database: 2008 update. Genome Med. 1, 13. doi:10.1186/gm13
Sun, Y., Huang, Y., Yin, A., Pan, Y., Wang, Y., Wang, C., et al. (2015). Genome-wide association study identifies a new susceptibility locus for cleft lip with or without a cleft palate. Nat. Commun. 6, 6414. doi:10.1038/ncomms7414
Tada, H., Kawashiri, M. A., Konno, T., Yamagishi, M., and Hayashi, K. (2016). Common and rare variant association study for plasma lipids and coronary artery disease. J. Atheroscler. Thromb. 23, 241–256. doi:10.5551/jat.31393
Tam, V., Patel, N., Turcotte, M., Bossé, Y., Paré, G., Meyre, D., et al. (2019). Benefits and limitations of genome-wide association studies. Nat. Rev. Genet. 20, 467–484. doi:10.1038/s41576-019-0127-1
Tennessen, J. A., Bigham, A. W., O'Connor, T. D., Fu, W., Kenny, E. E., Gravel, S., et al. (2012). Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 337, 64–69. doi:10.1126/science.1219240
Tolarova, M. M., and Cervenka, J. (1998). Classification and birth prevalence of orofacial clefts. Am. J. Med. Genet. 75, 126–137. doi:10.1002/(sici)1096-8628(19980113)75:2<126::aid-ajmg2>3.0.co;2-r
Wagner, M. J. (2013). Rare-variant genome-wide association studies: a new frontier in genetic analysis of complex traits. Pharmacogenomics 14, 413–424. doi:10.2217/pgs.13.36
Wang, K., Li, M., and Hakonarson, H. (2010). ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164. doi:10.1093/nar/gkq603
Worley, M. L., Patel, K. G., and Kilpatrick, L. A. (2018). Cleft lip and palate. Clin. Perinatol. 45, 661–678. doi:10.1016/j.clp.2018.07.006
Keywords: 1p22, targeted re-sequencing, association analysis, LCLO, RNA sequencing
Citation: Li M-J, Shi J-Y, Zhang B-H, Chen Q-M, Shi B and Jia Z-L (2022) Targeted re-sequencing on 1p22 among non-syndromic orofacial clefts from Han Chinese population. Front. Genet. 13:947126. doi: 10.3389/fgene.2022.947126
Received: 18 May 2022; Accepted: 08 July 2022;
Published: 17 August 2022.
Edited by:
Babak Behnam, National Sanitation Foundation International, United StatesReviewed by:
Lucimara Neves, University of São Paulo, BrazilYongchu Pan, Nanjing Medical University, China
Copyright © 2022 Li, Shi, Zhang, Chen, Shi and Jia. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhong-Lin Jia, zhonglinjia@sina.com