HLA Variants and Inhibitor Development in Hemophilia A: A Retrospective Case-Controlled Study Using the ATHNdataset

McGill, Joseph R.; Simhadri, Vijaya L.; Sauna, Zuben E.

doi:10.3389/fmed.2021.663396

ORIGINAL RESEARCH article

Front. Med., 07 May 2021

Sec. Hematology

Volume 8 - 2021 | https://doi.org/10.3389/fmed.2021.663396


Joseph R. McGill

Vijaya L. Simhadri

Zuben E. Sauna*

Hemostasis Branch, Division of Plasma Protein Therapeutics, Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, MD, United States

In hemophilia A (HA) patients, F8 gene-defects as genetic risk-factors for developing inhibitors to Factor VIII have been extensively studied. Here we provide estimates of inhibitor-risk associated with the patient's Human Leukocyte Antigen (HLA). We used next generation sequencing for high-resolution HLA Class II typing of 997 HA patients. Using inhibitor prevalence reports from the My Life Our Future (MLOF) research repository, we calculated Odds Ratios (OR) for inhibitor development in a multivariate model considering HLA-DRB1/3/4/5, HLA-DPB1, HLA-DQB1, race, F8 pathogenic variant type, and age. Participants with 1 HLA variant (DPB1*02:02) had developed inhibitors at a higher rate while participants with 2 HLA variants (DRB1*04:07; DRB1*11:04) had developed inhibitors at a lower rate. Additionally, patients with missense variants had developed inhibitors at a lower rate and participants with large structural changes (>50 bp) had developed inhibitors at a higher rate (both compared to Intron 22 inversion). Using a cohort of participants with a distribution of HLA-DRB1 alleles comparable to that in the North American population we show that the HLA repertoire of a HA patient can be a risk-factor for inhibitor development.

1. Introduction

An unmet need in the management of hemophilia A (HA) is the lack of clinically validated markers associated with the development of inhibitors, i.e., neutralizing antibodies to Factor VIII (FVIII). Approximately 20% of HA patients and 30% of severe HA patients develop inhibitors, which represent an impediment to the effective management of HA (1, 2). The availability of markers for immunogenicity would prove useful for more efficient clinical care and personalization of the treatment of HA patients. Inhibitors are also the key safety concern during drug development and licensure; the absence of non-clinical markers means that immunogenicity assessments can only be made as part of phase three studies, the most expensive phase of drug development.

There is broad recognition that genetic factors play a role in determining which patients develop inhibitors and which do not (3–8). However, identifying the genetic markers of immunogenicity is challenging. For instance, there is evidence that CD4+ T-cell response is essential for eliciting inhibitors (9). It is a reasonable assumption that presentation of the peptides by the MHC-class-II (MHC-II) molecules [human leukocyte antigens (HLA), in humans] is a necessary step in the immune cascade that results in inhibitory antibodies. There have been several studies to identify HLA variants potentially associated with inhibitors; however, no consistent correlates were found between studies (10–14). These studies were all performed with small sample sizes ranging from 57 to 176 participants. These sample sizes are inadequate for making meaningful, statistically powered assessments, considering that the MHC region, containing 164 HLA genes, is the most polymorphic in the human genome, with over 11,000 variants reported (15).

The HLA is not the only genetic risk-factor implicated in the development of inhibitors to FVIII. HA is caused by variants in the F8 gene that range from missense variants to large deletions (5). An earlier meta-analysis (of 30 independent studies and 5,383 participants) showed that larger gene disruptions (e.g., deletion of multiple exons) were associated with a higher OR of developing inhibitors (5). Although the meta-analysis did provide a considerably larger total cohort than any individual study, the approach suffers from some disadvantages. Meta-analyses often fail to control for the fixed effects attributable to different testing centers and the number of participants in each study. One possible outcome of this is Simpson's Paradox, in which trends identified in the individual studies cannot be found, or are reversed, in the pooled data (16). In addition, different studies often target participants with specific variants (e.g., Intron 22 Inversion) or specific populations. The differing baseline risks for these groups will further lead to heterogeneous population groups, which require careful analysis to avoid biases. Furthermore, meta-analyses rely solely on previously published studies; they will therefore suffer from publication bias and may exaggerate results by not considering unavailable, unpublished data (17). Consequently, meta-analyses are often considered more suitable for hypothesis generation than for hypothesis testing (18).

A large cohort of HA patients who are genotyped using consistent methods and for whom clinical information is available is a clear unmet need. The My Life, Our Future (MLOF) project [a collaboration between the American Thrombosis and Hemostasis Network (ATHN), National Hemophilia Foundation (NHF), Bloodworks Northwest and Bioverativ] provided free hemophilia genotype analysis for participants in the United States. As the MLOF collaboration did not HLA type the participants, we have HLA typed 1,000 participants for whom F8 genotype and clinical and demographic information was available. This data set is at least four-times larger than those used in published studies and is adequate to assess the association between HLA type and inhibitors. We found the HLA variant DPB1*02:02 is associated with higher odds of eliciting inhibitors to FVIII. The HLA variants DRB1*04:07 and DRB1*11:04 are associated with lower odds of developing inhibitors. With respect to pathogenic F8 variants, our results are consistent with the previous conclusion from a meta-analysis. Compared to participants with the intron-22-inversion, those with missense variants have significantly lower odds of inhibitor formation. Conversely, participants with large structural changes (>50 bp) show significantly higher odds of developing inhibitors. We also show that Hispanic participants had a higher prevalence of inhibitors.

2. Materials and Methods

2.1. Study Design

This is a retrospective case-controlled study. Data from the ATHNdataset was merged with HLA-typing data obtained by us for 997 participants. Phenotypic and genetic features were compared to the prevalence of inhibitor development in these participants. Statistical analysis was performed as a series of univariate logistic regression models that determined whether each variable was included in a multivariate logistic regression model.

2.2. Data Sources

The MLOF program is the result of a collaboration between the ATHN, NHF, and Bloodworks Northwest, with support of Bioverativ through June 2018. Participants and/or their parents gave written informed consent for inclusion of their samples and data in the MLOF Research Repository. Phenotypic data on MLOF Research Repository participants was abstracted from the ATHNdataset collected from participating hemophilia treatment centers around the United States, including demographic, phenotypic, and genomic data. Participants self-reported their race and ethnicity (19).

The background distribution of HLA-DRB1 Alleles was obtained using a population weighted according to US Census estimates of population demographics from July, 2018 (20).

2.3. Determinations of Hemophilia Severity and Inhibitor Development

Hemophilia Severity was identified based on reported FVIII baseline activity (percent of normal) in the ATHNdataset based on the following criteria: FVIII activity ≤1%, Severe; FVIII activity ≤5% but >1%, Moderate and FVIII activity >5%, Mild. Factor VIII activity was tracked using the lowest value that can be historically tracked. The assays were run in independent clinical laboratories and were primarily one-stage assays.
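
The severity criteria above can be written as a small classification function. This is a minimal sketch in Python; the function name `classify_severity` is hypothetical and is not part of the study's analysis code, and the thresholds are copied from the criteria as stated in this section.

```python
def classify_severity(fviii_activity_percent):
    """Map baseline FVIII activity (percent of normal) to a severity label.

    Thresholds follow the criteria stated in the text:
    <=1% Severe; >1% and <=5% Moderate; >5% Mild.
    """
    if fviii_activity_percent <= 1:
        return "Severe"
    if fviii_activity_percent <= 5:
        return "Moderate"
    return "Mild"


# Illustrative values only (not study data).
labels = [classify_severity(x) for x in (0.5, 1.0, 3.0, 5.0, 12.0)]
```

Because the lowest historically recorded activity is used, a participant would be classified once, from that minimum value.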

2.4. HLA Testing for Class II Loci Using Next Generation Sequencing

We used Next Generation Sequencing (NGS) as it offers robust HLA testing by increasing typing resolution vis-à-vis Sanger sequencing methods. DNA barcoding and single molecule sequencing were used to allow for better efficiency and economies of scale (21). LabCorp designed a test to sequence participant samples concurrently for Class II HLA loci DRB1/3/4/5, DQB1, and DPB1 by NGS. The validation was conducted using an open-platform and Illumina MiSeq analyzers. The gene coverage for the targeted NGS assay represents the Antigen Recognition Domain which is encoded in exon 2 for the MHC Class II (22).

2.5. Determining the Size of the Cohort Used for HLA Typing

No hard-and-fast rule exists for the selection of sample sizes for multivariate logistic regression; a general rule of thumb is that 10–30 samples are adequate to test the impact of a particular factor with sufficient statistical power (23). However, because of the heterogeneous distribution of alleles, some alleles would easily be found 30 or more times, whereas other alleles [e.g., HLA-DRB1*04:38 (0.0005%)] (20) would never be found in sufficient numbers for adequate statistical analysis regardless of the cohort size.

A list of 38 MHC-DRB1 alleles (20) was chosen to represent 99% of the North American Population and create a reasonable pool of alleles that would be found in the MLOF Research Repository cohort. A simulation was run generating 100 cohorts each of various sizes by randomly assigning alleles based on their frequencies in the North American population. We counted both the number of alleles that would be found in at least 30 participants as well as the population coverage of those alleles. Cohorts of sizes from 100 to 1,500 were generated and the population coverage of alleles which occurred in 30 or more individuals was recorded.
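
The simulation described above can be sketched as follows. This is an illustrative Python version, not the authors' code: the allele names and frequencies are toy placeholders rather than the 38 published frequencies, and drawing two alleles per participant (so that a participant counts once per distinct allele carried) is an assumption about how alleles were assigned.

```python
import random
from collections import Counter


def simulate_coverage(allele_freqs, cohort_size, n_runs=100, min_carriers=30, seed=0):
    """Simulate cohorts and report, per run, (number of alleles carried by at
    least `min_carriers` participants, summed population frequency of those alleles)."""
    rng = random.Random(seed)
    alleles = list(allele_freqs)
    weights = [allele_freqs[a] for a in alleles]
    results = []
    for _ in range(n_runs):
        carriers = Counter()
        for _ in range(cohort_size):
            # Assumption: two alleles per participant, drawn independently.
            genotype = rng.choices(alleles, weights=weights, k=2)
            carriers.update(set(genotype))  # count each participant once per allele
        kept = [a for a in alleles if carriers[a] >= min_carriers]
        results.append((len(kept), sum(allele_freqs[a] for a in kept)))
    return results


# Toy frequencies (hypothetical): three common alleles and one very rare one.
toy_freqs = {"A": 0.5, "B": 0.3, "C": 0.199, "D": 0.001}
runs = simulate_coverage(toy_freqs, cohort_size=1000)
```

In this toy setting the rare allele never reaches 30 carriers, which mirrors the point made in the text: very rare alleles stay below the threshold at any practical cohort size, so coverage is judged by the summed frequency of the alleles that do reach it.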

2.6. Filtering the Data

As it was not feasible to HLA type the entire cohort of 7,151 donors, a subset of 1,000 participants were chosen for HLA typing. Donors were filtered out for the following reasons (Figure 1):

FIGURE 1

Figure 1. Selection of participants for HLA typing. The 7,151 participants in the entire ATHNdataset were filtered to select participants suitable for further analysis. We have filtered out participants for sex, availability of DNA for HLA-typing, and lack of clinical or genetic information. There were 1,213 participants which met all our criteria. One thousand of these participants' DNA were sent for HLA typing.

Sex: Only participants who were listed as male were included in this analysis. The gene coding for the FVIII protein is located on the X chromosome; thus, inclusion of female participants would have introduced confounding factors, such as carrier status, into the analysis.

Available DNA: Only participants with DNA available for HLA typing were considered.

Medication type, treatment type, dosage information, comprehensive care information, and pathogenic variant: We included only individuals for whom clinical and genetic information was available.

After filtering the list of participants for inclusion in our HLA typed cohort, 1,213 participants remained: 958 without inhibitors and 255 with inhibitors. In order to enrich the population of inhibitor positive participants to match the proportion of inhibitor positive participants in the HA population (30%) we used stratified sampling (24). This method samples from different strata with different frequencies. We split the remaining participants into two strata based on inhibitor status. We carried out HLA typing on 1,000 participants, prioritizing selection of the inhibitor positive participants. This method was used to help bring the proportion of inhibitor positive samples to the desired 30%. As this is a case-controlled study, and our analysis relies on odds-ratios rather than relative risk or prevalence, we decided on this method to increase the frequency of observing inhibitor development. With this increase in frequency of inhibitor development observed, we stand a greater chance of observing events in conjunction with rare alleles.
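
A minimal Python sketch of the prioritized, two-stratum selection described above is shown here. It assumes that every inhibitor-positive participant was selected first and that the remaining slots were filled at random from the inhibitor-negative stratum; the text does not spell out the exact procedure, and the function name and record layout are hypothetical.

```python
import random


def prioritized_sample(participants, target_n, seed=0):
    """Select `target_n` participants, taking inhibitor-positive ones first."""
    rng = random.Random(seed)
    positives = [p for p in participants if p["inhibitor"]]
    negatives = [p for p in participants if not p["inhibitor"]]
    if len(positives) >= target_n:
        return rng.sample(positives, target_n)
    # Fill the remaining slots at random from the inhibitor-negative stratum.
    n_fill = min(target_n - len(positives), len(negatives))
    return positives + rng.sample(negatives, n_fill)


# Stratum sizes from the text: 255 with inhibitors, 958 without.
cohort = [{"id": i, "inhibitor": i < 255} for i in range(1213)]
selected = prioritized_sample(cohort, 1000)
share_positive = sum(p["inhibitor"] for p in selected) / len(selected)
```

Under this assumption the highest achievable proportion is 255/1,000 = 25.5%, which is below the 30% target and close to the proportion observed in the typed cohort.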

2.7. Missing Data

Of the 1,000 participants sent for HLA typing, 3 (0.3%) had missing HLA type data, 6 (0.6%) had missing data for race, and 3 (0.3%) had missing data for FVIII variant type. In total, <5% of data was missing.

2.8. Statistical Analysis

All HLA-DRB1/3/4/5, HLA-DPB1, HLA-DQB1, race, ethnicity, disease severity, and pathogenic variant type were analyzed using univariate logistic regression models. For each of the HLA variants, a participant was considered as having that variant if at least one of the alleles matched. For race and variant type, ORs were calculated compared to reference levels; White for race and intron-22 inversion for variant type. Variables with some degree (p < 0.25) of significant correlation to the odds of inhibitor development were included in a multivariate model using Hosmer and Lemeshow's guidance on “purposeful variable selection” (23).
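
The carrier coding and the univariate screen can be illustrated with a short Python sketch. For a single binary predictor, the odds ratio from a logistic regression equals the cross-product ratio of the 2 × 2 table, and the Wald standard error of the log odds ratio is the square root of the summed reciprocal cell counts, so no model-fitting library is needed here. The counts below are hypothetical and the function names are illustrative; the study itself used R.

```python
import math


def carries(genotype, allele):
    """True if at least one of the participant's alleles matches `allele`."""
    return allele in genotype


def univariate_or(a, b, c, d, z_crit=1.96):
    """Odds ratio, Wald 95% CI, and two-sided Wald p-value for a 2x2 table.

    a: carriers with inhibitor      b: carriers without inhibitor
    c: non-carriers with inhibitor  d: non-carriers without inhibitor
    """
    log_or = math.log((a * d) / (b * c))
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    p_value = math.erfc(abs(log_or / se) / math.sqrt(2))  # two-sided normal tail
    ci = (math.exp(log_or - z_crit * se), math.exp(log_or + z_crit * se))
    return math.exp(log_or), ci, p_value


# Hypothetical counts, for illustration only.
odds_ratio, ci, p_value = univariate_or(20, 10, 80, 90)
keep_for_multivariate = p_value < 0.25  # screening threshold used in the text
```

The deliberately loose threshold of 0.25 keeps variables that might become important once other covariates are controlled for; the strict test is applied later, to the multivariate model.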

A likelihood-ratio test was used to determine the appropriateness of adding age into the explanatory model as either a linear variable or a third-degree polynomial. The test compares the log likelihood of a model that includes age, age-squared, and age-cubed with that of a model that includes age only as a linear predictor. Twice this difference is compared to a χ² distribution with degrees of freedom equal to the number of additional variables. While adding variables will necessarily increase the likelihood of a model, the χ² test helps to include only variables that significantly improve the goodness-of-fit of the model.
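
As a worked illustration of this comparison, the sketch below computes the test statistic as twice the difference in log likelihoods and its p-value for the two added terms (age-squared and age-cubed). It relies on the fact that a χ² distribution with 2 degrees of freedom has the closed-form upper tail exp(−x/2). The log-likelihood values are hypothetical, and the function name is illustrative.

```python
import math


def lrt_two_extra_terms(loglik_reduced, loglik_full):
    """Likelihood-ratio test for a full model with two extra parameters.

    Returns (test statistic, p-value). The chi-square upper tail with
    2 degrees of freedom is exp(-x / 2).
    """
    statistic = 2.0 * (loglik_full - loglik_reduced)
    p_value = math.exp(-statistic / 2.0)
    return statistic, p_value


# Hypothetical log likelihoods: linear-age model vs. cubic-age model.
statistic, p_value = lrt_two_extra_terms(-400.0, -395.0)

# Inverting the same formula: a p-value of 0.0094 with 2 degrees of freedom
# corresponds to a statistic of -2 * ln(0.0094), roughly 9.33.
implied_statistic = -2.0 * math.log(0.0094)
```

The closed form applies only to the 2-degree-of-freedom case used here; other comparisons would need a general χ² tail function.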

P-values from the multivariate model were adjusted using the Benjamini-Hochberg method (25) for controlling the rate of false discoveries. The adjusted p-value reported represents the strictest false discovery rate for which a particular hypothesis will be rejected using the Benjamini-Hochberg method as used in the R (26) function “p.adjust” (27). An adjusted p-value presented here can be directly compared to using a desired false discovery rate of 0.05 as an acceptance criterion for a hypothesis test and will yield equivalent results to calculating individual p-value thresholds for each of multiple hypotheses.
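
For readers unfamiliar with the adjustment, the following Python sketch reproduces the Benjamini-Hochberg calculation: each sorted p-value is multiplied by m/rank, and a running minimum taken from the largest p-value downward enforces monotonicity. It is intended to mirror what R's `p.adjust` does with `method = "BH"`; the input p-values are made up for illustration.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    # Visit p-values from largest to smallest.
    order = sorted(range(m), key=lambda i: pvalues[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for steps_from_top, i in enumerate(order):
        rank = m - steps_from_top  # 1-based rank in ascending order
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Illustrative p-values only (not study results).
adjusted = bh_adjust([0.01, 0.04, 0.03, 0.005])
significant = [q < 0.05 for q in adjusted]
```

Comparing each adjusted value with 0.05 gives the same decisions as comparing each raw p-value with its own rank-specific threshold.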

This procedure was also repeated for a subset of the study cohort which had severe hemophilia omitting the variable coding for disease severity. This subset of only severe HA participants was used for our primary analysis.

All statistical analysis was done using the R programming language (26) and all graphics were produced using ggplot2 (28). All tables were produced using kable (29) and LaTeX (30).

2.9. Predicted Binding Affinity of “Foreign Sequences” at the Location of Missense Mutation in the F8 Gene of Study Participants

For all study participants with a missense mutation, a list of FVIII sequences that would be foreign for each participant was generated. These are the wild-type sequences (found in the infused FVIII drug) at the location of the mutation, which are foreign to a participant with the missense mutation. We then used netMHCIIpan version 3.2 (31) to estimate the binding affinities of all foreign peptides in the region of the missense mutations to the HLA-DRB1 alleles identified in that participant. The binding affinities were reported as percentile ranks. The minimum percentile rank (highest affinity) for each participant was used to determine whether binding affinities are significantly higher for those individuals with inhibitors. We used the Shapiro-Wilk test to assess normality, and the two groups were compared using a one-sided Mann–Whitney U-test of the hypothesis that the percentile rank scores of participants who had developed an inhibitor would be lower than those of participants who had not developed inhibitors.
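
The generation of foreign peptides can be sketched as a sliding window over the wild-type sequence that keeps every peptide overlapping the substituted position. The Python below is illustrative only: the 15-residue window length is an assumption (the text does not state the peptide length used), the sequence is a placeholder rather than FVIII, the ranks are invented, and the binding prediction itself (netMHCIIpan) is not reproduced.

```python
def foreign_peptides(wild_type, position, length=15):
    """All wild-type peptides of `length` that span the 0-based `position`."""
    first_start = max(0, position - length + 1)
    last_start = min(position, len(wild_type) - length)
    return [wild_type[s:s + length] for s in range(first_start, last_start + 1)]


def best_rank(percentile_ranks):
    """Lowest percentile rank (strongest predicted binder) for one participant."""
    return min(percentile_ranks)


# Placeholder sequence and a short window, for illustration only.
peptides = foreign_peptides("ABCDEFGHIJKLMNOPQRSTUVWXYZ", position=10, length=5)
participant_score = best_rank([12.0, 5.5, 40.0])  # hypothetical predicted ranks
```

Each peptide contains the wild-type residue at the mutated position, so each is foreign to a participant who carries the substitution.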

3. Results

3.1. Selecting a Representative Cohort Size

Per our simulation, a sample size of 1,000 participants provides adequate coverage of HLA variants in all 100 runs. In these runs, 97.3% of the allele population of North America was expected to be found in sufficient numbers for analysis.

3.2. Participant Characteristics

We have presented a detailed breakdown of participant characteristics for the entire ATHNdataset (N = 7,151), the subset of HLA-typed participants with severe HA (N = 612), and the HLA-typed subset (N = 997) (Tables 1–3). We have additionally compared the characteristics of the HLA-typed group as a whole with the entire ATHNdataset to ensure that there was no bias in the cohort selection other than the enrichment of inhibitor positive cases (Supplementary Table 1).

TABLE 1

Table 1. Participant characteristics, the ATHNdataset.

TABLE 2

Table 2. Participant characteristics, HLA typed participants with severe hemophilia A.

TABLE 3

Table 3. Participant characteristics, HLA typed participants.

The entire cohort studied was 7,151 Hemophilia A participants. Of these participants, 1,123 (15.7%) had developed inhibitory antibodies. In the subset of 612 severe HA participants, 217 (35.5%) had developed inhibitors. In the subset of 997 HLA-typed participants, 252 (25.28%) had developed inhibitory antibodies.

3.3. HLA Typing of Participants

We obtained 1,000 DNA samples from MLOF. The samples were subjected to high resolution (4-digit) HLA typing. The complete data set is presented in Supplementary Table 2. Each HLA variant in our study occurs at a frequency that is comparable to its respective frequency in the North American population (Figure 2). Moreover, the HLA-DRB1 alleles identified in the 997 participants cover 99.5% of the allelic variation in North America. Additionally, 18 alleles representing 82% of the North American population were found 30 or more times. The distribution of HLA-DRB1 and HLA-DQB1 alleles in the cohort is comparable to the distribution of alleles in the North American population.

FIGURE 2

Figure 2. HLA frequencies in the study cohort. HLA frequencies of the 997 participants are compared to the expected frequencies of the North American sub-population. (A) The frequencies of HLA-DRB1 alleles in the study cohort (green) closely match the background distribution of alleles in the whole North American population. (B) The frequencies of HLA-DQB1 alleles in the study cohort (green) closely match the background distribution of alleles in the whole North American population.

3.4. Univariate Analysis of Severe HA Participants

Each variable was first analyzed using a univariate logistic regression model (see section 2) and those determined to be significant with a p-value of < 0.25 were included in the final multivariate model.

Univariate analysis identified the following HLA alleles (p < 0.25) for inclusion in the multivariate model: 9 HLA-DRB1 alleles (HLA-DRB1*01:01, HLA-DRB1*01:03, HLA-DRB1*04:04, HLA-DRB1*04:05, HLA-DRB1*04:07, HLA-DRB1*08:01, HLA-DRB1*11:04, HLA-DRB1*15:01, and HLA-DRB1*15:03); 4 HLA-DQB1 alleles (HLA-DQB1*03:01, HLA-DQB1*05:01, HLA-DQB1*05:02, and HLA-DQB1*06:02); 3 HLA-DPB1 alleles (HLA-DPB1*02:02, HLA-DPB1*18:01, and HLA-DPB1*19:01); as well as HLA-DRB3*01:01 and HLA-DRB5*01:01 (Supplementary Table 3). No HLA-DRB4 alleles met the criterion for inclusion in the multivariate model.

Both Hispanic and Black or African American/Not Hispanic participants, compared to white participants, met the univariate criterion of p < 0.25 to be included in the multivariate model (Supplementary Table 4).

Four variant types (compared to intron 22 inversion) were selected for inclusion in the multivariate model: frameshifts, large structural changes (>50 bp), missense variants, and nonsense variants (Supplementary Table 4).

Age, as either a linear predictor or as a third-degree polynomial, was significant for inclusion in the multivariate model. A comparison of log likelihoods (see section 2; p = 0.0094 with 2 degrees of freedom) showed that the third-degree polynomial was a significant addition to the model.

3.5. A Multivariate Regression Analysis of Severe HA Participants

Based on the variables selected using univariate analysis, a final multivariate model was fit including 18 HLA alleles, one variable for race, four different variant types, and three variables for age (age, age-squared, and age-cubed). P-values for the variables were adjusted and results with a false discovery rate of <0.05 were considered significant (Figure 3, Table 4). It was found that controlling for age was extremely important in explaining the higher incidence of inhibitor development in younger participants independent of other factors.

FIGURE 3

Figure 3. Results of the multivariate analysis. (A) Severe HA participants: increased odds of inhibitor development were found for Hispanic participants (OR = 2.50, 95%CI 1.37–4.54), large structural variants (OR = 2.85, 95%CI 1.21–6.67), and HLA-DPB1*02:02 (OR = 16.50, 95%CI 2.87–94.78). Decreased odds were found for missense variants (OR = 0.18, 95%CI 0.089–0.35), HLA-DRB1*04:07 (OR = 0.17, 95%CI 0.048–0.58), and HLA-DRB1*11:04 (OR = 0.18, 95%CI 0.048–0.67). (B) All HLA-typed participants: increased odds of inhibitor development were found for severe HA (OR = 2.92, 95%CI 1.56–5.50), large structural variants (OR = 4.03, 95%CI 1.82–8.89), and HLA-DPB1*02:02 (OR = 6.08, 95%CI 1.95–18.94). Decreased odds were found for missense variants (OR = 0.37, 95%CI 0.22–0.60).

TABLE 4

Table 4. Multivariate model results—severe HA participants.

One HLA allele was found to be significantly correlated with increased odds of having developed an inhibitor: HLA-DPB1*02:02 [OR = 16.5, 95% CI (2.87, 94.78), adjusted-p = 9.35 × 10⁻³].

Two HLA alleles were found to be associated with decreased odds for having developed inhibitors: HLA-DRB1*04:07 [OR = 0.17, 95% CI (0.048, 0.58), adjusted-p = 0.0174]; and HLA-DRB1*11:04 [OR = 0.18, 95% CI (0.048, 0.67), adjusted-p = 0.0334].

Missense variants were found to be correlated with decreased odds for inhibitor development [OR = 0.18, 95% CI (0.09, 0.35), adjusted-p = 1.42 × 10⁻⁵]. Large structural variants affecting >50 base pairs were associated with increased odds of inhibitor development [OR = 2.85, 95% CI (1.21, 6.67), adjusted-p = 0.045].
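
The wide interval for HLA-DPB1*02:02 is easier to interpret on the log scale, where a Wald-type confidence interval is symmetric around the log odds ratio. The Python sketch below recovers the geometric midpoint and the implied standard error from the reported limits. It assumes the intervals are Wald-type, which the text does not state; the function name is illustrative.

```python
import math


def log_scale_summary(lower, upper, z_crit=1.96):
    """Geometric midpoint of a CI and the implied SE of the log odds ratio."""
    midpoint = math.sqrt(lower * upper)
    se_log_or = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    return midpoint, se_log_or


# Reported 95% CI for HLA-DPB1*02:02 in severe HA participants.
midpoint, se_log_or = log_scale_summary(2.87, 94.78)
```

The midpoint of about 16.49 agrees with the reported OR of 16.5, and the large implied standard error on the log scale reflects how few participants carried this allele.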

The effect of age was significant for each of the three variables for age, with adjusted-p values of 4.1 × 10⁻⁴, 3.3 × 10⁻³, and 1.1 × 10⁻². The polynomial allows for a steep decrease in OR from 0 to 20 years, a relatively flat OR from 20 to 60, and slight decreases in OR after age 60. Overall, the odds of inhibitor development decrease as participants get older.

3.6. Univariate Analysis in All HLA-Typed Participants

Each variable was first analyzed using a univariate logistic regression model (see section 2) and those determined to be significant with a p-value of <0.25 were included in the final multivariate model.

Univariate analysis identified the following HLA alleles (p < 0.25) for inclusion in the multivariate model: 12 HLA-DRB1 alleles (HLA-DRB1*01:01, HLA-DRB1*01:03, HLA-DRB1*04:04, HLA-DRB1*04:05, HLA-DRB1*04:07, HLA-DRB1*08:01, HLA-DRB1*11:01, HLA-DRB1*11:04, HLA-DRB1*12:01, HLA-DRB1*12:02, HLA-DRB1*15:01, and HLA-DRB1*15:03); 4 HLA-DQB1 alleles (HLA-DQB1*03:01, HLA-DQB1*05:01, HLA-DQB1*05:02, and HLA-DQB1*06:02); 7 HLA-DPB1 alleles (HLA-DPB1*02:02, HLA-DPB1*03:01, HLA-DPB1*05:01, HLA-DPB1*10:01, HLA-DPB1*11:01, HLA-DPB1*18:01, and HLA-DPB1*19:01); as well as HLA-DRB3*01:01 and HLA-DRB5*01:01 (Supplementary Table 5). No HLA-DRB4 alleles met the criterion for inclusion in the multivariate model.

Both severe HA and moderate HA diagnoses, as compared to mild HA, met the criterion of a p < 0.25 for inclusion into the multivariate model (Supplementary Table 6).

Similarly, only one race/ethnicity (compared to White), Black or African American/Not Hispanic, met the univariate criterion of p < 0.25 to be included in the multivariate model (Supplementary Table 6).

Four variant types (compared to intron 22 inversion) were selected for inclusion in the multivariate model: intron 1 inversions, large structural changes (>50 bp), missense variants, and nonsense variants (Supplementary Table 6).

Age, as either a linear predictor or as a third-degree polynomial, was significant for inclusion in the multivariate model. A comparison of log Likelihoods (see section 2; p = 0.0105 with 2 degrees of freedom) showed that the third-degree polynomial was a significant addition to the model.

3.7. A Multivariate Regression Analysis of All HLA-Typed Participants

Based on the variables selected using univariate analysis, a final multivariate model was fit including 25 HLA alleles, one variable for race/ethnicity, four different variant types, two variables for disease severity, and three variables for age (age, age-squared, and age-cubed). P-values for the variables were adjusted and results with a false discovery rate of <0.05 were considered significant (Figure 3, Table 5).

TABLE 5

Table 5. Multivariate model results—all HLA-typed participants.

Severe HA participants had a higher rate of inhibitor formation [OR = 2.92, 95% CI (1.56, 5.50), adjusted-p = 0.0075].

One HLA allele was found to be associated with increased odds for inhibitor development: HLA-DPB1*02:02 [OR = 6.08, 95% CI (1.95, 18.94), adjusted-p = 0.011]. No HLA alleles were found to be significantly correlated with decreased odds for having developed an inhibitor.

Missense variants were found to be correlated with decreased odds for inhibitor development [OR = 0.37, 95% CI (0.23, 0.60), adjusted-p = 0.0018]. Large structural variants affecting greater than 50 base pairs were associated with increased odds of inhibitor development [OR = 4.02, 95% CI (1.82, 8.89), adjusted-p = 0.0067].

The effect of age was significant for each of the three variables for age, with adjusted-p values of 0.0012, 0.007, and 0.023. The polynomial allows for a steep decrease in OR from 0 to 20 years, a relatively flat OR from 20 to 60, and slight decreases in OR after age 60. Overall, the odds of inhibitor development decrease as participants get older.

3.8. Comparison of Predicted Binding Affinities of “Foreign Sequences” at the Location of Missense Mutation in the F8 Gene of Study Participants

The median percentile rank binding affinity in participants who had ever had an inhibitory response was 5.5 as compared to 7.5 for participants who had never had an inhibitory response. As the data was found to be non-Gaussian even after transformations were applied (Shapiro-Wilk p-values of 7.67 × 10⁻⁷ and ≤ 2.2 × 10⁻¹⁶, respectively), the non-parametric Mann–Whitney U-test was used. This test rejected the null hypothesis that the distributions of the two samples were similar (Supplementary Figure 1).

4. Discussion

Published work supports the postulate that genetic factors play an important role in the development of inhibitors to FVIII drug products (4, 5, 10–14, 19). A meta-analysis published in 2012 showed that F8 variant type influenced inhibitor development in HA patients (5). In the study reported here involving 612 participants with severe HA who were HLA typed, we found that the risk of inhibitor development was higher in participants with large (>50 bp) structural variants (OR = 2.85) and lower in patients with missense variants (OR = 0.18; Figure 3, Table 4) which is consistent with the meta-analysis of previous studies. However, other genetic risk factors for inhibitor development have not been researched to the same extent as F8 variants. One of the most important genetic variables associated with immune responses is the HLA repertoire. The handful of studies on the association between specific HLAs and inhibitors (10–14) involve 57 to 176 participants which is too low for making statistical estimates.

In this survey we have focused on the HLA-DRB1/3/4/5, HLA-DPB1, and HLA-DQB1 genes. The generation of anti-drug antibodies to replacement proteins is driven by CD4+ helper T cells (9). This pathway involves the HLA Class II molecules. The HLA-DRB1 variants are the most diverse Class II molecules and are predominantly, but not exclusively, involved in the presentation of peptides derived from protein drugs. For instance, in a recent study of FVIII peptides identified on monocyte-derived dendritic cells, 78% of the peptides were found to bind HLA-DRB1 (32).

The MLOF data set includes pathogenic F8 variant data for 7,151 participants. All samples were processed at a central facility using the same validated method (33). It was not cost-effective for us to HLA type all 7,151 participants. We determined through simulation that 1,000 participants were expected to provide adequate coverage of alleles while also having enough samples of common alleles to allow for statistical comparisons.

We obtained high-resolution HLA typing for 997 of the 1,000 participants. The HLA-DRB1 alleles identified represent 99% of the allelic variation in North America. The individual HLA-DRB1 and HLA-DQB1 alleles in our data set occur at frequencies comparable to those found in the North American population (Figure 2).

Based on ORs in our analysis of participants with severe HA from a multivariate binomial logistic regression, we determined that only one HLA variant, DPB1*02:02, was associated with a higher risk of inhibitor development with an OR of 16.50 (95% CI 2.87–94.78). Two variants, DRB1*04:07 and DRB1*11:04, were associated with a lower risk of inhibitor development with ORs of 0.17 (95% CI 0.048–0.58) and 0.18 (95% CI 0.05–0.67; Figure 3, Table 4). While similar trends were found in the data examining all HLA-typed participants, the two seemingly protective HLA-DRB1 variants failed to reach statistical significance (Figure 3, Table 5).

Several studies indicate that Hispanic HA patients are at higher risk of developing inhibitors than White patients (34–36). Using the 612 HLA typed participants with severe HA from the MLOF Research Repository provided similar results. Hispanic participants had an association with higher rates of inhibitor formation (OR = 2.50, 95% CI 1.37–4.54; Figure 3, Table 4). The differential inhibitor-risk based on race independent of HLA-type is interesting in the context of HLA repertoires because human sub-populations have different relative frequencies of HLA variants (37).

An interesting aspect of our findings is that HLA variants identified as having a significant association with inhibitor development are relatively rare in the North American population. Moreover, the HLA-DPB1*02:02 allele occurred in only 10 participants which is less frequent than most other alleles. It is only because we HLA typed a relatively large cohort of participants that these rare HLA alleles were identified in sufficient numbers to obtain statistical significance in multivariate analyses.

Our analysis failed to confirm findings in previous works comparing HLA-type to inhibitor development (3, 7, 10–14). Plausible reasons include the small sample sizes in previous studies and the reliance on univariate analyses failing to control for correlation with other cofactors. It is not possible to determine if these studies included a representative distribution of HLA variants or sufficient numbers of replicates of each variant for statistical analysis. However, given the limited number of participants in each study, 57–176, it is unlikely that these criteria were met.

A limitation of our study is the possibility of bias in the selection of participants for HLA-typing. As this study involved a post-hoc analysis, the data was not collected with this study in mind. Incomplete clinical data and limited availability of DNA for HLA-testing forced us to select participants who could satisfy the needs of this study. Our study is an important step in identifying correlates with significant effects on a patient's risk of inhibitor development. We hope that this will help to inform future research on the relationship between HLA-type (independently and in association with other genetic markers) and inhibitor development in patients with HA.

To illustrate this class of studies, we used the subset of participants with a missense mutation in the F8 gene to explore the hypothesis that foreign-peptide-HLA-DRB1 binding affinity is a risk factor for inhibitor development. Several previous studies, based on inhibitor data from HA participants with missense mutations, support this hypothesis (7, 38, 39). The rationale is that presentation of foreign peptides is an initial, necessary step in eliciting an immune response to a protein therapeutic (9). Thus, for an immune response to be elicited, two conditions must be met: (a) the infused protein must generate peptides that are foreign to the patient and (b) these foreign peptides must be efficiently presented to the immune system by HLA molecules. Foreign peptides that bind with high affinity to an individual patient's HLA have a lower off-rate and thus a higher probability of eliciting an immune response. The median percentile rank binding affinity in participants who were inhibitor positive was 5.5, as compared to 7.5 for participants who had never developed inhibitors. As the data were found to be non-Gaussian even after transformations were applied (Shapiro–Wilk p-values of 7.67 × 10⁻⁷ and ≤ 2.2 × 10⁻¹⁶, respectively), the non-parametric Mann–Whitney U-test was used. This test rejected the null hypothesis that the distributions of the two samples were similar (Supplementary Figure 1).
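The comparison described above can be sketched in a few lines. The example below implements the Mann–Whitney U statistic with a normal approximation for the two-sided p-value, without tie or continuity correction, so it is a simplified stand-in for the test as run in a statistics package. The percentile ranks are invented for illustration and are not the study data. The function name `mann_whitney_u` is hypothetical.

```python
import math
from statistics import median


def mann_whitney_u(x, y):
    """Return (U for x, two-sided p) using the normal approximation.

    No tie or continuity correction is applied, so p is only approximate,
    especially for small samples.
    """
    n1, n2 = len(x), len(y)
    # U counts pairs (xi, yj) in which xi exceeds yj; ties contribute one half.
    u_x = sum(1.0 if xi > yj else 0.5 if xi == yj else 0.0
              for xi in x for yj in y)
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u_x - mean_u) / sd_u
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return u_x, p


# Hypothetical percentile ranks, invented for illustration only.
inhibitor_positive = [0.8, 2.1, 3.4, 4.9, 6.0, 7.2, 9.5]
inhibitor_negative = [3.0, 5.8, 7.4, 8.1, 10.6, 12.3, 15.0, 21.7]

u, p = mann_whitney_u(inhibitor_positive, inhibitor_negative)
print(median(inhibitor_positive), median(inhibitor_negative), u, round(p, 3))
```

A rank-based test of this kind makes no assumption of normality, which is why it suits data that remain non-Gaussian after transformation.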

In this study we present a data set of 997 fully HLA-typed participants with HA. The HLA-typed subset is derived from a larger set of 7,151 participants. The 997 HLA-typed participants capture the heterogeneity of the HA participant population with respect to F8 variants, severity of disease, and racial diversity. Moreover, the HLA-DRB1 variants identified represent 98% of the North American population, occur at the relative frequencies observed in the wider population, and include sufficient replicates of each HLA-DRB1 variant for meaningful statistical analyses. Using this data set we identified 1 HLA variant associated with an increased risk of inhibitors and 2 HLA variants associated with a reduced risk of inhibitors. The MLOF Research Repository is an extremely useful data set for uncovering the genetic determinants associated with inhibitor development in HA. The HLA repertoire is an important and highly variable genetic characteristic of HA patients that was previously lacking from this data set. The enhanced data set can now be used to generate more complex models to identify biomarkers for predicting inhibitor development in HA patients.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Ethics Statement

The studies involving human participants were reviewed and approved by Western International Review Board. Written informed consent to participate in this study was provided by the participants' legal guardian/next of kin.

Author Contributions

JM and ZS designed the research. JM and VS performed the research. JM, VS, and ZS wrote the paper. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by intramural grants from the US Food and Drug Administration (FDA), and in part by an appointment of JM to the Research Participation Program at the Center for Biologics Evaluation and Research administered by the Oak Ridge Institute for Science and Education through an interagency agreement between the US Department of Energy and the FDA (to ZS). The MLOF program was developed as a partnership between NHF, ATHN, Bloodworks Northwest, and Bioverativ and supported financially by Bioverativ, NHF, Bloodworks Northwest, and ATHN.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank Dr. Michael Recht of ATHN, Dr. Barbara Konkle of Bloodworks Northwest, and Artur Belov of the FDA for their extremely helpful comments and suggestions.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2021.663396/full#supplementary-material

References

1. Lusher JM, Arkin S, Abildgaard CF, Schwartz RS. Recombinant factor VIII for the treatment of previously untreated patients with hemophilia A. Safety, efficacy, and development of inhibitors. N Engl J Med. (1993) 328:453–9. doi: 10.1056/NEJM199302183280701

2. Wight J, Paisley S. The epidemiology of inhibitors in haemophilia A: a systematic review. Haemophilia. (2003) 9:418–35. doi: 10.1046/j.1365-2516.2003.00780.x

3. Pavlova A, Delev D, Lacroix-Desmazes S, Schwaab R, Mende M, Fimmers R, et al. Impact of polymorphisms of the major histocompatibility complex class II, interleukin-10, tumor necrosis factor-alpha and cytotoxic T-lymphocyte antigen-4 genes on inhibitor development in severe hemophilia A. J Thromb Haemost. (2009) 7:2006–15. doi: 10.1111/j.1538-7836.2009.03636.x

4. Pavlova A, Zeitler H, Scharrer I, Brackmann HH, Oldenburg J. HLA genotype in patients with acquired haemophilia A. Haemophilia. (2010) 16:107–12. doi: 10.1111/j.1365-2516.2008.01976.x

5. Gouw SC, van den Berg HM, Oldenburg J, Astermark J, de Groot PG, Margaglione M, et al. F8 gene mutation type and inhibitor development in patients with severe hemophilia A: systematic review and meta-analysis. Blood. (2012) 119:2922–34. doi: 10.1182/blood-2011-09-379453

6. Eckhardt CL, van Velzen AS, Peters M, Astermark J, Brons PP, Castaman G, et al. Factor VIII gene (F8) mutation and risk of inhibitor development in nonsevere hemophilia A. Blood. (2013) 122:1954–62. doi: 10.1182/blood-2013-02-483263

7. Pandey GS, Yanover C, Howard TE, Sauna ZE. Polymorphisms in the F8 gene and MHC-II variants as risk factors for the development of inhibitory anti-factor VIII antibodies during the treatment of hemophilia a: a computational assessment. PLoS Comput Biol. (2013) 9:e1003066. doi: 10.1371/journal.pcbi.1003066

8. Bachelet D, Albert T, Mbogning C, Hässler S, Zhang Y, Schultze-Strasser S, et al. Risk stratification integrating genetic data for factor VIII inhibitor development in patients with severe hemophilia A. PLoS ONE. (2019) 14:e0218258. doi: 10.1371/journal.pone.0218258

9. Sauna ZE, Lagassé D, Pedras-Vasconcelos J, Golding B, Rosenberg AS. Evaluating and mitigating the immunogenicity of therapeutic proteins. Trends Biotechnol. (2018) 36:1068–84. doi: 10.1016/j.tibtech.2018.05.008

10. Hay CR, Ollier W, Pepper L, Cumming A, Keeney S, Goodeve AC, et al. HLA class II profile: a weak determinant of factor VIII inhibitor development in severe haemophilia A. Thromb Haemost. (1997) 77:234–7. doi: 10.1055/s-0038-1655944

11. Oldenburg J, Picard JK, Schwaab R, Brackmann HH, Tuddenham EG, Simpson E. HLA genotype of patients with severe haemophilia A due to intron 22 inversion with and without inhibitors of factor VIII. Thromb Haemost. (1997) 77:238–42. doi: 10.1055/s-0038-1655945

12. Bril WS, MacLean PE, Kaijen PH, van den Brink EN, Lardy NM, Fijnvandraat K, et al. HLA class II genotype and factor VIII inhibitors in mild haemophilia A patients with an Arg593 to Cys mutation. Haemophilia. (2004) 10:509–14. doi: 10.1111/j.1365-2516.2004.01011.x

13. De Barros MF, Herrero JC, Sell AM, DeMelo FC, Braga MA, Pelissari CB, et al. Influence of class I and II HLA alleles on inhibitor development in severe haemophilia A patients from the south of Brazil. Haemophilia. (2012) 18:e236–40. doi: 10.1111/j.1365-2516.2011.02604.x

14. Kempton CL, Payne AB. HLA-DRB1-factor VIII binding is a risk factor for inhibitor development in nonsevere hemophilia: a case-control study. Blood Adv. (2018) 2:1750–5. doi: 10.1182/bloodadvances.2018019323

15. Norman PJ, Norberg SJ, Guethlein LA, Nemat-Gorgani N, Royce T, Wroblewski EE, et al. Sequences of 95 human. Genome Res. (2017) 27:813–23. doi: 10.1101/gr.213538.116

16. Simpson EH. The interpretation of interaction in contingency tables. J R Stat Soc. (1951) 13:4. doi: 10.1111/j.2517-6161.1951.tb00088.x

17. Reade MC, Delaney A, Bailey MJ, Angus DC. Bench-to-bedside review: avoiding pitfalls in critical care meta-analysis–funnel plots, risk estimates, types of heterogeneity, baseline risk and the ecologic fallacy. Crit Care. (2008) 12:220. doi: 10.1186/cc6941

18. Jakobsen JC, Wetterslev J, Winkel P, Lange T, Gluud C. Thresholds for statistical and clinical significance in systematic reviews with meta-analytic methods. BMC Med Res Methodol. (2014) 14:120. doi: 10.1186/1471-2288-14-120

19. Konkle BA, Johnsen JM, Wheeler M, Watson C, Skinner M, Pierce GF, et al. Genotypes, phenotypes and whole genome sequence: approaches from the my life our future haemophilia project. Haemophilia. (2018) 24(Suppl. 6):87–94. doi: 10.1111/hae.13506

20. McGill JR, Yogurtcu ON, Verthelyi D, Yang H, Sauna ZE. SampPick: selection of a cohort of subjects matching a population HLA distribution. Front Immunol. (2019) 10:2894. doi: 10.3389/fimmu.2019.02894

21. Smith AM, Heisler LE, St Onge RP, Farias-Hesson E, Wallace IM, Bodeau J, et al. Highly-multiplexed barcode sequencing: an efficient method for parallel analysis of pooled samples. Nucleic Acids Res. (2010) 38:e142. doi: 10.1093/nar/gkq368

22. Mack SJ, Cano P, Hollenbach JA, He J, Hurley CK, Middleton D, et al. Common and well-documented HLA alleles: 2012 update to the CWD catalogue. Tissue Antigens. (2013) 81:194–203. doi: 10.1111/tan.12093

23. Hosmer DW, Lemeshow S, Sturdivant RX. Applied Logistic Regression. 3rd Edn. Hoboken, NJ: Wiley (2013).

24. Levy PS, Lemeshow S. Sampling of Populations: Methods and Applications. 4th Edn. Wiley Series in Survey Methodology. Hoboken, NJ: Wiley (2008).

25. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. (1995) 57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x

26. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing (2019). Available online at: https://www.R-project.org

27. Yekutieli D, Benjamini Y. Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics. J Stat Plann Infer. (1999) 82:171–96. doi: 10.1016/S0378-3758(99)00041-5

28. Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York, NY: Springer-Verlag (2016).

29. Xie Y. Dynamic Documents With R and knitr. Chapman Hall/CRC the R series. Boca Raton, FL: CRC Press (2014).

30. Lamport L. LaTeX: A Document Preparation System. Reading, MA: Addison-Wesley Pub. Co. (1986).

31. Jensen KK, Andreatta M, Marcatili P, Buus S, Greenbaum JA, Yan Z, et al. Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology. (2018) 154:394–406. doi: 10.1111/imm.12889

32. Jankowski W, Park Y, McGill J, Maraskovsky E, Hofmann M, Diego VP, et al. Peptides identified on monocyte-derived dendritic cells: a marker for clinical immunogenicity to FVIII products. Blood Adv. (2019) 3:1429–40. doi: 10.1182/bloodadvances.2018030452

33. Johnsen JM, Fletcher SN, Huston H, Roberge S, Martin BK, Kircher M, et al. Novel approach to genetic analysis and results in 3000 hemophilia patients enrolled in the My Life, Our Future initiative. Blood Adv. (2017) 1:824–34. doi: 10.1182/bloodadvances.2016002923

34. Kruse-Jarres R, Pajewski NM, Leissinger CA. The role of race and ethnicity in the clinical outcomes of severe hemophilia A patients with inhibitors. Blood. (2007) 110:1163. doi: 10.1182/blood.V110.11.1163.1163

35. Carpenter SL, Michael Soucie J, Sterner S, Presley R, Hemophilia Treatment Center Network I. Increased prevalence of inhibitors in Hispanic patients with severe haemophilia A enrolled in the Universal Data Collection database. Haemophilia. (2012) 18:e260–5. doi: 10.1111/j.1365-2516.2011.02739.x

36. Miller CH, Benson J, Ellingsen D, Driggers J, Payne A, Kelly FM, et al. F8 and F9 mutations in US haemophilia patients: correlation with history of inhibitor and race/ethnicity. Haemophilia. (2012) 18:375–82. doi: 10.1111/j.1365-2516.2011.02700.x

37. González-Galarza F, Takeshita LC, Santos EM, Kempson F, Maia M, Silva A, et al. Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res. (2014) 43:D784–8. doi: 10.1093/nar/gku1166

38. van Haren SD, Wroblewska A, Herczenik E, Kaijen PH, Ruminska A, ten Brinke A, et al. Limited promiscuity of HLA-DRB1 presented peptides derived of blood coagulation factor VIII. PLoS ONE. (2013) 11:e80239. doi: 10.1371/journal.pone.0080239

39. Yanover C, Jain N, Pierce G, Howard TE, Sauna ZE. Pharmacogenetics and the immunogenicity of protein therapeutics. Nat Biotechnol. (2011) 29:870–3. doi: 10.1038/nbt.2002

Keywords: inhibitors, factor VIII, HLA-type, statistics, hemophilia, ATHN, MLOF

Citation: McGill JR, Simhadri VL and Sauna ZE (2021) HLA Variants and Inhibitor Development in Hemophilia A: A Retrospective Case-Controlled Study Using the ATHNdataset. Front. Med. 8:663396. doi: 10.3389/fmed.2021.663396

Received: 02 February 2021; Accepted: 06 April 2021;
Published: 07 May 2021.

Edited by:

Giancarlo Castaman, University of Florence, Italy

Reviewed by:

Karin Fijnvandraat, Amsterdam University Medical Center, Netherlands
Jan Voorberg, AMC-Sanquin Landsteiner Laboratory, Netherlands

Copyright © 2021 McGill, Simhadri and Sauna. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zuben E. Sauna, zuben.sauna@fda.hhs.gov

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
