Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching

Maassen, Willem; Legger, Geertje; Kul Cinar, Ovgu; van Daele, Paul; Gattorno, Marco; Bader-Meunier, Brigitte; Wouters, Carine; Briggs, Tracy; Johansson, Lennart; van der Velde, Joeri; Swertz, Morris; Omoyinmi, Ebun; Hoppenreijs, Esther; Belot, Alexandre; Eleftheriou, Despina; Caorsi, Roberta; Aeschlimann, Florence; Boursier, Guilaine; Brogan, Paul; Haimel, Matthias; van Gijn, Marielle

doi:10.3389/fimmu.2023.1215869

ORIGINAL RESEARCH article

Front. Immunol., 12 September 2023

Sec. Autoimmune and Autoinflammatory Disorders: Autoinflammatory Disorders

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1215869

Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching

Willem Maassen^1†

Geertje Legger^2†

Ovgu Kul Cinar^3†

Paul van Daele^4,5

Marco Gattorno⁶

Brigitte Bader-Meunier^7,8

Carine Wouters⁹

Tracy Briggs^10,11

Lennart Johansson¹

Joeri van der Velde¹

Morris Swertz¹

Ebun Omoyinmi³

Esther Hoppenreijs¹²

Alexandre Belot^13,14

Despina Eleftheriou³

Roberta Caorsi⁶

Florence Aeschlimann^7,15

Guilaine Boursier¹⁶

Paul Brogan¹⁷

Matthias Haimel¹⁸

Marielle van Gijn^19*

¹Genomics Coordination Centre, Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
²Department of Rheumatology and Clinical Immunology, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
³Department of Paediatric Rheumatology, Great Ormond Street Hospital for Children National Health Service Trust, London, United Kingdom
⁴Department of Internal Medicine, Erasmus Medical Centre, Rotterdam, Netherlands
⁵Department of Immunology, Erasmus Medical Centre, Rotterdam, Netherlands
⁶UOC Reumatologia e Malattie Autoinfiammatorie, IRCCS Istituto Giannini Gaslini, Genoa, Italy
⁷Department of Paediatric Immunology-Hematology and Rheumatology, Necker University Hospital - APHP, Paris, France
⁸Laboratory of Immunogenetics of Paediatric Autoimmune Diseases, UMR 1163, Imagine Institute, INSERM, Paris, France
⁹Department of Pediatric Rheumatology, University Hospital Leuven, Leuven, Belgium
¹⁰Division of Evolution and Genomic Sciences, School of Biological Sciences, University of Manchester, Manchester, United Kingdom
¹¹Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester University Hospitals National Health Service Foundation Trust, Manchester, United Kingdom
¹²Department of Pediatric Rheumatology, Pediatrics, Radboud University Medical Center, Nijmegen, Netherlands
¹³National Referee Centre for Rheumatic and AutoImmune and Systemic Diseases in Children (RAISE), Pediatric Nephrology, Rheumatology, Dermatology Unit, INSERM, Hospital of Mother and Child, Hospices Civils of Lyon, Lyon, France
¹⁴International Center of Infectiology Research (CIRI), University of Lyon, INSERM, Claude Bernard University, Lyon, France
¹⁵Division of Pediatric Rheumatology, University Children’s Hospital Basel, Basel, Switzerland
¹⁶Laboratory of Rare and Autoinflammatory Genetic Diseases and Reference Centre for Autoinflammatory Diseases and Amyloidosis (CEREMAIA), Department of Medical Genetics, Rare Diseases and Personalized Medicine, CHU Montpellier, University of Montpellier, Montpellier, France
¹⁷Inflammation and Rheumatology Section, University College London Great Ormond Street Institute of Child Health, London, United Kingdom
¹⁸Boehringer Ingelheim RCV GmbH & Co KG, Vienna, Austria
¹⁹Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands

Introduction: Accurate and standardized phenotypic descriptions are essential in diagnosing rare diseases and discovering new diseases, and the Human Phenotype Ontology (HPO) system was developed to provide a rich collection of hierarchical phenotypic descriptions. However, although the HPO terms for inborn errors of immunity have been improved and curated, it has not been investigated whether this curation improves the diagnosis of systemic autoinflammatory disease (SAID) patients. Here, we aimed to study if improved HPO annotation for SAIDs enhanced SAID identification and to demonstrate the potential of phenotype-driven genome diagnostics using curated HPO terms for SAIDs.

Methods: We collected HPO terms from 98 genetically confirmed SAID patients across eight different European SAID expertise centers and used the LIRICAL (Likelihood Ratio Interpretation of Clinical Abnormalities) computational algorithm to estimate the effect of HPO curation on the prioritization of the correct SAID for each patient.

Results: Our results show that the percentage of correct diagnoses increased from 66% to 86% and that the number of diagnoses with the highest ranking increased from 38 to 45. In a further pilot study, curation also improved HPO-based whole-exome sequencing (WES) analysis, diagnosing 10/12 patients before and 12/12 after curation. In addition, the average number of candidate diseases that needed to be interpreted decreased from 35 to 2.

Discussion: This study demonstrates that curation of HPO terms can increase identification of the correct diagnosis, emphasizing the high potential of HPO-based genome diagnostics for SAIDs.

Introduction

Systemic autoinflammatory diseases (SAIDs) are rare heterogeneous disorders in which there is aberrant activation of the innate immune system without an infectious cause. SAIDs are characterized by recurrent episodes of fever and other signs and symptoms of inflammation, such as arthritis, serositis, skin abnormalities, and elevated acute-phase proteins. The first monogenetic SAIDs described were hereditary fever disorders such as familial Mediterranean fever (FMF). These are caused by pathogenic variants in genes encoding for inflammasome-related proteins resulting in aberrant activation of interleukin-1 (IL1) (1, 2). Since the discovery of FMF around 1997, many more monogenic SAIDs have been described involving pathways such as the interferon-related autoinflammatory diseases (interferonopathies) and nuclear factor kappa light chain enhancer of activated B cells (NF-kB)-related autoinflammatory diseases (rhelopathies) (2, 3). The SAIDs are a heterogeneous group of rare disorders, and even though many SAIDs are monogenetic, many patients with a suspected SAID do not receive a genetic diagnosis (2, 4).

Given the rarity of these diseases, most clinicians see only a few SAID patients throughout their careers, which may lead to considerable diagnostic delay or misdiagnosis, potentially causing delayed or even incorrect treatment and adverse patient outcomes (5).

Accurate and standardized phenotypic descriptions are essential to diagnose rare diseases, discover new disease-causing genes, and make accurate phenotype–genotype correlations. This information is not only important for interpretation of genetic data but also needed for the sharing of clinical data to combine knowledge and cluster unsolved patients. Given this unmet need, the Human Phenotype Ontology (HPO) system was conceptualized and was published with initial terminology in 2008 (6). HPO is a community-based tool that has evolved as a fundamental collection of medical vocabulary, and it has been increasingly adopted as the standard to describe phenotypic abnormalities for clinical phenotypic data capture (7). Each HPO term describes a distinct phenotypic feature (e.g., lymphadenopathy, HP:0002716), and the HPO structure allows for similarity measures between patient phenotypes. HPO now contains more than 200,000 phenotypic annotations for hereditary diseases, of which 2,120 are considered rare diseases, affecting fewer than 1 in 2,000 people in the general population (5).

Unfortunately, the lack of comprehensive SAID-related HPO terms has limited the use of HPO for these diseases. In a previous endeavor, HPO terms for several known inborn errors of immunity (IEI), including 32 SAIDs, were systematically reviewed, curated, and submitted to the HPO database (8). However, whether this curation process improved the diagnosis of SAID has not been investigated. Moreover, in the period since the previous curation, 10 new monogenetic SAIDs have been discovered and submitted (9).

In this multicenter study, we aimed to demonstrate the potential of using HPO terms in diagnosing SAIDs. To start, we curated the HPO terms for the 10 new monogenetic SAIDs and submitted these to the HPO database. Next, we investigated the effect of curation of the HPO terms for all 42 curated SAIDs on diagnosis using data for 98 genetically confirmed SAID patients from eight different expertise centers. To measure the effect of the curation effort on finding the correct diagnosis, we used LIRICAL, a computational algorithm, to calculate how consistent the phenotypes are with the verified diagnosis (10). Finally, we did a pilot study to illustrate how HPO-based genome diagnostics for SAIDs could perform.

Materials and methods

Standardized reannotation

To curate monogenetic SAIDs, we used the 2021-10-10 version of the HPO database and a standardized reannotation method to reannotate the 10 newly discovered SAIDs (8, 11). In short, publications that described phenotypic presentations of the diseases of interest were collected by SAID experts (12). Next, phenotypic features were extracted and transformed into HPO terms using a machine learning–based model (8). Two-tier expert evaluation was performed on the HPO terms, and additional terms could be suggested as required. When at least 80% agreement was achieved, the validated terms were submitted to the HPO. Together with the results of Haimel et al. (8), 42 different SAIDs have now been reannotated. Below, we refer to the 2021-10-10 version of the HPO database as the “original set of HPO terms” and the curated version as the “curated set of HPO terms.”

Patient cohort

For this study, we created an anonymized patient cohort of 98 patients from eight different European SAID expertise centers that are part of the ERN-RITA (European Reference Network for Rare Immunodeficiencies, Autoinflammatory and Autoimmune Diseases) (Supplementary Table S1). Patients who were genetically confirmed to have one of the 42 reannotated SAIDs were selected by clinicians based on availability of patients with these rare disorders in each expertise center. We aimed to include patients with as many of the reannotated disorders as possible. Clinicians were asked to retrospectively use HPO terms to describe the phenotype of the patients based on patient records. The HPO terms and the specific SAID for each patient were shared anonymously to build the resulting cohort. Below, we refer to the genetically confirmed diagnoses as “verified diseases.” If available, whole-exome sequencing (WES) data were collected in Variant Call Format (VCF) (n = 12) (Supplementary Table S1). The patient data fulfill all of the requirements for patient anonymity according to the ethical committee of the University Medical Centre Groningen. Written informed consent was obtained from the 12 patients for sharing their WES data.

LIRICAL

LIRICAL (Likelihood Ratio Interpretation of Clinical Abnormalities) is an open-source program that uses the likelihood ratio (LR) statistic to determine whether a phenotypic abnormality is consistent with the diseases in the HPO database (10, 13) (Supplementary Material 1). We chose the LIRICAL method because it can assess combinations of HPO terms and because it outperforms similar phenotype-driven gene-prioritization methods (7). We used LIRICAL version 1.3.4.

HPO-based LR calculation

LIRICAL calculates an LR for every candidate disease using HPO terms as input. For the calculation of the phenotype-based LR for candidate diseases per patient, LIRICAL uses the frequency of all phenotypic features among patients with a disease and the frequency in the background population. Both frequencies are extracted from the HPO database. Using the online repository and the documentation of Robinson et al. (10), we generated markdown language files (YAML) for all of the patients that contained the HPO terms documented for the patient and required data files such as the list of HPO-annotated diseases in the HPO database and the posttest probability threshold used (5%). Results were extracted from the generated Tab-Separated Value (TSV) files for all patients.

HPO- and WES-based LR calculation

LIRICAL calculated an additional LR based on the number of predicted pathogenic variants encountered in the genes associated with the candidate disease based on HPO and the frequency of the variant in the background population. To do this, the VCF file and the documented HPO terms were used while adding the reference genome and the location of the VCF file. Using the LR, LIRICAL ranked the different possible diagnoses for every potential diagnosis reported.

Evaluating curated HPO terms using LIRICAL

Comparing LRs before and after curation

LR scores for the verified diseases of the 98 patients were calculated for the verified diseases using both the original and curated set of HPO terms. The verified diseases were defined based on Online Mendelian Inheritance in Man (OMIM) disease identifiers (14). The output was matched based on the patient identifier and the OMIM identifier for their verified diseases. For comparison reasons, only patients whose verified SAIDs were found in both lists were included.

Comparing disease rankings before and after curation

Based on the calculated LRs, candidate diseases were ranked per patient. The assigned ranks were divided into four groups: Ranks 1–3 (high), Ranks 4–9 (intermediate), Rank ≥10 (low), and Undetected. A verified disease was considered to be undetected when it was not found in the candidate list. Using these groups, we compared the set of HPO terms before and after the curation effort. To determine whether the difference was specific for a particular SAID, we compared the number of patients for whom a specific verified SAID had a high rank [true positive (TP)] with the number of patients with a different verified SAID for which this specific SAID was also ranked high [false positive (FP)].

Use of HPO in a genome diagnostics pilot

We also analyzed the impact of including WES data when using LIRICAL. Here, we compared the average number of candidate diseases found by LIRICAL (while still including the verified disease) with the LIRICAL results using only the HPO terms as input. As LIRICAL ranks all possible candidate diseases, whereas a clinician would initially mainly consider the highest-ranked diseases, we only included diseases found by LIRICAL with an LR >0 and a rank ≤10.

These numbers were also compared to WES analysis without HPO terms. Because LIRICAL does not support analysis of WES data without HPO terms, we used MOLGENIS VIP version 5.0.5 (default settings), a computational diagnostic pipeline that combines widely used algorithms, to annotate, filter, and classify genetic variants following the American College of Medical Genetics (ACMG) guidelines (15, 16) (Supplementary Material 2). To determine the number of candidate diseases, we extracted the unique number of genes in which genetic variants were found. We also carried out the same analysis using a virtual gene panel of the genes associated with the 42 different SAIDs that were reannotated in HPO. Ten patients whose verified SAIDs were found using all four methods were included.

Statistical analysis

Assuming a normal distribution for the calculated LRs, we used the Student’s t-test to compare the average difference in LRs using the HPO terms before and after the curation effort. The Student’s t-test was used to determine the confidence interval for the average increase in LRs. To test whether this resulted in a significantly different distribution of assigned ranks, we used a two-sided Fisher’s exact test. For all tests, a p value <0.05 was considered statistically significant. All statistical analyses were performed in R version 4.2.2 (17).

Results

Reannotated diseases

For this study, we reannotated 10 newly discovered SAIDs (SOCS1, TET2, CEBPE, CDC42, LSM11, RNU7-1, STAT2, RIPK1, NCKAP1L, and UBA1) as described in the International Union of Immunological Societies (IUIS) classification criteria of 2021 (11). On average, 44 HPO terms were added to each disease. Together with 32 previously reannotated SAIDs, this resulted in 42 reannotated and curated SAIDs (Supplementary Table S2).

Comparing the LRs for verified diseases before and after curation

Of 92 detected SAIDs (the SAIDs of six patients were excluded, as they were not detected using either the original or the curated set of HPO terms), 38% showed an increased log₁₀(LR) with an average increase of 2.61 (p = 1.7 × 10^-7) (Figure 1). Therefore, assuming an equal pretest probability, the curated HPO terms increased the posttest probability of the verified SAID as a diagnosis.

FIGURE 1

Figure 1 Likelihood ratios for verified diseases using HPO terms. The LRs for the SAIDs of 92 patients using the original and the curated HPO set. Each dot represents the LR of the verified SAID of a patient. The x-axis and y-axis describe the log₁₀(LR)s calculated using the set of HPO terms before curation and after curation, respectively. Dots that fall within the gray zone indicate SAIDs with an improved LR. LRs can be interpreted as how many times more (or less) likely it is that patients have the disease based on the documented HPO terms as compared to patients without the disease. Negative LRs thus indicate that these patients are less likely to have the disease. Using the curated HPO set, results showed an increased log₁₀(LR) (p = 1.7 × 10^-7) with an average increase of 2.61 [95% CI (1.71, 3.50)].

Ranking of diseases before and after curation

Comparing the rankings of verified diseases

Next, we ranked the candidate diseases based on the calculated LRs. In general, the ranking of the verified SAID increased when using the curated set of HPO terms (p = 0.0009). Based on the calculated log₁₀(LR)s, 66% (65/98) vs. 86% (84/98) of SAIDs were detected before and after curation, respectively. The verified SAIDs were ranked in the highest category in 38 patients using the original HPO terms, and this rose to 45 patients when using the curated HPO terms. Figure 2 shows the total number of curated SAIDs and how they were grouped within the different ranges.

FIGURE 2

Figure 2 Summary of disease rank before and after curation. Using the set of HPO terms before curation, just 65 of the 98 verified SAIDs were detected (log₁₀LR >0), whereas 84 of the 98 were detected using the set of HPO terms after curation. The ranked SAIDs were divided into four ranges: Ranks 1–3, Ranks 4–9, Rank ≥10, and Undetected. This resulted in two significantly different distributions within the different ranges (p = 0.0009). The right bar with the rank distribution using the curated SAIDs resulted in seven more SAIDs being ranked from 1 to 3. In addition, 19 more SAIDs were detected.

We also looked at the rankings of individual diseases and noted that the effect of the curation effort was different for each SAID. The most positive effect was seen for Muckle–Wells syndrome (MWS), where the verified SAID was assigned the highest rank in only two out of seven patients before curation as compared to six out of seven patients after curation (Supplementary Figure S1). In contrast, FMF was assigned a lower rank (Rank ≥10) in three out of 16 patients before curation, whereas this was the case in 12 out of 16 patients after curation.

Comparing TP and FP rates

To assess if the improvement after curation was specific for each SAID rather than due to improved detection of SAIDs in general, we compared the TP and FP rates. Table 1 shows the TP and FP rates for all verified diseases assigned a rank between 1 and 3. TP rates increased for MWS, TRAPS (periodic fever, familial, autosomal dominant), and VEXAS (VEXAS syndrome, somatic), with VEXAS showing the biggest increase (from 3 out of 7 to 6 out of 7). However, the increase in specificity was different for each disease. The FP rate of MWS increased from 6 to 22, whereas the FP rate of VEXAS increased from 6 to 8.

TABLE 1

Table 1 True-positive and false-positive rates.

Use of HPO in a genome diagnostics pilot

For 12 of the 98 patients, we also had WES data, enabling a genome diagnostics pilot study. For the original dataset using only HPO terms, the verified SAID was found in the candidate list for nine out of 12 patients. When also using the WES data, the verified SAID was found for 10 out of 12 patients. Using the curated set of HPO terms, the verified SAID was found for 10 out of 12 patients [one patient with FMF and one patient with DADA2 (vasculitis, autoinflammation, immunodeficiency, and hematologic defects syndrome) were missed], whereas the verified SAID was found for 12 out of 12 patients when using the curated HPO terms and the WES data. Using a WES analysis detected all 12 verified SAIDs, both with and without the use of a virtual SAID gene panel of 42 different genes to filter the results (Supplementary Table S2).

As the number of candidate diseases left to evaluate after analysis determines the applicability of an approach for genome diagnostics, we compared the average number of candidate diseases left for clinicians to interpret after the analysis using four methods: using only HPO terms, using HPO terms and WES data, using only WES data, and using WES data filtered based on a virtual SAID gene panel of 42 different genes. Figure 3 illustrates the average number of candidate diseases that were generated. Although the numbers are small, this figure suggests that HPO-based WES-filtering produces similar results to filtering based on a SAID gene panel.

FIGURE 3

Figure 3 Average number of candidate diseases using LIRICAL with the curated set of HPO terms and MOLGENIS VIP. The first two bars show the average numbers of candidate diseases detected using LIRICAL with HPO without WES data and with WES data as input, 35.7 [95% CI (28.4, 43.0)] and 2.2 [95% CI (2.0, 2.4)], respectively. The total number of candidate diseases and the SAID gene panel-filtered number of candidate diseases by MOLGENIS VIP were extracted by grouping the detected variants by gene (all studied SAIDs are monogenic), and these results are represented by the last two bars, with values of 222 [95% CI (191.7, 8,252.3)] and 2.1 [95% CI (1.8, 2.4)], respectively. A log2-transformation is applied on the y-axis to compare the height of the different bars. *The number of candidate diseases is very high and is therefore not used in a diagnostic setting. This analysis setting was to demonstrate that interpreting WES data without clinical direction results in a large number of variants.

Discussion

In this multicenter study, we aimed to demonstrate the importance of curating HPO terms for SAIDs and the potential of HPO-based SAID diagnostics. To do so, we used 42 reannotated SAIDs: 32 curated SAIDs from a previous endeavor and 10 new SAIDs that were curated and submitted to the HPO database in the current study (8, 11). To validate the curation, we collected HPO terms for 98 SAID patients from eight different European expertise centers and used LIRICAL to compare the results before and after curation (7). HPO reannotation led to enhanced homogeneity in the disease clusters and improved the ranking of the true diagnoses in most real-life patients. The curation resulted in more accurate ranking of verified SAIDs, with seven SAIDs prioritized in the highest-ranking category. In addition, we detected 19 more verified SAIDs. Finally, a pilot study showed that an analysis using curated HPO terms and WES data resulted in the highest diagnostic rate, with an average of only two candidate genes left to interpret. This indicates that curating HPO terms increases the efficiency of computational algorithms to prioritize the correct SAID in real-life patients.

After analyzing the differences in the rankings of the individual SAIDs, we observed that the effect of the curation process varied per disease. For 13 SAIDs, the total number of patients detected increased. In addition, for seven SAIDs, the number of patients assigned the highest rank increased. On the other hand, for DADA2 and FMF, the number of patients detected did not change and the average rank decreased. The same variation in the effect of the curation process was reflected by differences in the TP and FP rates for different verified diseases, which indicates how well LIRICAL is able to rank the diseases using the LR. TP rates increased for MWS, TRAPS, and VEXAS, with VEXAS showing the biggest increase, from 3 out of 7 to 6 out of 7. However, the increase in specificity is also different for each disease. The FP rate of MWS increased from 6 to 22, whereas the FP rate of VEXAS increased from 6 to 8. Additionally, although the TP for DADA2 decreased slightly, from 7 to 6, the FP rate for DADA2 decreased from 17 to 9. This suggests that an increase in sensitivity could be at the expense of a decrease in specificity. However, in clinical use, high sensitivity for potential rare disorders might be more valuable for drawing the attention of clinicians to diseases they did not encounter before in regular practice.

One possible explanation for the variation per disease could be the homogeneity of disease phenotypes in SAID patients, which may have led to overlap between the different documented HPO terms. This phenotypic overlap could dilute the contribution of an individual HPO term to the LR of a specific verified disease. This effect is also represented by the slight drop in the diagnoses below the diagonal in Figure 1 and by the fact that the pilot study missed DADA2 and FMF, which are both annotated with many common HPO terms. A technical explanation for this apparent lack in specificity could be that LIRICAL assumes an equal pretest probability for all SAIDs (10). However, in practice, it seems more likely that clinicians have a suspicion for a specific subset of diseases based on their experience. Finally, LIRICAL assumes that there is no relation between phenotypes and that they always occur independently. Nevertheless, phenotypic features could include or exclude each other, contributing to more specific HPO term annotations.

The diagnostic value of a computational algorithm is determined by both the prioritization of candidate diseases and the number of candidate diseases left to evaluate. The latter number represents the number of diseases left for clinicians to interpret to verify their suspicions of a possible diagnosis or to apply for a SAID gene panel study. Therefore, we compared the average number of candidate diseases in the list after analysis using different methods. The number of candidate diseases found using HPO-based WES analysis was similar to the number of candidates found using a SAID-based virtual gene panel analysis. Although we were only able to collect WES data for 12 patients, our results suggest that the added value of HPO terms is most prominent when using them in genetic analysis, as also concluded by Yuan et al. (7).

Our study has some limitations. Although we involved pediatric and adult expertise centers from different countries, SAIDs are rare diseases, so we could have introduced selection bias due to the limited availability of patients in the different centers. Another source of bias could be introduced by the individual clinicians who retrospectively translated the clinical description of the patients into HPO terms. Different clinicians might have a tendency to assign specific HPO terms or to use a greater number of HPO terms to describe a patient phenotype. For example, the minimum number of assigned HPO terms to an MWS patient is 3 and the maximum number of assigned HPO terms to an MWS patient is 12. A solution here could have been to use a predetermined method, e.g., a two-tier expert evaluation, to assign HPO terms. However, our aim was to make use of the diversity in clinicians to evaluate the added value of HPO terms in daily practice. For this reason, we did not specify the method by which the HPO terms needed to be assigned.

The HPO system itself is a community-based system in which clinicians and translational researchers can extend and refine the HPO system (6). It combines the experience and knowledge of experts all over the world. However, it is dependent on the quality of many different sources and the availability of HPO terms for specific symptoms. For some relevant SAID symptoms, the HPO terms were unavailable, such as non-infectious osteomyelitis. These new HPO terms have been submitted to the HPO project but were not yet included in the current study. In addition, because of the rarity of many SAIDs, we did not use the frequency of symptoms and age of onset in the annotation of the specific SAIDs. For less rare SAIDs like FMF, including this information might have resulted in a better prioritization and detection rate.

We chose to use LIRICAL to measure the impact of the curation because it was developed by the same research group that initiated the HPO project and because the open-source LIRICAL software is publicly available (6, 10). Additionally, LIRICAL provides the possibility to use multiple HPO terms and the combined information of HPO terms and WES data. However, we have not performed a benchmark study to compare the different available phenotype-based prioritization tools that might perform better in the prioritization of the correct disease (18). Nevertheless, we have shown that systemically curating HPO terms for SAIDs significantly improves the ranking of the correct diagnoses in real-life patients and might accelerate clustering of phenotypes in larger disease cohorts.

A potential use of HPO terms in genome diagnostics could be as a method for developing WES gene panels. Gene panel–based filtering for genetically heterogeneous disorders is widely implemented in current practice (19). Advantages of this approach are its focus on well-defined genes and its ability to minimize incidental findings. Nevertheless, because of their focus, gene panels may miss important disease-related genes that were initially not related to the disease (20). Moreover, selection and curation of adequate target genes for a panel are time-consuming. Hereditary genetic disorders can affect more than one organ system, with various clinical presentations, which makes gene panel curation a challenging task (19). As a consequence, bioinformatic approaches have been developed to create phenotype-driven gene targets. Maver et al. (19) demonstrated that phenotype-based associations between genes in a virtual gene panel correspond with the gene associations in genome diagnostics and that it could be integrated into sequencing workflows. Using a phenotype-based approach could save time when curating new gene panels, as clinical experts and researchers from different disciplines are able to contribute to the development of the HPO database (21). Finally, because genes can be related to multiple HPO terms, developing a virtual gene panel based on combined sets of HPO terms that are related to specific diseases could increase diagnostic specificity.

In conclusion, we have demonstrated the potential of HPO-based SAID diagnostics and the value of our HPO curation effort by an increased probability of finding the correct diagnosis in real-life patients. Future curation efforts could contribute to the annotation of more specific HPO terms to distinguish between homogeneous disease phenotypes for SAIDs and other disease clusters. This will support the identification of new disease-causing genes and potentially lead to high-quality phenotype-based virtual gene panels.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the ethical committee of the University Medical Centre Groningen. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent from the 12 patients for sharing their WES data was obtained.

Author contributions

Wrote the paper: WM, GL, OC, MH, MvG. Curated HPO terms for SAIDs: GL, OC, PD, MG, BB-M, CW, TB, EH, AB, DE, RC, FA, PB. Analyzed the data: WM, MH. Conceived and designed the experiments: WM, GL, MvG, MH, JV, LJ, MS. Supervised project: MvG. Gathered patient data: GL, PD, EO, GB, EH, MG, RC, CW, FA, BB-M. Reviewed paper: GL, OC, MG, MH, PD, MG, BB-M, CW, TB, LJ, JV, MS, EO, EH, AB, DE, RC, FA, GB, PB. All authors contributed to the article and approved the submitted version.

Acknowledgments

This study has been performed as part of the Molecular Testing working group of ERN-RITA and ISSAID (International Society of Systemic Auto-Inflammatory Diseases) members. Additionally, we thank Katy McIntyre, our indoor scientific editor at the Genetics Department in the University Medical Centre Groningen, for her efforts.

Conflict of interest

MH is currently employed by Boehringer Ingelheim RCV GmbH & Co KG.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1215869/full#supplementary-material

Glossary

www.frontiersin.org

References

1. Romano M, Arici ZS, Piskin D, Alehashemi S, Aletaha D, Barron KS, et al. The 2021 EULAR/American College of Rheumatology points to consider for diagnosis, management and monitoring of the interleukin-1 mediated autoinflammatory diseases: cryopyrin-associated periodic syndromes, tumour necrosis factor receptor-associated periodic syndrome, mevalonate kinase deficiency, and deficiency of the interleukin-1 receptor antagonist. Ann Rheum Dis (2022) 81(7):907–21. doi: 10.1136/annrheumdis-2021-221801

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Yildiz M, Haslak F, Adrovic A, Barut K, Kasapcopur O. Autoinflammatory diseases in childhood. Balkan Med J (2020) 37(5):236–46. doi: 10.4274/balkanmedj.galenos.2020.2020.4.82

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Papa R, Caorsi R, Volpi S, Gattorno M. New monogenic autoinflammatory diseases: 2021 year in review. Immunol Lett (2022) 248:96–8. doi: 10.1016/j.imlet.2022.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Sutera D, Bustaffa M, Papa R, Matucci-Cerinic C, Matarese S, D’Orsi C, et al. Clinical characterization, long-term follow-up, and response to treatment of patients with syndrome of undifferentiated recurrent fever (SURF). Semin Arthritis Rheumatol (2022) 55:152024. doi: 10.1016/j.semarthrit.2022.152024

CrossRef Full Text | Google Scholar

5. Benito-Lozano J, Arias-Merino G, Gomez-Martinez M, Ancochea-Diaz A, Aparicio-Garcia A, Posada de la Paz M, et al. Diagnostic process in rare diseases: determinants associated with diagnostic delay. Int J Environ Res Public Health (2022) 19(11):6456. doi: 10.3390/ijerph19116456

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Robinson PN, Kohler S, Bauer S, Seelow D, Horn D, Mundlos S. The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. Am J Hum Genet (2008) 83(5):610–5. doi: 10.1016/j.ajhg.2008.09.017

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Yuan X, Wang J, Dai B, Sun Y, Zhang K, Chen F, et al. Evaluation of phenotype-driven gene prioritization methods for Mendelian diseases. Brief Bioinform (2022) 23(2):bbac019. doi: 10.1093/bib/bbac019

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Haimel M, Pazmandi J, Heredia RJ, Dmytrus J, Bal SK, Zoghi S, et al. Curation and expansion of Human Phenotype Ontology for defined groups of inborn errors of immunity. J Allergy Clin Immunol (2022) 149(1):369–78. doi: 10.1016/j.jaci.2021.04.033

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Tangye SG, Al-Herz W, Bousfiha A, Cunningham-Rundles C, Franco JL, Holland SM, et al. Human inborn errors of immunity: 2022 update on the classification from the international union of immunological societies expert committee. J Clin Immunol (2022) 42(7):1473–507. doi: 10.1007/s10875-022-01289-3

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Robinson PN, Ravanmehr V, Jacobsen JOB, Danis D, Zhang XA, Carmody LC, et al. Interpretable clinical genomics with a likelihood ratio paradigm. Am J Hum Genet (2020) 107(3):403–17. doi: 10.1016/j.ajhg.2020.06.021

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Tangye SG, Al-Herz W, Bousfiha A, Cunningham-Rundles C, Franco JL, Holland SM, et al. The ever-increasing array of novel inborn errors of immunity: an interim update by the IUIS committee. J Clin Immunol (2021) 41(3):666–79. doi: 10.1007/s10875-021-00980-1

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Papa R, Cant A, Klein C, Little MA, Wulffraat NM, Gattorno M, et al. Towards European harmonisation of healthcare for patients with rare immune disorders: outcome from the ERN RITA registries survey. Orphanet J Rare Dis (2020) 15(1):33. doi: 10.1186/s13023-020-1308-x

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Feller W. An Introduction to Probability Theory and Its Applications. 2 ed. Wiley (1991).

Google Scholar

14. Amberger J, Bocchini CA, Scott AF, Hamosh A. McKusick’s online mendelian inheritance in man (OMIM). Nucleic Acids Res (2009) 37(Database issue):D793–6. doi: 10.1093/nar/gkn665

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Johansson LFv, Hendriksen D, Charbon B, Slofstra MK, Sietsma R, van den Hoek S, et al. Abstracts from the 54th European society of human genetics (ESHG) conference: e-posters. Eur J Hum Genet (2022) 30:487. doi: 10.1038/s41431-021-01026-1

CrossRef Full Text | Google Scholar

16. Richards S, Aziz N, Bale S, Bick D, Das S, Gastier-Foster J, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med (2015) 17(5):405–24. doi: 10.1038/gim.2015.30

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Team RC. R: A Language and Environment for Statistical Computing. (2022). Available at: https://www.R-project.org.

Google Scholar

18. van der Velde KJ, van den Hoek S, van Dijk F, Hendriksen D, van Diemen CC, Johansson LF, et al. A pipeline-friendly software tool for genome diagnostics to prioritize genes by matching patient symptoms to literature. Adv Genet (Hoboken) (2020) 1(1):e10023. doi: 10.1002/ggn2.10023

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Maver A, Lovrecic L, Volk M, Rudolf G, Writzl K, Blatnik A, et al. Phenotype-driven gene target definition in clinical genome-wide sequencing data interpretation. Genet Med (2016) 18(11):1102–10. doi: 10.1038/gim.2016.22

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Neveling K, Feenstra I, Gilissen C, Hoefsloot LH, Kamsteeg EJ, Mensenkamp AR, et al. A post-hoc comparison of the utility of sanger sequencing and exome sequencing for the diagnosis of heterogeneous diseases. Hum Mutat (2013) 34(12):1721–6. doi: 10.1002/humu.22450

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Köhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, et al. The human phenotype ontology in 2021. Nucleic Acids Res (2021) 49(D1):D1207–d17. doi: 10.1093/nar/gkaa1043

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: Human Phenotype Ontology (HPO), systemic autoinflammatory disorders (SAIDs), genome diagnostics, variant interpretation, whole exome sequencing (WES), LIRICAL

Citation: Maassen W, Legger G, Kul Cinar O, van Daele P, Gattorno M, Bader-Meunier B, Wouters C, Briggs T, Johansson L, van der Velde J, Swertz M, Omoyinmi E, Hoppenreijs E, Belot A, Eleftheriou D, Caorsi R, Aeschlimann F, Boursier G, Brogan P, Haimel M and van Gijn M (2023) Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching. Front. Immunol. 14:1215869. doi: 10.3389/fimmu.2023.1215869

Received: 02 May 2023; Accepted: 09 August 2023;
Published: 12 September 2023.

Edited by:

Ozgur Kasapcopur, Istanbul University-Cerrahpasa, Türkiye

Reviewed by:

Mohamed Tharwat Hegazy, Cairo University, Egypt
Amra Adrovic, Koç University Hospital, Türkiye
Sezgin Sahin, Istanbul University-Cerrahpasa, Türkiye

Copyright © 2023 Maassen, Legger, Kul Cinar, van Daele, Gattorno, Bader-Meunier, Wouters, Briggs, Johansson, van der Velde, Swertz, Omoyinmi, Hoppenreijs, Belot, Eleftheriou, Caorsi, Aeschlimann, Boursier, Brogan, Haimel and van Gijn. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Marielle van Gijn, bS5lLnZhbi5naWpuQHVtY2cubmw=

†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching

Introduction

Materials and methods

Standardized reannotation

Patient cohort

LIRICAL

HPO-based LR calculation

HPO- and WES-based LR calculation

Evaluating curated HPO terms using LIRICAL

Comparing LRs before and after curation

Comparing disease rankings before and after curation

Use of HPO in a genome diagnostics pilot

Statistical analysis

Results

Reannotated diseases

Comparing the LRs for verified diseases before and after curation

Ranking of diseases before and after curation

Comparing the rankings of verified diseases

Comparing TP and FP rates

Use of HPO in a genome diagnostics pilot

Discussion

Data availability statement

Ethics statement

Author contributions

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

Glossary

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good