A graph-based machine learning framework identifies critical properties of FVIII that lead to hemophilia A

Ferreira, Marcos V.; Nogueira, Tatiane; Rios, Ricardo A.; Lopes, Tiago J. S.

doi:10.3389/fbinf.2023.1152039

ORIGINAL RESEARCH article

Front. Bioinform. , 10 May 2023

Sec. Network Bioinformatics

Volume 3 - 2023 | https://doi.org/10.3389/fbinf.2023.1152039

This article is part of the Research Topic Graph Representation Learning in Biological Networks View all 5 articles

A graph-based machine learning framework identifies critical properties of FVIII that lead to hemophilia A

Marcos V. Ferreira¹

Tatiane Nogueira¹

Ricardo A. Rios¹

Tiago J. S. Lopes²*

¹Institute of Computing, Federal University of Bahia, Salvador, Brazil
²Center for Regenerative Medicine, National Center for Child Health and Development Research Institute, Tokyo, Japan

Introduction: Blood coagulation is an essential process to cease bleeding in humans and other species. This mechanism is characterized by a molecular cascade of more than a dozen components activated after an injury to a blood vessel. In this process, the coagulation factor VIII (FVIII) is a master regulator, enhancing the activity of other components by thousands of times. In this sense, it is unsurprising that even single amino acid substitutions result in hemophilia A (HA)—a disease marked by uncontrolled bleeding and that leaves patients at permanent risk of hemorrhagic complications.

Methods: Despite recent advances in the diagnosis and treatment of HA, the precise role of each residue of the FVIII protein remains unclear. In this study, we developed a graph-based machine learning framework that explores in detail the network formed by the residues of the FVIII protein, where each residue is a node, and two nodes are connected if they are in close proximity on the FVIII 3D structure.

Results: Using this system, we identified the properties that lead to severe and mild forms of the disease. Finally, in an effort to advance the development of novel recombinant therapeutic FVIII proteins, we adapted our framework to predict the activity and expression of more than 300 in vitro alanine mutations, once more observing a close agreement between the in silico and the in vitro results.

Discussion: Together, the results derived from this study demonstrate how graph-based classifiers can leverage the diagnostic and treatment of a rare disease.

1 Introduction

Blood coagulation is a vital process that stops the bleeding that ensues after a blood vessel is damaged. Injuries to the endothelial cell layer of blood vessels lead to the production of Tissue Factor Pathway Inhibitor (TFPI), which in turn starts a cascade of signals that activate and inhibit more than a dozen factors and lead to the assembly of a fibrin clot at the site of injury (Hoffbrand et al., 2016). Evidently, any mutations to the genes involved in this delicate system lead to the disruption of this essential process; for instance, patients harboring mutations on the SERPINC1 gene are prone to develop thrombosis [the excessive formation of blood clots (Hoffbrand et al., 2016)]. On the other hand, inherited or spontaneous mutations to the Coagulation factor 8 (F8) lead to hemophilia A (HA), a coagulation disorder that cause patients to have uncontrolled bleeding episodes.

This is an X-linked heritable disease affecting approximately 1 in every 5,000–10,000 live male births (Hoffbrand et al., 2016), and as a result, the blood coagulation cascade is impaired to different extents depending on the type of mutation on the F8 gene. Disease symptoms may vary from mild (clotting activity level 5%–40%, with only rare bleeding episodes), to moderate (clotting activity 1%–5%, more frequent episodes), and severe [clotting activity $<$ 1%, permanent bleeding risk and chronic joint damage (Lee et al., 2014)].

Although it is a relatively rare disorder, the coagulation pathway is well-characterized and treatment options are improving since the 1950s, evolving from blood-derived FVIII concentrates (Lee et al., 2014) to recombinant proteins (Peters and Harris, 2018), monoclonal antibodies (Kitazawa et al., 2012; Østergaard et al., 2021) and gene therapy (Nathwani, 2019). However, current treatment options still have major issues that have to be addressed (Lenting et al., 2017), for instance, it is of paramount importance to improve the half-life of recombinant FVIII proteins (currently ∼ 12–19 h), as well as its immunogenic profile to avoid the development of neutralizing antibodies, a condition affecting 30% of patients (Peters and Harris, 2018).

To this end, a deep understanding of the FVIII protein structure is essential. Using genetic information and protein structure properties, previous studies started to uncover aspects of single amino acid changes and their relation to severe or mild forms of HA (Doss, 2012). However, the lack of strict data curation and the lack of advanced statistical and machine learning methods hampered the mechanistic understanding and prediction of the effect of novel mutations on the FVIII protein.

In this study, to predict the degree of dysfunction that mutations cause in this protein, we used a graph representation of the FVIII that we established previously (Lopes et al., 2021a), and the mutation profile of 5,793 patients diagnosed with HA. We used this information and other structural and evolutionary properties of FVIII as input to 4 different graph-based neural network architectures (GNN), and found that this setup is highly efficient to learn the underlying properties of the FVIII architecture, and predict with good accuracy the effect of single-point non-synonymous mutations.

Moreover, aiming at creating recombinant FVIII proteins with improved half-life, immunogenic and folding profiles (Prezotti et al., 2022), we retrained these models to predict the coagulation activity of more than 300 alanine mutations. As a result, we found that the GNN models reliably predict the reduction in the activity of FVIII, effectively emulating in silico the results of costly and laborious in vitro assays.

In summary, this study builds on our previous efforts and demonstrates the feasibility of using GNNs to advance the understanding of a rare disease. We named this framework GNN-HemA and made it open-source, anticipating that the community will reproduce our findings and extend it to study diseases beyond hemophilia.

2 Results

2.1 Creation of the FVIII residue network

The FVIII protein has 2,332 amino acids and is composed of 5 domains (A1-A2-B-A3-C1-C2) (Childers et al., 2022). It circulates bound to the von Willebrand Factor (vWF), and after being activated via thrombin-mediated cleavage of some residues, it becomes detached from vWF, loses its B domain and changes into its activated form (FVIIIa) (Childers et al., 2022). As co-factor for the coagulation factor IXa, FVIIIa binds to the phospholipid membrane of activated platelets and enhances its activity more than 100,000 times (Lee et al., 2014). Together, they form the so-called tenase complex to activate the coagulation factor X (FX) into FXa. In turn, FXa converts prothrombin to thrombin, already close to the end of the coagulation cascade [i.e., the formation of a stable fibrin clot (Lee et al., 2014)].

In previous studies, we created a residue interaction network (RIN) of the FVIIIa protein, where each residue was represented by a node, and two nodes were connected by an edge if the residues were close to each other in the 3D structure (Figure 1A). This representation of the FVIIIa protein helped us quantify the importance of each of its residues, and understand how perturbations (i.e., mutations), lead to the loss of its function (Yan et al., 2014).

FIGURE 1

FIGURE 1. Design of the GNN-HemA. (A) From the pre-processed FVIII structure, we generated a residue network, obtained structural measures like solvent accessible area as well as a conservation score for each residue. This served as input for GNN classifiers, that were trained to predict the severity of 626 patients with HA, as well as the coagulation activity of more than 300 alanine mutant FVIII constructs Pellequer et al. (2011); Plantier et al. (2012). (B) In detail, the GNN algorithms’ training process starts by extracting sub-graphs from the residue network obtained from pre-processed the FVIII-RIN. Next, the sub-graphs are used to train a Graph Attention Network (GAT) with four attention heads. After computing the attention scores, GAT utilizes a Multilayer Perceptron (MLP) to classify the graph nodes according to the severity of hemophilia A or the coagulation activity of the FVIII alanine mutants.

In this study, to create a RIN, we used the FVIIIa structure predicted by AlphaFold2 (Jumper et al., 2021; Varadi et al., 2022), because it had a very good agreement with experimentally determined structures (Ngo et al., 2008; Shen et al., 2008; Smith et al., 2020), but in contrast to these models, the AlphaFold2 structure did not have large missing segments–an essential requirement to create a complete residue network.

We used the FVIIIa structure as input to RINerator (Doncheva et al., 2011). This program first adds hydrogen atoms to the structure, allowing it to identify non-covalent interactions between amino acids. Next, the non-covalent interactions are identified using a small probe (∼0.25 Å) rolled around the van der Waals surface of each residue, and a contact is defined if the probe touches two non-covalently bonded atoms (Word et al., 1999a; Word et al., 1999b). Finally, the interactions between residues are represented by edges, indicating that these residues are connected by a i) side-chain–side-chain, ii) side-chain–main-chain, iii) main-chain–main-chain hydrogen bond or non-covalent interaction between their atoms. In the FVIIIa RIN, the distance between the residues’ atoms was ∼5 Å (Supplementary Table S1 contains the complete network).

In mathematical terms, we modeled our data as a graph $G = (V, ξ)$ , such that $V$ is a set of residues and ξ is a set of edges that represents the connection between two nodes, i.e., there is a connection (u, v) ∈ ξ if two amino acids $u, v \in V$ are in close proximity on the FVIII 3D structure. Here, the graph contains only undirected edges. Therefore, by using GNN, it is possible to train graph-based models (f) representing the connections from the protein structures, thus describing better the attributes and relationships according to our class labels $(f : G \to Y)$ (e.g., the severity of hemophilia).

Finally, we used the 3D structure of FVIIIa to calculate the relative surface exposure of each residue, to produce a large multiple sequence alignment of FVIIIa and obtain a conservation score of each of its residues (Methods). In practical terms, these measures quantify from a structural and evolutionary perspective the importance of each residue of FVIIIa (Figure 1).

Taken together, in addition to the centrality measures that can be derived from the graph itself, we aimed at ranking the importance of each FVIIIa residue from different perspectives. The FVIIIa RIN and these additional measures compose the base for input to the GNN algorithms.

2.2 Predicting hemophilia a severity with graph-based machine learning classifiers

After generating the datasets that form the basis for GNN classifiers, we manually collected and pre-processed thousands of HA cases of patients harboring single-point non-synonymous mutations (Methods). After our stringent data sanitation, our dataset contained 626 HA cases (293 mild, 123 moderate and 210 severe), as well as the position and the amino acid substitution of each patient (Figure 2A). While patients with severe HA require prophylactic care that consists of intravenous injections 2-3 times per week, treatment for mild and moderate cases often require intravenous injections only when an injury occurs (Prezotti et al., 2022). For this reason, we grouped the mild/moderate cases into one class and severe into another (majority class ratio of 0.66). Mathematically, χ represents a dataset with the mutations harbored by hemophiliac patients with different severity levels $Y$ . Each residue x_i ∈ χ contains a set of attributes $A = {a_{1}, a_{2}, \dots, a_{n}}$ representing the properties of them which was substituted in x_i (namely, the structural and evolutionary features of each amino acid).

FIGURE 2

FIGURE 2. Predicting the severity of HA. (A) After careful data sanitation, our dataset had 626 unique cases of HA (Supplementary Table S2), caused by single-point, non-synonymous mutations. We merged the mild and moderate cases into a single class, reducing the problem to a 2-class classification. (B–C) Mutations at residues buried at the core of FVIII (i.e., low solvent accessible area), and conserved during evolution (i.e., low conservation score) result in severe HA, most likely due to the disruption the FVIII protein conformation. (D–E) Comparing different classifiers’ architectures, we obtained a classification accuracy of 0.69, and F1 value of 0.44, highlighting the difficulty associated to predicting the severity of HA from clinical data, but still useful to anticipate the effects of single-point mutations.

Before using machine learning classifiers (ML) to predict the severity of HA, we assessed whether the structural and evolutionary properties of the FVIII residues could distinguish between severe and mild/moderate phenotypes. We found that the solvent accessible area and the conservation of residues are powerful discriminators of HA severity (Figure 2B), as we observed in a previous study with different clinical cases (Lopes et al., 2021a). These results indicate that mutations to the most conserved and buried residues of FVIII lead to severe hemophilia, while substitutions of the residues close to the protein surface are associated to mild or moderate phenotypes.

Next, we used structural and evolutionary measures in conjunction with the FVIII-RIN for the GNN-based classification. The predicting model used in our GNN-HemA framework was implemented on top of the SHADOW-GNN (Decoupled GNN on a shallow subgraph) (Zeng et al., 2021), which is considered the state-of-the-art for implementing different GNN models. With the Shadow-GNN, we created a experimental setup using Graph Attention Networks (GAT) (Veličković et al., 2017) to model the FVIII protein.

We have trained four GAT models combining different numbers of layers (3 and 5) and sub-graph extractors (L-HOP and PPR). In GAT, layers refer to the repeated application of a particular computation on the graph’s nodes, which is used to learn node representations (Methods).

To assess our results, we designed our experiments by using a 6-fold cross-validation strategy. We split our dataset into 6 folds due to the small amount of available data, i.e., by using more folds (e.g., 10 folds as usual in ML tasks), a larger number of nodes were available for training, but only a few would remain for the validation stage.

Using this training regimen and comparing the performance of the different GNN architectures, we found that the best model GAT with 3-PPR predicted the severity of HA with accuracy of 0.7 and F1 value of 0.44 (Figures 2D, E), indicating that the GNN models are able to find with modest performance the characteristics distinguishing severe and mild/moderate HA phenotypes (Figure 2C). Compared to existing methods that attempt to predict harmful effects of mutations (e.g., Polyphen-2 and Provean (Adzhubei et al., 2013; Choi and Chan, 2015), the GNN-Hema produced equivalent results in all cases (Supplementary Figure S1).

In summary, the best prediction of HA severity was obtained using the Shadow-GAT, an attention-based architecture for classifying nodes in graph-structured data (Veličković et al., 2017). By using a self-attention strategy, Shadow-GAT computes hidden representations of each node. This attention architecture has several desirable features (Veličković et al., 2017), including the fact that it is efficient and parallelizable, can handle nodes of varying degrees by assigning arbitrary weights to neighbors, and is suitable for inductive learning [i.e., the tasks where the model must generalize to new, unseen graphs (Veličković et al., 2017)].

2.3 Predicting in vitro activity

After using the GNN-Hema to predict the severity of HA in patients, we wanted to assess the feasibility of using the same framework to predict the effect of targeted alanine mutations. For this purpose, we used the coagulation activity and the antigen levels of 344 alanine mutations on the A2 and the C2 domains of the FVIII protein (Pellequer et al., 2011; Plantier et al., 2012). The A2 domain is the most important domain of this protein, as it has binding sites for FIXa and for FX [the members of the tenase complex (Lee et al., 2014)]. Moreover, by itself, the A2 domain is able to enhance the activity of FIXa [albeit with lower efficiency compared to the full-protein (Fay and Koshibu, 1998; Fay et al., 1999)]. The C2 domain exerts multiple activities, including interaction with the membrane of platelets and binding to the von Willebrand Factor (Inaba et al., 2022). Hence, anticipating the effects of mutations in these domains can enhance the understanding of vital FVIII functions.

First, we divided our dataset into two-classes, namely, the mutations that retained a medium or high coagulation activity of FVIII, and the mutations that considerably disrupted its function (coagulation activity $> 50 %$ and $< 50 %$ of WT, respectively; Figure 3A; Supplementary Table S3). We verified that substitutions of residues buried at the core of FVIII and conserved during the course of evolution, often reduce dramatically the coagulation activity of the recombinant proteins (Lopes et al., 2021b; Lopes et al., 2022; Figures 3B, C). Next, we used this dataset as input to the GNN-Hema, together with the FVIIIa-RIN and the structural and evolutionary measures of all residues, and observed that the GNN could classify the alanine mutations in the A2 and the C2 domains with accuracy of 69% and F1 of 0.61 (Figures 3D, E). While these results do reach the threshold necessary for clinical diagnosis, in practice they help reduce the number of candidates designed to generate recombinant FVIII proteins.

FIGURE 3

FIGURE 3. Predicting the reduction of coagulation activity in alanine mutants. (A) We considered 344 alanine mutations to the A2 and the C2 domains of FVIII. We divided these mutations into two groups, namely, those that retained at least 50% of the coagulation activity of the WT, and those below this threshold, measured by a chromogenic assay (Pellequer et al., 2011; Plantier et al., 2012) (Supplementary Table S3). (B–C) As it happens with clinical cases, the targeted mutations at the core hydrophobic residues and to those that are highly conserved, impair the co-factor activity of FVIII (Lopes et al., 2021b). (D–E) The GAT 3 HOP architecture presented the best predictive power, with an accuracy of 0.7 and F1 value of 0.61, indicating that this GNN model can be used to simulate in silico the effect of targeted alanine mutations to FVIII.

Next, we aimed to predict the antigen levels of the 333 alanine mutant proteins (Plantier et al. (2012); Pellequer et al. (2011); Figure 4A; Supplementary Table S4). The antigen was measured by “sandwich” ELISA. This assay was used to evaluate the effectiveness of expressing and secreting FVIII mutant constructs. The ELISA assay is also known as an antigen assay because it measures both functional and nonfunctional FVIII proteins by measuring the amount of FVIII antigen (protein) that is immobilized on the ELISA plate. Once more, we observed that substitutions of the most buried and conserved residues of the A2 and C2 domains lead to a major reduction of the antigen rescue levels of the mutants [Lopes et al. (2021b; 2022)] (Figures 4B, C). We found that the GAT-5 PPR architecture successfully distinguished mutants that retained high antigen levels ( $> 50 %$ of WT), against those that displayed low antigen levels ( $< 50 %$ of WT; Figures 4D, E). These results indicate that the GNN-Hema reliably identify mutants with unusual conformation, and as the shape defines the function in the protein world, these predictions can be used to discard unpromising candidate molecules.

FIGURE 4

FIGURE 4. Predicting the reduction of coagulation activity in alanine mutants. (A) We considered 333 targeted alanine mutations to the A2 and C2 domains of FVIII. We divided these mutations into two groups, namely, those that retained at least 50% of the coagulation activity of the WT, and those below this threshold, measured by an ELISA (antigen) assay (Pellequer et al., 2011; Plantier et al., 2012) (Supplementary Table S4). (B–C) As expected, substitutions of the residues located at the core of these domains, as well as the most conserved ones, result in poor rescue of recombinant proteins by ELISA, suggesting that these mutations affected to a higher extent the correct folding and expression of FVIII. (D–E) The classification evaluation emphasizes GAT 5 PPR and GAT 3 PPR presented the best accuracy and F1 results, respectively.

Together, these results indicate that using GNN-based classifiers is a viable approach to emulate in silico, the mulecular perturbations that could only be obtained by in vitro experiments.

3 Discussion

In this study, we established a computational pipeline to anticipate the effects of mutations in the FVIII protein. We found that representing its structure as an undirected graph, and using it as input for graph-based neural networks is viable to predict the severity of hemophilia A in patients harboring non-synonymous mutations. Moreover, the same classifiers were retrained to predict the loss-of-function of more than 300 targeted alanine mutations (Pellequer et al., 2011; Plantier et al., 2012), establishing an helpful resource for the rational design of recombinant therapeutic FVIII proteins.

The so-called protein residue networks are well-studied representations that enable researchers to ellucidate the underlying 3D biophysical and biological properties (Yan et al., 2014). For instance, there is a close relationship between the centrality of nodes in a network, and the level of disruption to the protein function caused by mutations [i.e., substitutions of the most central nodes lead to a complete loss-of-function (Amitai et al., 2004)]. Additionally, protein networks have helped to ellucidate the organization of amino acids into modules, maintaining the correct positioning of binding sites (del Sol et al., 2006). In our case, we leveraged on this knowledge to created residue networks specifically aimed at studying the effect of mutations related to hemophilia A (Lopes et al., 2021b), hemophilia B (Lopes et al., 2022), and to thrombosis (Lopes et al., 2023). While we obtained good results in those studies, we used only general-purpose ML algorithms suitable for tabular data.

Here, we introduced the use of GNNs–a more close representation that learn directly from a graph structure, without having to first calculate centrality measures and convert them to tabular data. In general, the execution of GNNs depends on encoder-decoder functions to represent the graph as node embeddings, which is processed by using Neural Message Passing (NMP). Each message-passing iteration performed during the training phase, new knowledge from node embeddings are updated according to information aggregated from their neighborhoods (Zeng et al., 2021). This approach displayed positive results when predicting new edges and node importance (Hamilton, 2020). However, the fundamental difference between these applications and the present study is the size of the datasets used.

While previous studies used graphs of millions of nodes and edges, our hemophilia datasets had only a few hundred cases–as is often the case when researching rare diseases. After comparing several GNN architectures and training regimens, we observed that it is possible to predict with reasonable certainty the effect of substitutions of the FVIIIa residues. This compares well with our previous studies (Lopes et al., 2021b), and surpassed existing alternatives (Adzhubei et al., 2013; Choi and Chan, 2015) (Supplementary Figure S1). As others also observed, predicting the effect of harmful or benign mutations is a difficult problem in the structural biology field (Broom et al., 2020), but there are high hopes placed on strategies based on deep-learning (Akdel et al., 2022).

In particular for the study of hemophilia, we are aware of the factors that hinder a more favorable prediction of mutation effects. First, there are known inconsistencies in the diagnosis of patients due to difficulties in standardizing reagents, discrepancies between one- and two-stages assays (Potgieter et al., 2015), and the reported diagnosis and what is observed in terms of bleeding frequencies (Inaba et al., 2022). Moreover, albeit the GNN models used here are the state-of-the-art algorithms (Zeng et al., 2021), they were not designed for small datasets; this requires its underlying architecture to be modified, varying the number of layers to properly extract implicit information from the FVIII proteins. Moreover, we have fine-tuned the hyperparameters to adjust the final model to our data, thus reaching the best performance in classifying the hemophilia severity (Methods). Yet, we are confident that with sequencing technology becoming widely available, and a vibrant community continuously improving GNN algorithms, the field is headed for accurate and personalized diagnostics.

In conclusion, the GNN-Hema is to our knowledge, the first application of graph-based classifiers to predict the effect of mutations to the FVIII protein–an application urgently required for diagnosis and for the generation of superior recombinant proteins. We implemented GNN-Hema as an open-source application, anticipating that the research community will extend and repurpose it to study other diseases.

4 Materials and methods

4.1 Creation of the RIN

We downloaded the FVIII structure generated by AlphaFold 2 (Jumper et al., 2021; Varadi et al., 2022), and removed the residues of the initial signal peptide, and the a1, a2 and B domain regions (residues −19 to −1, 336-372, 711-740, 741-1,689, respectively, in the legacy numbering system), because they had low modeling quality (pLDDT). Hence, our FVIIIa structure started at the residues Ala-Thr-Arg.

We used the Rosetta software suite release 280 and the ref2015 score function (Leaver-Fay et al., 2011) to find the most appropriate rotamer conformation of all residues, in a way to minimize the overall free-energy of the structure. We used the parameters -ignore_unrecognized_res -relax:constrain_relax_to_start_coords -relax:coord_constrain_sidechains -relax:ramp_constraints false -ex1 -ex2 -use_input_sc and generated 100 structures as output, from which we selected the one with the lowest energy score.

We used this structure as input to RINerator version 0.5.1 (Doncheva et al., 2011), which rely on the Probe and Reduce programs (Word et al., 1999a; Word et al., 1999b). In the first step, it adds hydrogen atoms to the structure, which is essential to identify non-covalent interactions between amino acids, and second, it identifies the non-covalent interactions using a small probe (approximately 0.25 Å), rolled around the van der Waals surface of each amino acid, and a contact is established if the probe is simultaneously in contact with two non-covalently bonded atoms.

We considered that two residues interacted if there was at least one edge between them, independent of the edge type. To analyze the FVIII-RIN, we used R version 3.6.3 (R Core Team, 2022) and the iGraph package, version 1.2.5 (Csardi and Nepusz, 2006). With the iGraph package, we used the function simplify to remove redundant edges and self-interactions. We visualized the networks using Cytoscape version 3.8.2 (Shannon et al., 2003).

4.2 Structural and evolutionary measures of FVIII

We used Chimera version 1.14 (Pettersen et al., 2004) to extract the solvent-excluded area (areaSES) and the solvent-accessible surface area (areaSAS), and to calculate the relative surface exposure of all amino acids from the customized FVIIIa structure. We divided the solvent-excluded area of the residue by the surface area of the same type of residue in a reference state; in our case, we used the reference values of the 20 standard amino acids in Gly-X-Gly tripeptides (Bendell et al., 2014). Moreover, we obtained the conservation score from the ConsurfDB webserver (Ben Chorin et al., 2020), using the FVIII protein structure as input for the search query.

4.3 Genetic data and mutations datasets

For the training and prediction of the severity of HA, we manually searched the EAHAD and the CHAMP FVIII mutation databases (McVey et al., 2020) (https://www.cdc.gov/ncbddd/hemophilia/champs.html; visited in 19 April 2022), and searched for single-point, non-synonymous mutations. We remove conflicting instances, such as those reported with multiple phenotypes at the same time (e.g., “Mild/Moderate”), or with mismatches between the residue position and the actual amino acid, as well as those that introduced a stop codon. Moreover, if there were multiple phenotypes reported for the mutations at the same position, we kept those that could be disambiguated by majority voting. Our final dataset had 626 instances (293 mild, 123 moderate, 210 severe). For the residue network used as input to the GNNs to predict the HA severity, we selected only the edge with the highest score between two residues, independent of the edge type (e.g., main-chain - main chain, or side-chain - side-chain). Next, we normalized the weights of all selected edges to the interval [0,1], and used the areaSAS and the conservation of each residue of the network in conjunction as input to the GNNs.

For the training and prediction of the coagulation activity and the antigen levels of the FVIII alanine mutants (Pellequer et al., 2011; Plantier et al., 2012). We divided the dataset into two classes ( $> 50 %$ percent of WT, and $< 50 %$ percent of WT). For the residue network used as input to the GNNs, we selected only the edge with the highest score between two residues, independent of the edge type (e.g., main-chain - main chain, or side-chain - side-chain), and normalized the weights of all selected edges to the interval [0,1], but reversed it, so that the most meaningful edges had a higher score. We used the relative surface exposure and the conservation of each residue of the network in conjunction as input to the GNNs.

4.4 The GNN architecture

As previously mentioned, the GNN models used to learn from the FVIII protein structure were trained using shadow-GNN (Decoupled GNN on a shallow subgraph) (Zeng et al., 2021) by using the steps summarized in Figure 1B.

The next step is responsible for extracting sub-graphs from the protein structure. The SHADOW implementation contains two extractors: i) L-HOP, which retrieves an entire or a random subset of the target node’s L-HOP neighbors; and ii) PPR, which uses the Personalized PageRank (PPR) algorithm to compute the scores of other nodes relative to the target node, then selects the top K nodes with the highest scores. In our experiments, we define the hyperparameter space for L-HOP extractor as Depth (L = 2), Budget (b = 20); and for PPR extractor as: Budget (b = 150), with thresholding (epsilon = 1e − 5).

In the subsequent phase, the outputs obtained from the extractors were utilized to optimize the parameters of our GNN. Specifically, in this study, we have trained a Graph Attention Network (GAT) architecture with four attention heads. In essence, attention heads compute the importance of different interactions (e.g., node-node), keeping the focus on the most relevant information in the graph. As output in our case, attention heads provide scores to weigh the contribution of nodes to the final representation of the graph.

Following the computation of the attention scores, GAT utilizes a Multilayer Perceptron (MLP) network to classify the nodes in the graph. In the present study, the MLP network is implemented with a hidden dimension of 256, a dropout rate of 0.35, random subset aggregation (drop edge) of 0.1, a learning rate of 1e − 3 and a batch size of 128. These specific hyperparameter settings were chosen based on the results obtained from our experimental evaluations, and were selected to optimize the performance of the GNN model. Next, we used the Relu activation function $(f (u) = \max \{u, 0\})$ to process a given MLP output u and provide the estimated target. Finally, it is worth mentioning that all hyperparameters were set in an experimental setup using the vanilla version of Shadow-GNN.

5 Statistical analysis

The validation metrics used to assess the GNN model were the accuracy and the F1, as usually considered in the literature (Veličković et al., 2017; Zeng et al., 2021). Both metrics were calculated from the contingency tables produced by the validation process, considering the number of True Positive (TP), True Negative (TN), False Postive (FP), and False Negative (FN). The accuracy was calculated using Eq. 1, in which n represents the total number of nodes used to validate the final models. The F1 measure was calculated using Eq. 2, such that Eqs 3, 4 compute the recall and precision measures, respectively.

a c c = \frac{T P + T N}{n} (1)

F 1 = \frac{2 \cdot (R e c a l l \cdot P r e c i s i o n)}{R e c a l l + P r e c i s i o n} (2)

R e c a l l = \frac{T P}{T P + F N} (3)

P r e c i s i o n = \frac{T P}{T P + F P} (4)

F1 complements the accuracy and majority ratio by combining information from precision and recall measures. Precision estimates the fraction of correctly classified nodes among the ones classified as positive, while recall is the fraction of total positive nodes indeed classified as positive (Fernández et al., 2018).

The statistical tests were performed using the R statistical package version 4.2.2 (R Core Team, 2022).

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author. The GNN-HemA source code and datasets developed in this study are available at https://github.com/LabIA-UFBA/GNN-HemA.

Author contributions

TL, TN, and RR conceptualized the study. TL organized the clinical and the alanine mutation data. MF, TN, RR, and TL implemented the algorithms and performed the analyses. TL interpreted the results. MF, TN, RR, and TL wrote the manuscript.

Funding

TL was supported by the Council for Science, Technology and Innovation (CSTI), Cross-ministerial Strategic Innovation Promotion Program (SIP), “Innovative AI Hospital System,” the National Institute of Biomedical Innovation, Health and Nutrition (NIBIOHN) [grant number SIPAIH20D01], JSPS KAKENHI [JP22K06119] and the National Center for Child Health and Development internal grant [2022B-2]. RR was supported by Google Research Awards for Latin America 2021. RR and TN were supported by a grant from the Terumo Life Science Foundation, CAPES (Coordination for the Improvement of Higher Education Personnel—Brazilian federal government agency), CNPq (Brazilian National Council for Scientific and Technological Development) and FAPESP (Center of Mathematical Sciences Applied to Industry, CEPID-CeMEAI) [2013/07375-0].

Conflict of interest

TL received consulting fees from Pola Chemical Industries, Yokohama, Japan for projects unrelated to the current study, and speaker honoraria from Sanofi Japan.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbinf.2023.1152039/full#supplementary-material

References

Adzhubei, I., Jordan, D. M., and Sunyaev, S. R. (2013). Predicting functional effect of human missense mutations using polyphen-2. Curr. Protoc. Hum. Genet. 76, Unit7.20. doi:10.1002/0471142905.hg0720s76

PubMed Abstract | CrossRef Full Text | Google Scholar

Akdel, M., Pires, D. E., Pardo, E. P., Jänes, J., Zalevsky, A. O., Mészáros, B., et al. (2022). A structural biology community assessment of alphafold2 applications. Nat. Struct. Mol. Biol. 29, 1056–1067. doi:10.1038/s41594-022-00849-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Amitai, G., Shemesh, A., Sitbon, E., Shklar, M., Netanely, D., Venger, I., et al. (2004). Network analysis of protein structures identifies functional residues. J. Mol. Biol. 344, 1135–1146. doi:10.1016/j.jmb.2004.10.055

PubMed Abstract | CrossRef Full Text | Google Scholar

Ben Chorin, A., Masrati, G., Kessel, A., Narunsky, A., Sprinzak, J., Lahav, S., et al. (2020). Consurf-db: An accessible repository for the evolutionary conservation patterns of the majority of pdb proteins. Protein Sci. 29, 258–267. doi:10.1002/pro.3779

PubMed Abstract | CrossRef Full Text | Google Scholar

Bendell, C. J., Liu, S., Aumentado-Armstrong, T., Istrate, B., Cernek, P. T., Khan, S., et al. (2014). Transient protein-protein interface prediction: Datasets, features, algorithms, and the rad-t predictor. BMC Bioinforma. 15, 82–12. doi:10.1186/1471-2105-15-82

PubMed Abstract | CrossRef Full Text | Google Scholar

Broom, A., Trainor, K., Jacobi, Z., and Meiering, E. M. (2020). Computational modeling of protein stability: Quantitative analysis reveals solutions to pervasive problems. Structure 28, 717–726.e3. doi:10.1016/j.str.2020.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Childers, K. C., Peters, S. C., and Spiegel, P. C. (2022). Structural insights into blood coagulation factor viii: Procoagulant complexes, membrane binding, and antibody inhibition. J. Thromb. Haemostasis 20, 1957–1970. doi:10.1111/jth.15793

CrossRef Full Text | Google Scholar

Choi, Y., and Chan, A. P. (2015). Provean web server: A tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics 31, 2745–2747. doi:10.1093/bioinformatics/btv195

PubMed Abstract | CrossRef Full Text | Google Scholar

Csardi, G., and Nepusz, T. (2006). The igraph software package for complex network research. InterJournal, Complex Syst. 1695, 1–9.

Google Scholar

del Sol, A., Fujihashi, H., Amoros, D., and Nussinov, R. (2006). Residue centrality, functionally important residues, and active site shape: Analysis of enzyme and non-enzyme families. Protein Sci. 15, 2120–2128. doi:10.1110/ps.062249106

PubMed Abstract | CrossRef Full Text | Google Scholar

Doncheva, N. T., Klein, K., Domingues, F. S., and Albrecht, M. (2011). Analyzing and visualizing residue networks of protein structures. Trends Biochem. Sci. 36, 179–182. doi:10.1016/j.tibs.2011.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Doss, C. G. (2012). In silico profiling of deleterious amino acid substitutions of potential pathological importance in haemophlia a and haemophlia b. J. Biomed. Sci. 19, 30. doi:10.1186/1423-0127-19-30

PubMed Abstract | CrossRef Full Text | Google Scholar

Fay, P. J., and Koshibu, K. (1998). The a2 subunit of factor viiia modulates the active site of factor ixa. J. Biol. Chem. 273, 19049–19054. doi:10.1074/jbc.273.30.19049

PubMed Abstract | CrossRef Full Text | Google Scholar

Fay, P. J., Koshibu, K., and Mastri, M. (1999). The a1 and a2 subunits of factor viiia synergistically stimulate factor ixa catalytic activity. J. Biol. Chem. 274, 15401–15406. doi:10.1074/jbc.274.22.15401

PubMed Abstract | CrossRef Full Text | Google Scholar

Fernández, A., García, S., Galar, M., Prati, R. C., Krawczyk, B., and Herrera, F. (2018). Learning from imbalanced data sets, 10. Springer.

Google Scholar

Hamilton, W. L. (2020). Graph representation learning. Synth. Lect. Artifical Intell. Mach. Learn. 14, 1–159. doi:10.2200/s01045ed1v01y202009aim046

CrossRef Full Text | Google Scholar

Hoffbrand, A. V., Higgs, D. R., Keeling, D. M., and Mehta, A. B. (2016). Postgraduate haematology. 7th edn. John Wiley & Sons.

Google Scholar

Inaba, H., Nishikawa, S., Shinozawa, K., Shinohara, S., Nakazawa, F., Amano, K., et al. (2022). Coagulation assay discrepancies in Japanese patients with non-severe hemophilia a. Int. J. Hematol. 115, 173–187. doi:10.1007/s12185-021-03256-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with alphafold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kitazawa, T., Igawa, T., Sampei, Z., Muto, A., Kojima, T., Soeda, T., et al. (2012). A bispecific antibody to factors ixa and x restores factor viii hemostatic activity in a hemophilia a model. Nat. Med. 18, 1570–1574. doi:10.1038/nm.2942

PubMed Abstract | CrossRef Full Text | Google Scholar

Leaver-Fay, A., Tyka, M., Lewis, S. M., Lange, O. F., Thompson, J., Jacak, R., et al. (2011). “Rosetta3: An object-oriented software suite for the simulation and design of macromolecules,” in Methods in enzymology (Elsevier), 487, 545–574.

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, C. A., Berntorp, E. E., and Hoots, W. K. (2014). Textbook of hemophilia. 3rd edn. John Wiley & Sons.

Google Scholar

Lenting, P. J., Denis, C. V., and Christophe, O. D. (2017). Emicizumab, a bispecific antibody recognizing coagulation factors ix and x: How does it actually compare to factor viii? Blood, J. Am. Soc. Hematol. 130, 2463–2468. doi:10.1182/blood-2017-08-801662

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopes, T. J., Rios, R., Nogueira, T., and Mello, R. F. (2021a). Prediction of hemophilia a severity using a small-input machine-learning framework. NPJ Syst. Biol. Appl. 7, 22–28. doi:10.1038/s41540-021-00183-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopes, T. J., Rios, R., Nogueira, T., and Mello, R. F. (2021b). Protein residue network analysis reveals fundamental properties of the human coagulation factor viii. Sci. Rep. 11, 12625–12711. doi:10.1038/s41598-021-92201-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Lopes, T. J., Nogueira, T., and Rios, R. (2022). A machine learning framework predicts the clinical severity of hemophilia b caused by point-mutations. Front. Bioinforma. 2, 912112. doi:10.3389/fbinf.2022.912112

CrossRef Full Text | Google Scholar

Lopes, T. J., Rios, R. A., Rios, T. N., Alencar, B. M., Ferreira, M. V., and Morishita, E. (2023). Computational analyses reveal fundamental properties of the at structure related to thrombosis. Bioinforma. Adv. 3, vbac098. doi:10.1093/bioadv/vbac098

CrossRef Full Text | Google Scholar

McVey, J. H., Rallapalli, P. M., Kemball-Cook, G., Hampshire, D. J., Giansily-Blaizot, M., Gomez, K., et al. (2020). The European association for haemophilia and allied disorders (eahad) coagulation factor variant databases: Important resources for haemostasis clinicians and researchers. Haemophilia 26, 306–313. doi:10.1111/hae.13947

PubMed Abstract | CrossRef Full Text | Google Scholar

Nathwani, A. C. (2019). “Gene therapy for hemophilia,” in Hematology 2014, the American society of hematology education program book 2019, 1–8.

PubMed Abstract | CrossRef Full Text | Google Scholar

Ngo, J. C. K., Huang, M., Roth, D. A., Furie, B. C., and Furie, B. (2008). Crystal structure of human factor viii: Implications for the formation of the factor ixa-factor viiia complex. Structure 16, 597–606. doi:10.1016/j.str.2008.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Østergaard, H., Lund, J., Greisen, P. J., Kjellev, S., Henriksen, A., Lorenzen, N., et al. (2021). A factor viiia–mimetic bispecific antibody, mim8, ameliorates bleeding upon severe vascular challenge in hemophilia a mice. Blood 138, 1258–1268. doi:10.1182/blood.2020010331

PubMed Abstract | CrossRef Full Text | Google Scholar

Pellequer, J.-L., Shu-wen, W. C., Saboulard, D., Delcourt, M., Négrier, C., and Plantier, J.-L. (2011). Functional mapping of factor viii c2 domain. Thromb. haemostasis 106, 121–131. doi:10.1160/th10-09-0572

PubMed Abstract | CrossRef Full Text | Google Scholar

Peters, R., and Harris, T. (2018). Advances and innovations in haemophilia treatment. Nat. Rev. Drug Discov. 17, 493–508. doi:10.1038/nrd.2018.70

PubMed Abstract | CrossRef Full Text | Google Scholar

Pettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C., et al. (2004). UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612. doi:10.1002/jcc.20084

PubMed Abstract | CrossRef Full Text | Google Scholar

Plantier, J.-L., Saboulard, D., Pellequer, J.-L., Négrier, C., and Delcourt, M. (2012). Functional mapping of the a2 domain from human factor viii. Thromb. haemostasis 107, 315–327. doi:10.1160/th11-07-0492

PubMed Abstract | CrossRef Full Text | Google Scholar

Potgieter, J. J., Damgaard, M., and Hillarp, A. (2015). One-stage vs. chromogenic assays in haemophilia a. Eur. J. Haematol. 94, 38–44. doi:10.1111/ejh.12500

PubMed Abstract | CrossRef Full Text | Google Scholar

Prezotti, A. N. L., Frade-Guanaes, J. O., Yamaguti-Hayakawa, G. G., and Ozelo, M. C. (2022). Immunogenicity of current and new therapies for hemophilia a. Pharmaceuticals 15, 911. doi:10.3390/ph15080911

PubMed Abstract | CrossRef Full Text | Google Scholar

R Core Team (2022). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.

Google Scholar

Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi:10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

Shen, B. W., Spiegel, P. C., Chang, C.-H., Huh, J.-W., Lee, J.-S., Kim, J., et al. (2008). The tertiary structure and domain organization of coagulation factor viii. Blood, J. Am. Soc. Hematol. 111, 1240–1247. doi:10.1182/blood-2007-08-109918

PubMed Abstract | CrossRef Full Text | Google Scholar

Smith, I. W., d’Aquino, A. E., Coyle, C. W., Fedanov, A., Parker, E. T., Denning, G., et al. (2020). The 3.2 å structure of a bioengineered variant of blood coagulation factor viii indicates two conformations of the c2 domain. J. Thromb. Haemostasis 18, 57–69. doi:10.1111/jth.14621

PubMed Abstract | CrossRef Full Text | Google Scholar

Varadi, M., Anyango, S., Deshpande, M., Nair, S., Natassia, C., Yordanova, G., et al. (2022). Alphafold protein structure database: Massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic acids Res. 50, D439–D444. doi:10.1093/nar/gkab1061

PubMed Abstract | CrossRef Full Text | Google Scholar

Veličković, P., Cucurull, G., Casanova, A., Romero, A., Lio, P., and Bengio, Y. (2017). Graph attention networks. arXiv preprint arXiv:1710.10903.

Google Scholar

Word, J. M., Lovell, S. C., LaBean, T. H., Taylor, H. C., Zalis, M. E., Presley, B. K., et al. (1999a). Visualizing and quantifying molecular goodness-of-fit: Small-probe contact dots with explicit hydrogen atoms. J. Mol. Biol. 285, 1711–1733. doi:10.1006/jmbi.1998.2400

PubMed Abstract | CrossRef Full Text | Google Scholar

Word, J. M., Lovell, S. C., Richardson, J. S., and Richardson, D. C. (1999b). Asparagine and glutamine: Using hydrogen atom contacts in the choice of side-chain amide orientation. J. Mol. Biol. 285, 1735–1747. doi:10.1006/jmbi.1998.2401

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, W., Zhou, J., Sun, M., Chen, J., Hu, G., and Shen, B. (2014). The construction of an amino acid network for understanding protein structure and function. Amino acids 46, 1419–1439. doi:10.1007/s00726-014-1710-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Zeng, H., Zhang, M., Xia, Y., Srivastava, A., Malevich, A., Kannan, R., et al. (2021). Decoupling the depth and scope of graph neural networks. Adv. Neural Inf. Process. Syst. 34, 19665–19679.

Google Scholar

Keywords: protein structure, machine learning, bioinformatics, residue network, FVII, FVIIIa, graph neural network

Citation: Ferreira MV, Nogueira T, Rios RA and Lopes TJS (2023) A graph-based machine learning framework identifies critical properties of FVIII that lead to hemophilia A. Front. Bioinform. 3:1152039. doi: 10.3389/fbinf.2023.1152039

Received: 27 January 2023; Accepted: 10 April 2023;
Published: 10 May 2023.

Edited by:

Pietro Hiram Guzzi, Magna Græcia University, Italy

Reviewed by:

Federica Chiappori, National Research Council (CNR), Italy
Qin Xu, Shanghai Jiao Tong University, China

Copyright © 2023 Ferreira, Nogueira, Rios and Lopes. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tiago J. S. Lopes, dGlhZ28tam9zZUBuY2NoZC5nby5qcA==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

A graph-based machine learning framework identifies critical properties of FVIII that lead to hemophilia A

1 Introduction

2 Results

2.1 Creation of the FVIII residue network

2.2 Predicting hemophilia a severity with graph-based machine learning classifiers

2.3 Predicting in vitro activity

3 Discussion

4 Materials and methods

4.1 Creation of the RIN

4.2 Structural and evolutionary measures of FVIII

4.3 Genetic data and mutations datasets

4.4 The GNN architecture

5 Statistical analysis

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good