- 1Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, United States
- 2Center of Structural Biology, Vanderbilt University, Nashville, TN, United States
- 3Mass Spectrometry Research Center, Department of Biochemistry, Vanderbilt University, Nashville, TN, United States
- 4Department of Chemistry, Vanderbilt University, Nashville, TN, United States
- 5Institute for Drug Discovery, University Leipzig Medical School, Leipzig, Germany
Although computational structure prediction has had great successes in recent years, it regularly fails to predict the interactions of large protein complexes with residue-level accuracy, or even the correct orientation of the protein partners. The performance of computational docking can be notably enhanced by incorporating experimental data from structural biology techniques. A rapid method to probe protein-protein interactions is hydrogen-deuterium exchange mass spectrometry (HDX-MS). HDX-MS has been increasingly used for epitope-mapping of antibodies (Abs) to their respective antigens (Ags) in the past few years. In this paper, we review the current state of HDX-MS in studying protein interactions, specifically Ab-Ag interactions, and how it has been used to inform computational structure prediction calculations. Particularly, we address the limitations of HDX-MS in epitope mapping and techniques and protocols applied to overcome these barriers. Furthermore, we explore computational methods that leverage HDX-MS to aid structure prediction, including the computational simulation of HDX-MS data and the combination of HDX-MS and protein docking. We point out challenges in interpreting and incorporating HDX-MS data into Ab-Ag complex docking and highlight the opportunities they provide to build towards a more optimized hybrid method, allowing for more reliable, high throughput epitope identification.
Introduction
The structural characterization of Ab-Ag interactions has become increasingly important throughout the last decade. The affinity and specificity of monoclonal Abs make them very effective medical therapies and the top growing drug classes in the past few years (1–5). As a result, there is a rising need for enhanced bioanalytical methods to characterize the structural basis of Ab-Ag recognition. Correct identification of critical residues for Ab engineering informs Ab engineering to improve its affinity and specificity or binding breadth for therapeutic applications and Ag design to produce prophylactic or therapeutic vaccines for disease prevention (6–9).
Methods that can rapidly map epitopes play a crucial role in guiding early-stage Ab development and provide critical information for early decision making and Ab selection. Nevertheless, this task remains challenging (10). Existing epitope mapping methods, such as electron microscopy (EM), X-ray crystallography, bio-layer interferometry, and site-directed mutagenesis can be expensive, have limited-throughput, and still might not capture the binding interface of Ab-Ag complexes (7). Some of these methods only recognize linear epitopes, while 80-90% of Ab-Ag interactions have at least one conformational epitope (11, 12).
Computationally, Ab-Ag complexes can be predicted using protein-protein docking. The challenges that come with accurate prediction of Ab-Ag complexes are two-fold: the sampling problem where too many potential binding interfaces need to be explored, and the scoring problem where the native model must be accurately ranked and chosen from a big pool of generated outputs. The integration of experimentally derived information at the sampling and scoring stage of docking has been shown to yield more accurate models by focusing the potential sampling conformations and distinguishing correct predicted models from the rest (13, 14).
HDX-MS is a powerful technique used increasingly to study protein-protein interactions. It can be combined with computational protein modeling protocols to produce high-resolution near-native models. In this review, we will discuss how HDX-MS has been applied to map Ab-Ag interactions and delineate the limitations of the HDX-MS experiment and data interpretation. We will emphasize the limitations that computation has been unable to resolve fully, including the simulation of HDX-MS data from protein atomic structure as well as the lack of quantitative restraints, inaccurate data interpretation, and non-optimized scoring functions for the use of HDX-MS data during protein-protein docking. Our specific interest lies in employing HDX-guided computational structure prediction to model Ab-Ag complexes.
HDX-MS Overview: Fundamentals, Applications, and Advantages as an Epitope Mapping Tool
HDX-MS measures the change in mass of a protein after its exposure to deuterated solvent, resulting from the isotopic exchange between protein backbone amide hydrogens and deuterium (15–18). To measure deuterium exchange for specific peptides, the protein is labeled in the presence of D2O and fragmented by proteolysis. The backbone amide hydrogens free of hydrogen bond and/or exposed to the solvent typically exchange more rapidly compared to those participating in stable hydrogen bond or shielded from the solvent. The rate and location of deuterium incorporation provide fundamental biophysical information about protein folding pathways, localized structural conformation of proteins, and protein interactions.
While standard biophysical methods such as X-ray crystallography provide static models of a protein, HDX-MS offers valuable information about protein dynamics in addition to its structure. HDX-MS is applicable to numerous study areas: structural-function relationships, membrane proteins, protein folding and refolding, impacts of post-translational modifications, sequence change, or denaturation, and protein binding analysis (19, 20). More specifically, HDX-MS has been employed to examine protein conformations, to study protein intrinsically disordered regions (21), to study conformational dynamics after modification (i.e., compare the dynamic change of H7 HA0 trimer and cleaved HA trimer) (22), and to monitor protein folding pathways and quality control [i.e., biopharmaceutical comparability studies detected subtle conformational differences between protein samples (20) or detected potential aggregation interfaces (23)]. HDX-MS is also performed to locate protein-ligand binding sites as well as protein-protein interactions (18, 24).
HDX-MS has been increasingly applied as an epitope mapping tool due to its many advantages (25, 26). The biggest advantage of HDX-MS is that it is fast and cost-effective, thus achieving a relatively high throughput. With the development of HDX-MS data processing software and a routine workflow, an epitope mapping study can be completed within one week in ideal situations (27). Its rapid turnaround time can be beneficial in urgent public health situations when Ab profiling is necessary for Ab therapeutic development (28). In addition, HDX-MS requires minimal amounts of protein material (μl of the sample at low μM concentrations), which renders it applicable for the characterization of large panels of Ab-Ag complexes (19, 29). Furthermore, HDX-MS is unrestricted by the size of the proteins because they are cleaved into peptic peptides prior to deuterium uptake analysis. In fact, HDX-MS is one of few technologies that can be employed to study the local conformational dynamics of hemagglutinin, a major surface protein of influenza viruses in solution, because these complexes are very large (25). Generally, de novo structure determination by nuclear magnetic resonance (NMR) is challenging for proteins larger than 50 kDa. Hence, Ab-Ag complexes are not good targets for traditional NMR studies because most of them easily exceed 50kDa (30). All these advantages make HDX-MS a very potent and rapid method to study Ab-Ag complexes, as well as proteins that are unamenable to other structural methods due to their intrinsic nature.
HDX-MS Experimental Approach and Data Processing for Ab-Ag Structure Prediction
HDX-MS has been used successfully in the past to map Ab epitopes to Ag surfaces (e.g., 14, 20, 22, 28, 31–36). A typical HDX-MS epitope mapping experiment (Figure 1) is performed on the Ag alone as a reference and then in complex with the Ab. The protein samples are labeled in D2O buffer under equilibrium conditions for each of several different time points. The binding of an Ab reduces the solvent exposure of Ag residues residing in the binding interface. This lowers the level of deuterium incorporation into peptic fragments containing the affected residues. Therefore, comparing HDX-MS profiles between Ab-free and Ab-bound states enables the identification of potential epitope peptides. Optionally, another set of labeling runs on the Ab by itself is added for paratope mapping (Figure 1).
Figure 1 Overview of a typical workflow for an epitope/paratope mapping HDX-MS experiment. Separately, the Ag, Ab-Ag complex, and Ab are labeled in D2O and incubated for varying lengths of time. The reactions are then quenched at low pH and low temperature. The protein samples are digested (typically with pepsin) to generate peptide fragments. Peptide fragments from each sample are analyzed using LC-MS to identify mass differences at various time points. The D uptake altered by binding enables identification of putative paratope and epitope peptides. Figure is adapted from (20).
After labeling, quenching is performed immediately by a drop in pH and temperature to minimize back exchange (37). Denaturants and reducing agents (i.e., tris(2-carboxyethyl)phosphine (TCEP)) are often added to the quench solution to reduce disulfide bonds in Abs because they are highly protease-resistant and will complicate MS data interpretation if left intact (19, 38). After pepsin digest (typically on-column), proteolytic peptides are desalted, separated on an ultra-high-performance liquid chromatography (LC) column, and ionized by electrospray ionization (ESI). The peptides’ masses are measured. By comparing the mass in the complex sample and in free sample for each peptide, its deuterium incorporation is computed, revealing potential peptides participating in the Ab-Ag interface (19).
Overall, the output of a conventional HDX-MS experiment is the level of deuterium uptake of each individual peptide at each labeling time point, accumulated by amide groups constituting that peptide. In most epitope mapping studies, the exchange rate of each peptide is averaged over all its amide groups, compared between the complex state and free state, and mapped onto the Ag/Ab structure if available (14, 20, 22, 31–35). These peptide-resolution HDX-MS data preclude quantitative analysis and identification of residue interactions between Ab and Ag (39).
HDX-MS provides the most comprehensive understanding of Ab-Ag interactions when combined with other structural techniques such as chemical-crosslinking with MS (XL-MS), cryo-electron microscopy (cryo-EM), or X-ray crystallography. XL-MS delivers information about the proximity of nearby protein sites and the distances between specific residues due to the identified inter-molecular cross-links. When combined with HDX-MS, XL-MS increases the chance of identifying a protein-protein binding site, pinpoints the exact interacting residues, and contributes explicit distance restraints for complex modeling. Their integration allows for the generation of more precise, high-confidence models of protein interfaces (13, 40).
HDX-MS offers valuable information about protein dynamics and flexibility while standard biophysical techniques such as X-ray crystallography and cryo-EM provide static structures of a protein. As each approach brings forth unique information, combining HDX-MS with either X-ray crystallography or cryo-EM can complement one another and provides a more comprehensive picture of protein interactions. The distinctive ability of HDX-MS to capture conformational changes and dynamics was highlighted in a recent protein interaction study (41). Several promising sites for antibody neutralization on SARS-CoV-2 spike protein were detected by HDX-MS and would have been overlooked if investigators had relied only on a snapshot of the protein complex from X-ray crystallography.
Employing both HDX-MS and cryo-EM, a study of the interaction between transcription initiation factor σS and its activator protein Crl revealed an allosteric structural change at one site among all HDX-predicted binding sites (42). The allosteric site would have not been discovered based on the cryo-EM structure alone. Similarly, one of the HDX-predicted binding sites would have been a false positive based on the HDX-MS data alone. The combination of HDX-MS and cryo-EM unveils information that is unattainable by each individual technique. Besides distinguishing allostery, the benefits of this combined approach were thoroughly reviewed by Engen and Komives (43); examples of which include protein quality control, protein folding, and the process of large complex assembly and their mechanisms (43).
HDX-MS Experimental Limitations Complicate Accurate Prediction of Residue-Residue Interactions
Emerging as a potent epitope mapping technique, HDX-MS comes, however, with several limitations. A brief introduction of the major inherent limitations of HDX-MS and some alternative or complementary approaches currently being used, along with their respective advantages and disadvantages, is presented below (see also Figure 2 and Table 1).
Table 1 Current alternative or complementary experimental approaches and their pros and cons regarding each HDX-MS experimental limitation.
Peptide-Level Resolution
In most commonly used experimental setups, HDX-MS reaches peptide-level resolution, and individual residues’ contribution remains uncertain. When protonated peptide ions collide with neutral gas upon collision-induced dissociation (CID) - a typically-employed ion fragmentation method, CID increases the vibrational energy of the ions and leads to extensive intramolecular H/D scrambling (44). As a result, HDX-MS fails to achieve residue-specific resolution with CID employed and only provides the mass change for an entire peptide fragment (calculated as the centroid value difference of a peptide isotopic envelope at two time points) (24). There has not been a widely established MS method to locate single amino acid residues to which deuterium is incorporated (20, 45, 57).
An experimental solution to this challenge is the usage of electron-induced fragmentation methods such as electron transfer dissociation (ETD) and electron capture dissociation (ECD). In ETD/ECD with very gentle ion source conditions, D scrambling rarely happens because the internal energy deposited upon the recombination of an electron with the positive peptide ion remains localized, causing bond cleavage at the site where the electron is captured (11, 46). Nevertheless, gas-phase scrambling can still occur and needs to be diminished for different peptides through meticulous optimization of ion fragmentation and ESI parameters (24, 39). In principle, the ETD/ECD experiment enables single residue resolution through the calculation of mass differences between fragment ions (c and z ions) differing by exactly one overhanging amino acid. While different acid-stable proteases can be exploited to generate overlapping peptides, obtaining peptides that differ by only one residue and overhang through the entire protein sequence is unlikely. As ETD/ECD fragmentation is most efficient with peptides at high charge states, some peptides will be unamenable to this fragmenting method, preceding its application as a routine method for localize deuterium uptake at individual amide level (58). Overall, HDX-MS ETD/ECD fragmentation has promising potential to provide amino acid resolution but there is no guarantee that residue resolution, even only for the sites of interest, will be achieved (44, 45).
Insufficient Sequence Coverage
HDX-MS experiments depend heavily on identifying reproducible peptides spanning the whole protein sequence. There is no threshold on what peptide coverage level is qualified as acceptable in the HDX-MS community (19). The goal is always to maximize the sequence coverage and to generate as many overlapping and unique peptides as possible to increase the sequence resolution in the optimization stage prior to HDX-labeling. To achieve this goal, the emphasis is on optimizing various HDX-MS conditions including quenching, digestion, and chromatography separation in the optimization stage (59). Insufficient peptide coverage or improper peptide identification in HDX-MS limits the ability to detect protein-protein interactions.
Large complexes contribute to this challenge in HDX-MS analysis because too many peptic peptides are produced. Given the restraints of the quench conditions in HDX, it is challenging to acquire chromatographic resolution for all the peptides generated. This results in overlapping mass spectra and complicates proper HDX-MS identification (38). In a recent report from Garcia et al., HDX-MS sequence coverage was significantly reduced when using a whole virus sample because of overlapping mass spectra from too many viral proteins present (60). One way to increase the resolution and reduce mass spectral overlap in HDX-MS experiments is to incorporate ion mobility spectrometry (IMS) after the chromatography step. Overlapping peptides in chromatographic separation can be separated in the gas phase with IMS, thus allowing the identification of more peptides in large complexes. However, more work is needed to routinely incorporate IMS in the overall HDX-MS experiment (45).
Highly glycosylated proteins, such as viral envelope proteins, pose a special challenge in HDX-MS experiment analysis. For glycoproteins, the substantial heterogeneity of the glycans combined with the broad specificity of pepsin digestion produces a diverse pool of peptic glycopeptides, leading to poor MS signal intensity of individual glycopeptides. As a result, severely reduced sequence coverage around the glycan sites is observed, significantly impacting the utility of HDX-MS to analyze such proteins (51). In Puchades et al., all residues immediately surrounding the hemagglutinin glycosylation sites lacked coverage (25). This challenge can be addressed by enzymatic deglycosylation of the glycoprotein with peptide-N4-(N-acetyl-β-d-glucosaminyl)asparagine amidase F (PNGase F) - the most common enzyme used prior to the HDX-MS labeling step (48). However, deglycosylation before labeling may alter the protein-protein interaction and prevent correct identification of antigenic sites, as some Abs carefully navigate around glycans. Deglycosylation also risks destabilizing the proteins and can cause aggregation. PNGase A and PNGase H+ are enzymes of choice when it is desirable to remove the N-linked glycosylations after labeling to preserve the native conformational dynamics and interaction of the glycoproteins (49, 50). Nevertheless, these enzymes require offline pepsin digestion and manual sample injection into the LC-MS system, which can be more labor-intensive than the automated sample injection system conventionally employed in the epitope mapping experiment. There are strategies to enrich the glycopeptides for MS analysis, but, to date, none have been developed for routine implementation in HDX-MS epitope mapping experiments (61, 62).
In addition to the highly glycosylated Ags, Abs with their disulfide-bonded regions add complexity to HDX-MS analysis. Peptides containing disulfide bonds are resistant to digestion, thus producing complicated fragment ion spectra and severely compromising interpretation. To reach an acceptable coverage of Ab sequences and facilitate HDX-MS analysis, a high concentration of TCEP is commonly added to the quench buffer to increase its disulfide reduction efficiency during the quench period (19, 38). On the other hand, too much TCEP can deteriorate LC and MS performance (51).
The stochastic behavior of pepsin contributes to difficulties in HDX-MS analysis. Although pepsin prefers to cleave between hydrophobic residues, it is still a non-specific protease with unpredictable cleaving pattern (63). Peptides generated from pepsin digestion display a great diversity of sequences; however, only highly reproducible and ubiquitous peptides can be used for further study and sequence-level comparison of deuterium content (52). Therefore, multiple replicates are needed to identify peptides reproducibly after pepsin digestion. When multiple replicate digestions under the same condition fail to result in reproducible peptides covering a region of interest, another strategy to increase coverage is to alter the digestion conditions by including additives such as denaturants or by using an activated pepsinogen coupled column. One other option is to change to or supplement with another enzyme (i.e., aspartic protease such as rice field eel pepsin) (53). A commercially available co-immobilized, dual protease column combines pepsin and type XIII protease from Aspergillus into a single packed column. The combination of their complementary specificities was shown to enhance the digestion efficiency of IgG molecules compared to pepsin or type XIII protease alone (54).
An acceptable sequence coverage percentage entirely depends on each practitioner’s own standard, their preliminary knowledge of the protein interaction, and the purpose of the project. Specifically in epitope mapping studies, the complementarity-determining regions (CDRs) of an Ab are often the regions of special interest. If the sequence coverage is perceived to be unsatisfactory or the regions of interest are not covered after optimization, the HDX study stage may be cancelled. It is not uncommon to encounter cases from time to time where HDX-MS does not provide any useful information on the binding sites of the proteins of interest. It is also important to note that even when the sequence coverage percentage is high, there is always a possibility of false negative results because a section of the protein sequence is not covered. Fortunately, this difficulty is one of several that computational docking can alleviate.
Allosteric Effect Being Indistinguishable From the Binding Interface
Another limitation of HDX-MS is that protection due to protein-protein interaction can be confounded by allosteric effects. HDX-MS fails to discriminate between the direct binding sites and the remote conformational change resulted from complexation (11, 25, 32). Often complementary experiments are carried out to validate the direct binding sites such as site-directed mutagenesis (20). This approach can be laborious and expensive, with the occasional occurrence of false-positive results. Disulfide trapping in cells is another method that can verify the binding interface and validate MS data in physiologically-relevant conditions by introducing two cysteine residues, one per interaction partner, at selected positions within their potential interaction interface and examining the formation of disulfide bonds with oxidizing agents (13). The employment of XL-MS as a complementary approach can also distinctly detect conformational changes induced by protein aggregation from the reduction in solvent exposure (55). For modeling the complex, XL-MS contributes unambiguous distance restraints, unlike HDX-MS. The kinetic millisecond HDX-MS (TRESI-HDX) method utilizes the early (millisecond) labeling time where conformational equilibria are unsettled, thus allowing the HDX-MS signal to develop as the binding event occurs before the allosteric event (64). The millisecond deuterium labeling takes place in a capillary mixer incorporated into a microfluidic chip (65). The sample is then quenched, digested, and ionized rapidly. However, only epitope peptides from Ags can be identified. It is difficult to identify paratope peptides from the Ab via TRESI-HDX-MS because the pepsin-linked agarose resin used on the chip makes pepsin digestion less efficient, resulting in insufficient peptide coverage on the Ab. Furthermore, distinguishing binding events from allosteric events with TRESI-HDX-MS depends heavily on the time it takes for allostery to develop, which varies from case to case (32). Besides the above, many other methods can be used to validate directly contacting regions, such as low-resolution cryo-EM, immunodiffusion in gel, Ab competition, surface plasmon resonance, etc (56, 66).
Computational Methods Try to Estimate the Residue-Specific Resolution of HDX-MS Experimental Data, Simulate HDX-MS Data From Protein Structure, and Explain Underlying Factors Contributing to the Observed HDX-MS Data
Computational methods are being used more frequently to tackle the inherent limitations of the HDX-MS technique. In this section, we focus on the state of computational methods in improving the resolution in HDX-MS experimental datasets and simulating HDX-MS data for native model selection through parallel comparison with the observed deuterium uptake profile. The complementarity of protein modeling and HDX-MS will greatly benefit from a detailed understanding of factors underlying the HDX mechanism.
The Protection Factor (PF) Is a Biophysical Concept Used to Describe Hydrogen-Deuterium Exchange Behavior
The H-D exchange reaction occurs at neutral pH in most experiments and predominantly adheres to a base catalysis mechanism. The amide proton is attacked by a deuterated hydroxide ion, leaving the amide nitrogen atom negatively charged (Figure 3). The amidated anion then removes a D+ ion from another D2O molecule to get re-protonated (57).
Figure 3 Schematic of base catalysis HDX of amide backbone protons in solution. Figure is reproduced from (57).
The key to investigating protein structure using HDX-MS is that the conversing rate from N-H to N-D is modulated by the protein structural attributes. The overall exchange mechanism can be described by the equilibrium below:
kop and kcl represent the rate constants for the opening and closing reactions. kch is the “chemical” rate constant or the intrinsic and maximal exchange rate of N–H → N–D conversion measured for each residue amide N-H when they are solvent-exposed and without hydrogen bonds. The combination of these three rate constants is unique to each N-H group in a protein.
The N-H groups represent a continuum of hydrogen deuterium exchange mechanisms. If the rate of the opening/closing transition is lower than the chemical exchange rate (kch ≫ kcl), chemical exchange occurs quickly after conversion to the solvent-exposed form (i.e., the amide exchanges as soon as the very first opening event) and the observed rate (kHDX) is governed by the rate of structural opening (kHDX = kop). This kinetic regime is called EX1. Under physiological conditions, EX2 is the dominant kinetic regime where the reconversion of the solvent-exposed to the protected state occurs much faster than the rate of chemical exchange: kcl ≫ kch (17). Thereby, only a fraction of the available amide hydrogens will undergo deuterium exchange before resuming the closed conformation. The HDX exchange rate for EX2 regime is
kch is dependent on the pH, temperature, and the protein primary sequence (i.e., the neighboring residues) and can be calculated from these parameters (67, 68). The only variable depending on the protein structure-related features is kop/kcl ratio – the equilibrium constant of the closed-to-open state of N-H reaction (Kop). The inverse of this equilibrium constant (Kop-1) is the so-called protection factor (PF) (17, 29, 39).
The PF (kch/kHDX) illustrates the degree of reduction in the observed exchange rate compared to the intrinsic exchange rate of a backbone amide hydrogen, in relation to the protein structure. Therefore, PF is a representation of how protein conformational properties hamper an amide from deuterium exchange. The concept of PF is especially important in folding simulations and can be used to infer local structural stability at specific sites on the protein (69, 70).
Computational Approaches Enhancing the Resolution in HDX-MS Experimental Datasets Through the Estimation of PF Values or Equivalently Exchange Rates
Multiple mathematical and statistical methods have been developed to delineate exchange rates at the residue level of HDX-MS datasets with optimized fragmentation patterns (71–74). These predicting methods comprise a mathematical algorithm to assign potential HDX rates into equivalence classes for individual residues from overlapping peptide fragments (74), weighted residue-by-residue averaging of the HDX exchange midpoints (73), or a Bayesian modeling statistical method to assess the significance and magnitude of HDX differences between two states, which provides an estimate of ΔHDX for each residue given enough data available (72). Some studies localize the incorporated deuterium in peptides from the isotope distribution pattern. This approach computed trail deuterium uptake value for each amino acid in the protein sequence to achieve the best global goodness-of-fit to the experimental isotopic envelope of each observed peptide (71). This approach functions optimally with well-defined isotopic envelopes provided by high-resolution data. Improved versions of the method were reported (75, 76); however, the computational cost is quite expensive and thus restricts the method application to only small proteins weighing less than 30 kDa (76). Also, based on isotope envelope analysis, Hamuro et al. combined their isotope envelopes global-fitting algorithm with wide labeling time windows (eight orders of magnitude) and ETD to achieve residue-specific resolution (77, 78). Another computational method exploits overlapping peptide fragments from HDX-MS datasets cleaved by a mixture of pepsin and protease type XIII (79). A linear optimization iteration is performed to quantify the deuterium incorporation in segments from overlapping peptides, followed by a non-linear iteration to quantify exchange rates for all amides.
Overall, while the above computational methods demonstrate the viability of determining HDX rate at residue-resolution level, they are being adopted relatively slowly for conventional use. The reasons are due to the complex data processing procedures and the demanding experimental requirements. This is because most approaches, specifically those utilizing global fitting iteration, depend largely on high-quality data with an abundance of overlapping fragments, multiple time points, wide time windows, and resolved isotopic envelopes (79, 80). These requirements limit their use on most available HDX-MS datasets where the fragment patterns and redundant coverage are far from optimal. A statistical method developed by Skinner et al. was proposed to yield estimates or restraints of PFs for individual amides based on overlapping fragments regardless of the quality of input data (80). Nevertheless, the caveat is that the solution is ambiguous since the method can only provide an estimation of plausible distributions of the PFs (80).
Computational Methods Predict HDX-MS Data (PF and Deuterium Uptake Level) for Comparison With HDX-MS Experimental Values to Improve Protein Model Selection
Simulating HDX-MS result from protein model is a promising approach for protein structure prediction. By comparing experimental deuterium uptake with predicted deuterium profiles derived from protein models, native configurations can be validated (Figure 4). This approach requires the ability to accurately predict HDX-MS PFs from a protein structure, which has proven challenging. The details of this will be discussed in the next paragraph. The predicted PFs are used along with the intrinsic exchange rates kch to calculate kHDX for each residue (Equation 2), from which their deuterium level is computed at user-defined labeling times. The deuterium uptake over contiguous residues of peptides present in the HDX-MS experimental sequence coverage list is integrated to estimate peptide isotope uptake data. Unique HDX-MS patterns are simulated for each protein model, facilitating model ranking and validation of native models through pairwise comparison to HDX-MS experiment data. Back exchange also needs to be accounted for. Deuterium loss from back exchange is easier to correct for during HDX-MS experimental data acquisition than through computational approximation, by measuring a fully deuterated control sample. Nevertheless, this extra step is disregarded in a majority of the reported HDX-MS datasets. While deuterium losses can be approximated computationally (68, 81), the varying experimental conditions such as temperatures, buffer solutions, pH, and LC eluent concentrations render these estimates inexact (82). Error in back exchange correction contributes to some of the disagreement between experimental and computationally predicted HDX-MS data (39, 82).
Figure 4 Flowchart of HDX-MS data simulation to validate plausible protein models. For each protein model, PF at each amide position is predicted. From PF and the intrinsic exchange rate kch, the D value of each residue is calculated. HDX-MS data for each peptide (from the experimental peptide list) is generated by summing D uptake of the member residues. The simulated HDX-MS profiles of protein models are compared with the experimental data normalized for back exchange. Matching D profiles of multiple peptides validates the protein model, facilitating differentiation between native and non-native models.
Many Different Methods Have Been Developed to Predict PF From Protein Structures
Methods to estimate HDX-MS behavior of proteins through PF prediction have a unique combination of metrics. Each one incorporates different features of protein conformation to varying extents to derive PFs including solvent accessibility, hydrogen bond, electrostatic force, and solvation (39, 82–89). Simulation methods for PF estimation generally belong to one of the two groups: fractional population models and empirical models.
The first group - fractional population models, exemplified by the COREX method, exploits the connection of PF with the free energy difference between a protein folded and unfolded states (90). PF of a residue is derived as an equilibrium constant from a structural ensemble in an MD simulation and is computed by fractioning the total number of states where an amide is buried and “closed” to the total number of states where it is exposed and “open” (89).
The second group - empirical models, associates PF with protein structure-related features and uses structure-based scoring functions to approximate PF. The widely used phenomenological approximation method is an example of this group (83, 91). PF is characterized as the sum contributions of hydrogen bonding and amide burial (the number of heavy atoms surrounding the amide), with each term scaled by a previously defined weighting factor. Despite being underperforming, this method benefits from the ease of implementation with promising potential for high-throughput scoring of an ensemble of modeling outputs (92–94). The calculation of HDX data from a protein atomic models based on the phenomenological expression is now feasible for researchers using a software package called HDX ensemble reweighing (HDXer) developed by Bradshaw et al. (93). In a recent study, this technique was applied to simulate the deuterium uptake of the top docking poses to compare with the experimental data (36). The simulated HDX profiles of the two docking complexes aligned well with the empirical data, further validating the selected models, and highlighting the advances being made in the field of HDX prediction.
Also belong to this group of empirical models, a recent method called protection factor prediction based on protein motions (POPPeT) incorporated additional information of secondary structure features and protein motions (95). The POPPeT method was shown to predict amide hydrogen PFs more accurately than both the phenomenological approximation and the COREX method. However, POPPeT was based on information about HDX-enabling protein motion, which is rarely available and is thus not considered a full-fledged PF prediction method by its authors (95).
Moreover, high accuracy in amide hydrogen PF prediction do not necessarily translate into high accuracy in deuterium uptake at peptide level (95). A recent study compared the accuracy of HDX-MS simulation from nine different PF predicting models (96). Using the same set of proteins, each model was evaluated by how well it matches HDX-MS experiment result. Mohammadiarani et al. concluded that the fractional population models outperformed conventional empirical models. Nevertheless, even the most reliable models in this study still had large errors (around 40%) and low correlation coefficients when predicting experimental HDX-MS data.
Overall, attempts to reliably estimate deuteration profiles that match closely with the experimentally determined HDX-MS data based on predicted PFs have been proven challenging so far. In addition, PF formulations that require usage of MD simulations or other sampling methods are computationally expensive and thus, are impractical for high-throughput ranking of docking simulation models. Indeed, further testing on the capacity of these methods to identify native conformations among others is severely lacking.
Simulated HDX-MS Patterns From Protein Decoys Can Benefit Structure Prediction
The Borysik research group is the only group, to date, to have extended to work on simulating HDX-MS in structure prediction and docking (92, 94). Using PF predicted from the easy-to-implement phenomenological approximation method, Borysik and his colleagues predicted HDX-MS outputs for two monomeric proteins (barnase and alpha-lactalbumin) and two homomeric proteins (enolase and serum amyloid P), each with 1,000 models and compared them with experimental HDX-MS data (94). HDX-MS simulation generated for crystal structures of two homo oligomeric proteins correlate significantly less with the experimental data than the monomeric proteins. This might be because the algorithm used to predict PF was originally trained on monomeric globular proteins only and was never optimized for usage on large multiple chain proteins. Indeed, when this phenomenological approximation method was applied to help select native conformations among the docking outputs in another study, optimization was required to tune the PF expression for protein interfaces (92). PF of proteins obtained in their bound configurations was redistributed towards those values simulated for the unbound states, akin to incomplete binding where a fraction of each protein would populate unbound conformations in the mixed samples. In both studies, pairwise comparisons of HDX-MS simulated output with experimental data was shown to be sufficient to discern native structures from non-native ones, as well as selecting the native docking pose (iRMSD<0.7 Å) without the necessity of further data analyzing or manual interpretation (92, 94). It was demonstrated that the predicted PFs, even when poorly determined, can still be adequate for simulation of HDX-MS data or HDX-MS difference data (produced by subtracting the simulated isotope uptake of the bound from the unbound states) that effectively evaluate and discriminate between poses. As the two studies were performed to only a few proteins, future work is in need to fully assess the potential of simulated HDX-MS data in high-throughput structure evaluation. Nevertheless, these studies revealed the promising power of HDX-MS in protein modeling and how this area would greatly benefit from improved prediction of PFs from protein atomic structures.
Factors Contributing to PF Are Incompletely Understood
Progress on improved simulation methods is pending upon a deeper understanding of HDX structure determinants. Factors commonly thought to contribute to amide backbone protection, making the overall exchange rate constant kHDX smaller than the intrinsic exchange rate kch are intramolecular hydrogen bonding and limited solvent accessibility (17, 97). A study by McAllister and Konermann reexamined this notion by exploring the correlation between experiment and simulation (98). In their 1-μs all-atom simulation of ubiquitin, only 57 out of 72 amide exchanges were accounted for through hydrogen bonding or solvent exposure. Some amide hydrogens were solvent accessible and did not participate in hydrogen bonding but were found to be protected from exchange. Electrostatic factors and hydration could potentially be contributing factors for these observed discrepancies between the simulation and HDX pattern. The ambiguous understanding of the HDX-MS process has led to the apprehension that HDX events are too intricate to be expressed in a formula (99). More study on the correlation between theoretical mechanisms of HDX-MS and its experimental measurement is necessary to improve future models.
HDX-MS Experimental Results on Ab-Ag Interaction Have Been Combined With Computational Docking
Computational docking is a common method to predict Ab-Ag interaction when a complex crystal structure is unavailable and has been combined with HDX-MS to better map epitopes (100, 101). In this section, we will focus on different practices for implementing HDX-MS in Ab-Ag docking as restraints in sampling and model generation and as filters in model selection.
Overview of Computational Ab-Ag Docking
The protein-protein docking process involves two main steps: effectively sampling various docking configurations and accurately ranking the decoys by free energy score. There are many algorithms for protein–protein docking such as RosettaDock (102–104), DOT (105), HADDOCK (106), ZDOCK (107), ClusPro (108), PatchDock/SymmDock (109), and FTDOCK (110). In this paper, RosettaDock in the Rosetta software suite is chosen to exemplify a typical docking protocol (111).
RosettaDock is a Monte Carlo-based docking algorithm that employs rigid-body docking of two interacting partners and optimizes their side-chain conformation (102–104). The algorithm requires a structure of both proteins in an initial docking pose as input. This is either manually arranged if prior structural information about probable regions of interaction is known or is randomized through a global docking step (102, 112). RosettaDock starts with a low-resolution (coarse grain) docking step. A Monte Carlo search is performed with rigid-body movements (namely adaptive rotation and translational moves) around the surface of the binding partners being represented in centroid-mode. For Ag–Ab interactions, the search space for Ab is limited to the six CDR loops as they are known to be the binding sites for the Ag epitope (113). The best scoring model from the low-resolution stage is adopted for high-resolution docking (all-atom refinement). All-atom side chains of docking partners are restored in place of centroid atoms, followed by additional docking position refinement and side-chain optimization steps (112). The performance of a docking attempt and evaluation of the best model can be assessed by the total energy and the energetic enhancement of the interface (binding interface energy), as well as the Cα RMSD to the best scoring model.
Predicting a protein-protein interaction is challenging due to many possible docked conformations. Although Ab recognition of Ag is largely limited to the CDR loops, Ab-Ag interactions present a unique challenge for computational docking. The structural flexibility of the CDR loops, especially HCDR3, and the homology model-based inaccuracy of Ab can confound docking (114, 115). As conformational changes after complexation and error in Abs homology modeling are taken into consideration, Ab-specific moves have been incorporated into the RosettaDock protocol (SnugDock): refining of HCDR2 and HCDR3 loops after the low-resolution stage and sampling of VH–VL orientations during the high-resolution stage (113, 116). Nevertheless, if motions upon Ag binding is substantial or the Ab models are substandard, it would be challenging, if not improbable to accurately predict the native binding structures.
The chance of achieving an accurate docking model will be enhanced by including experimentally obtained restraints - a strategy that has become increasingly popular (117, 118). Various experimental methods have been employed in combination with computational docking such as NMR, site-directed mutagenesis, electron paramagnetic resonance, low-resolution cryo-EM, and XL-MS (41, 117, 119–122). Here, we focus on how HDX-MS and computational docking are combined to identify the epitope of various Ab-Ag interactions. Given a diverse collection of restraints and filters in Rosetta, knowledge derived from HDX-MS experiments can be incorporated into the sampling strategy (Figure 5). For example, the starting docking pose is manually arranged to a configuration compatible with the experimental knowledge, which allows for the exploration of only a small region of conformational space and improves sampling density around the putative binding sites. Additionally, distance-based filters (or restraints) can be set during either low- or high-resolution docking stage or both to bias sampling towards docking poses compatible with the experimental data. Moreover, HDX-MS data can be applied to select native models, either manually or as filters or ideally being incorporated into the scoring function (Figure 5).
Figure 5 The protein-protein docking procedure from RosettaDock – a multiscale Monte-Carlo based algorithm with different stages for HDX-MS data to be incorporated. HDX-MS data can be applied to the sampling stage (including the arrangement of the starting pose, low-resolution docking, and high-resolution docking) as well as the scoring stage. Figure is adapted from (102).
Different Practices for Implementing HDX-MS as Restraints in Computational Ab-Ag Docking
In a recent benchmark study, blind Ab-Ag docking using current docking algorithms was shown to achieve a success rate of approximately 66% for near-native prediction using CAPRI (Critical Assessment of Predicted Interactions) criteria of Medium accuracy (115). Although there has not been any benchmark study to examine the success rate of HDX-MS in conjunction with Ab-Ag docking, evidence in the literature has pointed to significant improvement in docking performance with HDX-MS.
HDX-MS signals have been combined with docking simulations in different protein modeling suites (i.e., MOE, PatchDock, ZDOCK, RosettaDock (102, 103, 112, 123) to derive Ab-Ag complex structures in several epitope mapping studies (e.g.,11, 14, 27, 32, 36). A summary of their protocols and docking performance is presented in Table 2. The ideal starting components for docking are crystal structures of unbound Ag and unbound Ab (in the form of Fab fragments or variable fragment Fv). In the absence of an experimentally determined Ab structure, a homology modeled Ab is often used instead, representing a more common Ab-Ag docking case. One study used models of HDX-predicted Ab peptides (32). The number of sampling poses ranges from 50,000 to 100,000. The detailed practices of implementing HDX-MS as restraints in computational docking vary between studies and are depicted in Figure 6.
Table 2 A summary of studies integrating HDX-MS and docking to predict Ab-Ag native poses and their performance.
Figure 6 Different practices have been used to incorporate HDX-MS data into Ab-Ag docking. HDX-MS data are provided in form of identified epitope peptides (e.g., peptide 219-230 with a decrease in deuterium level of at least 4Da across 4 timepoints). Docking components: Ag crystal structure (blue; HDX-identified peptide: red) and Ab (Fab or Fv) crystal structure or homology model - (yellow; CDRs: pink). Sampling stage: (A) Restrain the docking sites. Favorable potential (red sphere) - spheres surrounding the backbone N atoms in the epitope are filled with favorable values. Each atom of the Ab lying within this sphere would contribute a favorable potential to the energy function. Distance restraint (left right arrows) - a minimum number of residues from each epitope peptide must be within a specified threshold distance to the Ab. Lenient approach: one residue from the epitope peptide to be at the interface. Stringent approach: the minimum number of contact residues correlates with the decrease in deuterium level (i.e., 4 residues in this case). (B) Restrain the non-docking sites: non-epitope peptides and non-CDR regions (grey) are blocked from docking using distance restraint. Scoring stage: HDX-MS data is applied as filters in form of: the number of satisfied distance restraints is - the higher the count (favorable contact) is, the more optimal a docked pose is considered to be; visual analysis - manually inspection of the docked poses on whether the epitope is at the interface; SASA evaluation - SASA calculation of the epitope peptide in the unbound form must be larger than the bound form.
Not all of these studies were carried out in comparison to blind docking. However, those that did demonstrated a higher number of energetically favorable contacts in the docking poses and better interface root-mean-square deviation (i-RMSD) of the top models (27, 32). Aside from Ab-Ag complexes, docking studies with HDX-MS integration were performed for other protein-protein complexes (13, 40, 125). The incorporation of HDX-MS as restraints and filters was shown to enrich the native-like conformations in the prediction and accurately select high-quality, biological-relevant models that would not be the case for docking poses ranked by the regular binding scores.
HDX-MS Data Are Applied as Restraints in the Sampling and Model Generation
HDX-MS difference identifies peptides at the protein binding interfaces. This information can be utilized as restraints during the generation of docked complexes to better focus the conformational sampling space. The restraints are generally in the form of distance restraints (score penalty) or favorable potentials. They facilitate one of the two goals: favoring the presence of residues in HDX-predicted epitope peptides at the interface or hindering residues outside of these peptides from participating in binding.
To propel the presence of HDX-MS predicted epitope peptides at the interface, these peptides are assigned as favorable docking sites (11, 14, 32, 36). This is accomplished by applying fixed distance restraints between residues in HDX-detected epitope peptides and residues in the Ab. Since most Abs interact with Ags through the CDRs, the restraints for Ab are often applied to residues in the Ab CDRs or residues in HDX-MS-detected paratope peptides if an HDX-MS experiment on paratope mapping was done. The interpretation of the distance restraint is that the sampling poses are required to contain a minimum number of residues from each epitope peptide within a specified threshold distance to certain residues in the Ab, or an energy penalty is applied.
The distance restraint comprises three elements whose parameters are seldom explicitly stated. The first element is the minimum number of residues from each HDX-MS peptide to be restrained at the interface. The qualitative, lenient approach does not take into account the ΔHDX signal intensity. Regardless of the magnitude of the relative deuterium uptake difference in a peptide, the lenient, qualitative restraint only requires at least one residue in each interacting peptide to be at the interface (14, 125). A more stringent approach is to correlate this number with the deuterium uptake difference between the unbound and the bound states (125). Roberts et al. reported that one approach could offer more optimal results than the other depending on the protein it is applied to. While the stringent approach allows for quantitative HDX-MS data interpretation in form of restraints, it runs the risk of overestimating the number of residues in the HDX-MS peptides that truly contact the other molecule, which would bias the docking process to incorrect configurations. This is because the increased level of solvent protection is not only caused by protein interaction but also allosteric effects, especially in the case of structural stabilization. Therefore, the interpretation of HDX-MS interacting peptides must be confident for the stringent approach, perhaps by additionally inspecting the 3D structure of the unbound protein. Otherwise, it might be better to interpret HDX-MS data as lenient, qualitative restraints. Perhaps a hybrid approach can be used where the stringent restraints are applied to confidently identified HDX-MS peptides (either through visual inspection of 3D structure or complementary data from other experiments) and lenient restraints are applied to the remaining HDX-MS peptides. We have not seen such a hybrid approach reported.
The second element is the nature of the distance restraint. Unlike chemical crosslinking, HDX-MS fails to provide a numeric distance threshold between specific atoms for docking. Thus, for implementing HDX-MS restraints, the distance and the atom of each of the two residues for which this distance being measured are improvised by researchers. They vary between studies, such as 10Å between Cα atoms of two residues (14) or 7Å between the backbone nitrogen atom of one residue and any heavy atom of another residue (125). One study examined a range of distance values and suggested that the optimal distance value can be slightly different for different protein systems (125). The third element to consider is the magnitude of the applied energy penalty function. A too harsh score penalty will likely hamper the sampling process by trapping structures in a local energetic minimum, which might result in a uniform population of complex models with the wrong conformation.
Another way to favor HDX-MS peptides as docking sites, besides using distance restraints, is through added potentials, although this is less commonly used. The docking study of Roberts et al. was performed with DOT rigid-body docking calculation (105, 125). In one protein, backbone nitrogen atoms of HDX-MS predicted peptides were assigned favorable potential. Each atom in the other protein residing within specific distance cut-off to these nitrogen atoms would contribute a favorable potential to the free energy score. A variety of sphere sizes and values within them were examined and the optimal values were chosen (125). While docking with added favorable potentials endeavors to maximize the number of contacts between specific residues of one protein to another, it neglects that one molecule must simultaneously contact all the interacting peptides on the other molecule. Meanwhile, the distance restraint has greater flexibility to enforce simultaneous contact by multiple regions.
Instead of requiring HDX-MS peptides to be buried at the interface, a different approach to apply HDX-MS data to the sampling stage is excluding regions with identical deuterium exchange profiles before and after complexation from the interface (20). In this approach, all residues in non-epitope peptides and non-CDR peptides are blocked from docking using distance restraints (27, 32). This approach relies on the notion that a common HDX-identified peptide is larger than the actual contacting segment and thereby contains most of the true contacting residues (27). As a result, it is safe to assume that residues in non-epitope peptides fail to participate in protein-protein interactions. However, this assumption is inapplicable to regions rich in proline because these are the “blind spots” in HDX-MS experiment; no inference can be drawn about them from HDX-MS data.
HDX-MS Data Are Applied as Restraints in the Scoring and Model Selection
A major challenge in protein docking is identifying native models among the entire ensemble of docking outputs. In studies leveraging HDX-MS data to evaluate docked models, the bulk of the evaluation relies on the total score and binding energy computed by the docking software (13, 14, 32, 40, 125–127). Based on these energy functions, an initial list of 50,000-100,000 models can be curated to an ensemble of the top 25-2,000 models. HDX-MS data are then incorporated to distinguish correct complexes from these top-scoring models. One crude way to utilize HDX-MS data for model selection is visual inspection (14, 40, 126, 127). The top 10-25 models are manually inspected for agreement with HDX-MS data: the binding regions with reduced exchange rate upon complexation should reside in the interface. In contrast, regions with constant exchange rate should be absent from the interacting regions. Visual analysis is commonly performed in studies combining docking with both XL-MS and HDX-MS where XL-MS is used to provide distance restraints during sampling and HDX-MS is used to narrow down the best complex models at the final stage (40, 126, 127).
Other studies implemented HDX-MS data as filters to eliminate incorrect configurations. The filter can be based on solvent accessible area (SASA) calculations where SASA of the interacting regions predicted by HDX-MS in the bound protein is compared with the SASA of its unbound form. A model is considered to be accurate when the SASA of these regions in the docked complex are all smaller than that of the unbound (13). Another form of HDX-MS filter is the number of distance restraints being satisfied. Docking models are eliminated if they fail to fulfill a minimum number of distance restraints (125). Using this criterion, Deng et al. ranked the docking models where the optimal poses had the highest numbers of favorable contacts between the two proteins (32). It is worth noting that applying HDX-MS data as distance restraints during both the sampling and the scoring stage can result in incorrect configurations due to overfitting. To avoid overfitting models when using HDX-MS data, some HDX-MS restraints should be omitted from the sampling stage and instead used to validate the ensemble following refinement. Nevertheless, this approach only works if there are multiple HDX-identified interacting peptides, which might seldom be the case for many HDX-MS experiments.
Overall, although many studies have been conducted using different software packages, there is currently no common best practice to implement HDX-MS restraints or guidance on how to use these restraints in docking simulations, nor has a consecutive method been developed for how to select models fulfilling HDX-MS restraints.
Current Docking Simulations Have Yet to Capture the Complexity of HDX-MS Data
Although hybrid methods for combining experimental data and docking simulations have been quite well-developed for techniques such as cryo-EM, NMR, XL-MS, or electron paramagnetic resonance (117), an optimized method for integrating HDX-MS data into computational docking, in general, and for Ab-Ag complexes, specifically, is needed. A brief overview of the challenges associated with this task is presented in Figure 7. The opportunities that these deficiencies provide are described in the following paragraphs.
Figure 7 Challenges of HDX-MS data incorporation into docking. 1) Quantitative HDX-MS restraint: HDX-MS data is incapable of providing quantitative geometric distances between atoms, thus, hindering its application to the sampling process. 2) Potential misinterpretation of HDX-MS data. True positive signals can be undetected due to insufficient coverage and inherent blind spots or overlooked. False positive signals are very probable due to manual data interpretation, noise, and allosteric effects. 3) HDX-MS weighted scoring function. The evaluation of docking poses based on HDX-MS data is still largely qualitative. 4) High-quality dataset for benchmarking. HDX-MS data and crystal structure of an Ab-Ag complex in most cases are exclusively available. There is a need for establishing a database for HDX-MS epitope mapping data.
Quantitative HDX-MS Restraint
As mentioned above, relatively crude atom-atom restraints have been most commonly used to restrain the distance between residues in HDX-predicted interacting peptides, with no regard to the intensity of HDX-MS signals. The geometric distances between specific atoms deduced from HDX-MS data are ambiguous for usage as quantitative restraints in model building. There is an opportunity for benchmarking a set of optimized parameters constituting HDX-MS distance restraints. Ideally, the rigidity of the restraint correlates with the magnitude of the relative deuterium uptake difference for each peptide to some degree (a balance between the stringent approach and the lenient approach).
While the number of native-like structures can be significantly enriched in the prediction with HDX-MS data being applied as restraints, the expectation of accurate docking with HDX-MS where near-native configurations are always generated is unreasonable. Regardless of how HDX-MS data are applied to restrain the docking sites (either as distance restraints or favorable added potential), the inherent properties of HDX-MS data mentioned above might mislead docking experiments (125). First, HDX-MS are sparse, in which information is insufficient to fully constrain the structure. Second, HDX-MS provides no information about proline residues. Both properties can move the best ranking complexes away from the correct conformation.
Potential Misinterpretation of HDX-MS Data
HDX-MS data can be ambiguous. True positive interactions can go undetected due to insufficient coverage and inherent blind spots (peptides rich in proline) in HDX-MS experiments. Even when detected, a true interacting peptide can be overlooked in HDX-MS experiments. Such a case was demonstrated by Pandit et al. by comparing the interactions of three different Ab-Ag complexes between HDX-MS data and X-ray crystallography data (27). Although 35 contacting residues were detected by X-ray crystallography, only 20 of them were correctly identified by HDX-MS. There were nine contact residues undetected (in the blind spot regions) and six contact residues detected but their deuterium uptake remained unchanged upon complexation (27). Why this happens is poorly understood. As described above, the process of HDX-MS of proteins is highly complex and translation between HDX-MS and structure is still being investigated (98, 128).
In addition, HDX-MS data can be non-specific with moderately high false-positive signals. There are recommended thresholds on the difference in D levels that would statistically be considered as significant (129). However, due to different experimental set-ups and auxiliary information known to the researchers, manual review and assessment are often called for and various studies assign the statistically significant differences in HDX-MS peptides differently. Thus, it is challenging to properly account for experimental errors and noise in HDX-MS experiments. False positives are also possible because of allosteric effects, as mentioned above.
Overall, HDX-MS data are sparse and maybe partially incorrect or incomplete. As a result, a promising approach applying HDX-MS as experimental restraints should balance the weighting of HDX-MS data in qualitative and quantitative docking for improvement in docking accuracy.
HDX-MS Weighted Scoring Function
Evaluating docked models by HDX-MS has been a qualitative procedure. A better way to quantify the agreement of docking models with HDX-MS data would be to alter the scoring function for each calculated metric to consider the sparseness of the HDX-MS data. HDX-NMR data has been incorporated into a scoring function used for computational de novo protein structure prediction (130). Considering residual solvent exposure and structural flexibility, the weighted scoring function improved RMSD of the selected model in 12 out of 15 proteins and by 1.42 Å on average (130–132). This study paves the way for future development of a scoring algorithm for HDX-MS sparse data in docking protocols, which would significantly enhance Ab-Ag complex model selection.
High-Quality Datasets for Benchmarking
One challenge for developing a computational method that incorporates sparse experimental HDX-MS data in Ab-Ag docking is high-quality datasets for computational benchmarking are insufficient. Ideal Ab-Ag complexes for benchmarking will require both HDX-MS datasets and crystal structures of the Ab-Ag complex. Nevertheless, most Ab-Ag complexes with HDX-MS data lack crystal structures because HDX-MS experiments are often performed when the proteins are unamenable to X-ray crystallography. This deficiency provides the opportunity for establishing a database to deposit HDX-MS epitope mapping datasets. The development of an HDX-NMR incorporated scoring system was made possible thanks to a large HDX-NMR dataset on protein folding and stability available via the Start2Fold database (130, 133). Similar to Start2Fold, a framework for deposition and analysis of HDX-MS experimental data targeting protein complexes would tremendously benefit integrative computational modeling effort (19, 51).
Suppose the obstacles mentioned above could be resolved for computational modeling, a typical HDX-MS experiment will be sufficient to restrict the sampling space and output an ensemble of near-native, high-resolution Ab-Ag complex models. HDX-MS could be used more reliably and with greater throughput for epitope identification for Ab engineering.
Future Directions
HDX-MS is uniquely suited to characterize protein interactions, even for complexes that are challenging to study by other biophysical techniques. The biggest intrinsic limitation of HDX-MS experiment is its resolution being at the peptide-level and being limited by the observed sequence coverage. New developments over the past decades have continuously pushed the limits of what is feasible for HDX-MS (51, 57, 134, 135). Improved separation and digestion techniques like ion mobility and combined proteases are being used to obtain different sets of overlapping peptides, and thus increase the resolution through a subtractive approach. With the continued development of mathematical algorithms estimating single amide deuterium uptake from HDX-MS datasets and the increase in accessibility to gas-phase fragmentation methods along with the advancement in their data analysis software, residue-resolution HDX-MS may become more established and routine in the future.
Advances in HDX-MS prediction models have been made and were applied to discriminate between conformational poses by a comparison to experimental HDX profiles. Nevertheless, accurate computational simulation of HDX-MS data from protein sequences has proven challenging so far as the combination of all structural determinants underlying HDX kinetics are not fully understood. The development of successful methods relies heavily on the greater availability of residue-level experimental HDX data for proteins with known structures (136). This deficiency is being addressed by the establishment of the Stat2Fold database and the emerging protocols in HDX-MS experiments enabling single residue HDX datasets. Collaboration between experimental and computational researchers will provide more usable data for model development to be collected, which provides more structural insights on HDX-MS and increases the prospect of accurate HDX prediction models. In another approach to improve protein complex modeling, the incorporation of HDX-MS into molecular docking, HDX-MS data is often applied qualitatively using distance restraints to corroborate the produced docking models, with little regard to the intensity of the experimental signals. Enhanced sampling and ranking of docking models will require benchmarking of more sophisticated methods that exploit deeper layers of HDX-MS data. The future of this area is moving towards docking protocols that enable quantitative usage of HDX-MS data as restraint and weighted scoring functions to accurately generate atomic-level models of Ab-Ag complexes. Overall, an integrated platform of HDX-MS and protein complex modeling, once established, will be a powerful tool in elucidating Ab-Ag binding interfaces with high-resolution and minimum turnaround time, especially in the absence of other structural information. This advancement will propel HDX-MS to become an indispensable tool for reliable, routine, and high-throughput epitope mapping in the near future.
Author Contributions
The manuscript was mainly written by MT and CS with contributions of all authors. All authors have given approval to the final version of the manuscript.
Funding
This work was supported by a grant from the National Institute of Health (NIH) (R01 GM073151) and the RosettaCommons. Work in the Meiler laboratory is supported by the NIH (U01 AI150739 and U19 AI117905).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Lu R-M, Hwang Y-C, Liu IJ, Lee C-C, Tsai H-Z, Li H-J, et al. Development of Therapeutic Antibodies for the Treatment of Diseases. J Biomed Sci (2020) 27(1):1. doi: 10.1186/s12929-019-0592-z
2. Rodgers KR, Chou RC. Therapeutic Monoclonal Antibodies and Derivatives: Historical Perspectives and Future Directions. Biotechnol Adv (2016) 34(6):1149–58. doi: 10.1016/j.biotechadv.2016.07.004
3. Buss NAPS, Henderson SJ, McFarlane M, Shenton JM, de Haan L. Monoclonal Antibody Therapeutics: History and Future. Curr Opin Pharmacol (2012) 12(5):615–22. doi: 10.1016/j.coph.2012.08.001
4. Walsh G. Biopharmaceutical Benchmarks 2014. Nat Biotechnol (2014) 32(10):992–1000. doi: 10.1038/nbt.3040
5. Liu JK. The History of Monoclonal Antibody Development - Progress, Remaining Challenges and Future Innovations. Ann Med Surg (2012) (2014) 3(4):113–6. doi: 10.1016/j.amsu.2014.09.001
6. Crowe JE Jr. Principles of Broad and Potent Antiviral Human Antibodies: Insights for Vaccine Design. Cell Host Microbe (2017) 22(2):193–206. doi: 10.1016/j.chom.2017.07.013
7. Sela-Culang I, Kunik V, Ofran Y. The Structural Basis of Antibody-Antigen Recognition. Front Immunol (2013) 4:302(302). doi: 10.3389/fimmu.2013.00302
8. Kwong PD, DeKosky BJ, Ulmer JB. Antibody-Guided Structure-Based Vaccines. Semin Immunol (2020) 50:101428. doi: 10.1016/j.smim.2020.101428
9. Sevy AM, Wu NC, Gilchuk IM, Parrish EH, Burger S, Yousif D, et al. Multistate Design of Influenza Antibodies Improves Affinity and Breadth Against Seasonal Viruses. Proc Natl Acad Sci (2019) 116(5):1597. doi: 10.1073/pnas.1806004116
10. Xiao K, Zhao Y, Choi M, Liu H, Blanc A, Qian J, et al. Revealing the Architecture of Protein Complexes by an Orthogonal Approach Combining Hdxms, Cxms, and Disulfide Trapping. Nat Protoc (2018) 13(6):1403–28. doi: 10.1038/nprot.2018.037
11. Huang RYC, Krystek SR, Felix N, Graziano RF, Srinivasan M, Pashine A, et al. Hydrogen/Deuterium Exchange Mass Spectrometry and Computational Modeling Reveal a Discontinuous Epitope of an Antibody/Tl1a Interaction. mAbs (2018) 10(1):95. doi: 10.1080/19420862.2017.1393595
12. Huang J, Honda W. Ced: A Conformational Epitope Database. BMC Immunol (2006) 7(1):7. doi: 10.1186/1471-2172-7-7
13. Zhang MM, Beno BR, Huang RYC, Adhikari J, Deyanova EG, Li J, et al. An Integrated Approach for Determining a Protein–Protein Binding Interface in Solution and an Evaluation of Hydrogen–Deuterium Exchange Kinetics for Adjudicating Candidate Docking Models. Anal Chem (2019) 91(24):15709–17. doi: 10.1021/acs.analchem.9b03879
14. Bennett MR, Dong J, Bombardi RG, Soto C, Parrington HM, Nargi RS, et al. Human V(H)1-69 Gene-Encoded Human Monoclonal Antibodies Against Staphylococcus Aureus Isdb Use at Least Three Distinct Modes of Binding to Inhibit Bacterial Growth and Pathogenesis. mBio (2019) 10(5):e02473–19. doi: 10.1128/mBio.02473-19
15. Zhang Z, Smith DL. Determination of Amide Hydrogen Exchange by Mass Spectrometry: A New Tool for Protein Structure Elucidation. Protein Sci (1993) 2(4):522–31. doi: 10.1002/pro.5560020404
16. Wales TE, Engen JR. Hydrogen Exchange Mass Spectrometry for the Analysis of Protein Dynamics. Mass Spectromet Rev (2006) 25(1):158–70. doi: 10.1002/mas.20064
17. Konermann L, Pan J, Liu Y-H. Hydrogen Exchange Mass Spectrometry for Studying Protein Structure and Dynamics. Chem Soc Rev (2011) 40(3):1224–34. doi: 10.1039/C0CS00113A
18. Chalmers MJ, Busby SA, Pascal BD, West GM, Griffin PR. Differential Hydrogen/Deuterium Exchange Mass Spectrometry Analysis of Protein-Ligand Interactions. Expert Rev Proteomics (2011) 8(1):43–59. doi: 10.1586/epr.10.109
19. Masson GR, Burke JE, Ahn NG, Anand GS, Borchers C, Brier S, et al. Recommendations for Performing, Interpreting and Reporting Hydrogen Deuterium Exchange Mass Spectrometry (Hdx-Ms) Experiments. Nat Methods (2019) 16(7):595–602. doi: 10.1038/s41592-019-0459-y
20. Wei H, Mo J, Tao L, Russell RJ, Tymiak AA, Chen G, et al. Hydrogen/Deuterium Exchange Mass Spectrometry for Probing Higher Order Structure of Protein Therapeutics: Methodology and Applications. Drug Discov Today (2014) 19(1):95–102. doi: 10.1016/j.drudis.2013.07.019
21. Balasubramaniam D, Komives EA. Hydrogen-Exchange Mass Spectrometry for the Study of Intrinsic Disorder in Proteins. Biochim Biophys Acta (BBA) - Proteins Proteomics (2013) 1834(6):1202–9. doi: 10.1016/j.bbapap.2012.10.009
22. Bangaru S, Lang S, Schotsaert M, Vanderven HA, Zhu X, Kose N, et al. A Site of Vulnerability on the Influenza Virus Hemagglutinin Head Domain Trimer Interface. Cell (2019) 177(5):1136–52.e18. doi: 10.1016/j.cell.2019.04.011
23. Huang RYC, Iacob RE, Krystek SR, Jin M, Wei H, Tao L, et al. Characterization of Aggregation Propensity of a Human Fc-Fusion Protein Therapeutic by Hydrogen/Deuterium Exchange Mass Spectrometry. J Am Soc Mass Spectrom (2017) 28(5):795–802. doi: 10.1021/jasms.8b05521
24. Gallagher ES, Hudgens JW. Chapter Fourteen - Mapping Protein–Ligand Interactions With Proteolytic Fragmentation, Hydrogen/Deuterium Exchange-Mass Spectrometry. In: Kelman Z, editor. Methods Enzymol, vol. 566. Cambridge, Massachusetts: Academic Press (2016). p. 357–404.
25. Puchades C, Kűkrer B, Diefenbach O, Sneekes-Vriese E, Juraszek J, Koudstaal W, et al. Epitope Mapping of Diverse Influenza Hemagglutinin Drug Candidates Using Hdx-Ms. Sci Rep (2019) 9(1):4735. doi: 10.1038/s41598-019-41179-0
26. Deng B, Lento C, Wilson DJ. Hydrogen Deuterium Exchange Mass Spectrometry in Biopharmaceutical Discovery and Development – A Review. Anal Chimica Acta (2016) 940:8–20. doi: 10.1016/j.aca.2016.08.006
27. Pandit D, Tuske SJ, Coales SJ, SY E, Liu A, Lee JE, et al. Mapping of Discontinuous Conformational Epitopes by Amide Hydrogen/Deuterium Exchange Mass Spectrometry and Computational Docking. J Mol Recog (2012) 25(3):114–24. doi: 10.1002/jmr.1169
28. Hansen J, Baum A, Pascal KE, Russo V, Giordano S, Wloga E, et al. Studies in Humanized Mice and Convalescent Humans Yield a Sars-Cov-2 Antibody Cocktail. Science (2020) 369(6506):1010. doi: 10.1126/science.abd0827
29. Narang D, Lento C, Wilson DJ. Hdx-Ms: An Analytical Tool to Capture Protein Motion in Action. Biomedicines (2020) 8(7):224. doi: 10.3390/biomedicines8070224
30. Wang W, Singh S, Zeng DL, King K, Nema S. Antibody Structure, Instability, and Formulation. J Pharm Sci (2007) 96(1):1–26. doi: 10.1002/jps.20727
31. Zhu S, Liuni P, Ettorre L, Chen T, Szeto J, Carpick B, et al. Hydrogen–Deuterium Exchange Epitope Mapping Reveals Distinct Neutralizing Mechanisms for Two Monoclonal Antibodies Against Diphtheria Toxin. Biochemistry (2019) 58(6):646–56. doi: 10.1021/acs.biochem.8b01123
32. Deng B, Zhu S, Macklin AM, Xu J, Lento C, Sljoka A, et al. Suppressing Allostery in Epitope Mapping Experiments Using Millisecond Hydrogen / Deuterium Exchange Mass Spectrometry. mAbs (2017) 9(8):1327–36. doi: 10.1080/19420862.2017.1379641
33. Bennett MR, Bombardi RG, Kose N, Parrish EH, Nagel MB, Petit RA, et al. Human Mabs to Staphylococcus Aureus Isda Provide Protection Through Both Heme-Blocking and Fc-Mediated Mechanisms. J Infect Dis (2019) 219(8):1264–73. doi: 10.1093/infdis/jiy635
34. Sevy AM, Healey JF, Deng W, Spiegel PC, Meeks SL, Li R. Epitope Mapping of Inhibitory Antibodies Targeting the C2 Domain of Coagulation Factor VIII by Hydrogen-Deuterium Exchange Mass Spectrometry. J Thromb Haemost (2013) 11(12):2128–36. doi: 10.1111/jth.12433
35. Turner HL, Pallesen J, Lang S, Bangaru S, Urata S, Li S, et al. Potent Anti-Influenza H7 Human Monoclonal Antibody Induces Separation of Hemagglutinin Receptor-Binding Head Domains. PLoS Biol (2019) 17(2):e3000139–e. doi: 10.1371/journal.pbio.3000139
36. Fields JK, Kihn K, Birkedal GS, Klontz EH, Sjöström K, Günther S, et al. Molecular Basis of Selective Cytokine Signaling Inhibition by Antibodies Targeting a Shared Receptor. Front Immunol (2021) 12:779100. doi: 10.3389/fimmu.2021.779100
37. Möller IR, Slivacka M, Hausner J, Nielsen AK, Pospíšilová E, Merkle PS, et al. Improving the Sequence Coverage of Integral Membrane Proteins During Hydrogen/Deuterium Exchange Mass Spectrometry Experiments. Anal Chem (2019) 91(17):10970–8. doi: 10.1021/acs.analchem.9b00973
38. Houde D, Engen JR. Conformational Analysis of Recombinant Monoclonal Antibodies With Hydrogen/Deuterium Exchange Mass Spectrometry. Methods Mol Biol (2013) 988:269–89. doi: 10.1007/978-1-62703-327-5_17
39. Park I-H, Venable JD, Steckler C, Cellitti SE, Lesley SA, Spraggon G, et al. Estimation of Hydrogen-Exchange Protection Factors From Md Simulation Based on Amide Hydrogen Bonding Analysis. J Chem Inf Model (2015) 55(9):1914–25. doi: 10.1021/acs.jcim.5b00185
40. Zhang MM, Huang RYC, Beno BR, Deyanova EG, Li J, Chen G, et al. Epitope and Paratope Mapping of Pd-1/Nivolumab by Mass Spectrometry-Based Hydrogen–Deuterium Exchange, Cross-Linking, and Molecular Docking. Anal Chem (2020) 92(13):9086–94. doi: 10.1021/acs.analchem.0c01291
41. Narang D, James DA, Balmer MT, Wilson DJ. Protein Footprinting, Conformational Dynamics, and Core Interface-Adjacent Neutralization “Hotspots” in the Sars-Cov-2 Spike Protein Receptor Binding Domain/Human Ace2 Interaction. J Am Soc Mass Spectrom (2021) 32(7):1593–600. doi: 10.1021/jasms.0c00465
42. Xu J, Cui K, Shen L, Shi J, Li L, You L, et al. Crl Activates Transcription by Stabilizing Active Conformation of the Master Stress Transcription Initiation Factor. eLife (2019) 8:e50928. doi: 10.7554/eLife.50928
43. Engen JR, Komives EA. Complementarity of Hydrogen/Deuterium Exchange Mass Spectrometry and Cryo-Electron Microscopy. Trends Biochem Sci (2020) 45(10):906–18. doi: 10.1016/j.tibs.2020.05.005
44. Rand KD, Zehl M, Jørgensen TJD. Measuring the Hydrogen/Deuterium Exchange of Proteins at High Spatial Resolution by Mass Spectrometry: Overcoming Gas-Phase Hydrogen/Deuterium Scrambling. Account Chem Res (2014) 47(10):3018–27. doi: 10.1021/ar500194w
45. Iacob RE, Engen JR. Hydrogen Exchange Mass Spectrometry: Are We Out of the Quicksand? J Am Soc Mass Spectrom (2012) 23(6):1003–10. doi: 10.1007/s13361-012-0377-z
46. Zehl M, Rand KD, Jensen ON, Jørgensen TJD. Electron Transfer Dissociation Facilitates the Measurement of Deuterium Incorporation Into Selectively Labeled Peptides With Single Residue Resolution. J Am Chem Soc (2008) 130(51):17453–9. doi: 10.1021/ja805573h
47. Huang RYC, Chen G. Higher Order Structure Characterization of Protein Therapeutics by Hydrogen/Deuterium Exchange Mass Spectrometry. Anal Bioanal Chem (2014) 406(26):6541–58. doi: 10.1007/s00216-014-7924-3
48. Gramlich M, Hays HCW, Crichton S, Kaiser PD, Heine A, Schneiderhan-Marra N, et al. Hdx-Ms for Epitope Characterization of a Therapeutic Antibody Candidate on the Calcium-Binding Protein Annexin-A1. Antibodies (2021) 10(1):11. doi: 10.3390/antib10010011
49. Jensen PF, Comamala G, Trelle MB, Madsen JB, Jørgensen TJD, Rand KD. Removal of N-Linked Glycosylations at Acidic Ph by Pngase a Facilitates Hydrogen/Deuterium Exchange Mass Spectrometry Analysis of N-Linked Glycoproteins. Anal Chem (2016) 88(24):12479–88. doi: 10.1021/acs.analchem.6b03951
50. Comamala G, Madsen JB, Voglmeir J, Du YM, Jensen PF, Østerlund EC, et al. Deglycosylation by the Acidic Glycosidase Pngase H(+) Enables Analysis of N-Linked Glycoproteins by Hydrogen/Deuterium Exchange Mass Spectrometry. J Am Soc Mass Spectrom (2020) 31(11):2305–12. doi: 10.1021/jasms.0c00258
51. Trabjerg E, Nazari ZE, Rand KD. Conformational Analysis of Complex Protein States by Hydrogen/Deuterium Exchange Mass Spectrometry (Hdx-Ms): Challenges and Emerging Solutions. TrAC Trends Anal Chem (2018) 106:125–38. doi: 10.1016/j.trac.2018.06.008
52. Hudgens JW, Gallagher ES, Karageorgos I, Anderson KW, Filliben JJ, Huang RYC, et al. Interlaboratory Comparison of Hydrogen–Deuterium Exchange Mass Spectrometry Measurements of the Fab Fragment of Nistmab. Anal Chem (2019) 91(11):7336–45. doi: 10.1021/acs.analchem.9b01100
53. Ahn J, Cao M-J, Yu YQ, Engen JR. Accessing the Reproducibility and Specificity of Pepsin and Other Aspartic Proteases. Biochim Biophys Acta (BBA) - Proteins Proteomics (2013) 1834(6):1222–9. doi: 10.1016/j.bbapap.2012.10.003
54. Nirudodhi SN, Sperry JB, Rouse JC, Carroll JA. Application of Dual Protease Column for Hdx-Ms Analysis of Monoclonal Antibodies. J Pharm Sci (2017) 106(2):530–6. doi: 10.1016/j.xphs.2016.10.023
55. Tremblay CY, Limpikirati P, Vachet RW. Complementary Structural Information for Stressed Antibodies From Hydrogen–Deuterium Exchange and Covalent Labeling Mass Spectrometry. J Am Soc Mass Spectrom (2021) 32(5):1237–48. doi: 10.1021/jasms.1c00072
57. Oganesyan I, Lento C, Wilson DJ. Contemporary Hydrogen Deuterium Exchange Mass Spectrometry. Methods (2018) 144:27–42. doi: 10.1016/j.ymeth.2018.04.023
58. Abzalimov RR, Kaltashov IA. Controlling Hydrogen Scrambling in Multiply Charged Protein Ions During Collisional Activation: Implications for Top-Down Hydrogen/Deuterium Exchange Ms Utilizing Collisional Activation in the Gas Phase. Anal Chem (2010) 82(3):942–50. doi: 10.1021/ac9021874
59. Hamuro Y, Coales SJ. Optimization of Feasibility Stage for Hydrogen/Deuterium Exchange Mass Spectrometry. J Am Soc Mass Spectrom (2018) 29(3):623–9. doi: 10.1021/jasms.8b05771
60. Garcia Natalie K, Guttman M, Ebner Jamie L, Lee Kelly K. Dynamic Changes During Acid-Induced Activation of Influenza Hemagglutinin. Structure (2015) 23(4):665–76. doi: 10.1016/j.str.2015.02.006
61. Illiano A, Pinto G, Melchiorre C, Carpentieri A, Faraco V, Amoresano A. Protein Glycosylation Investigated by Mass Spectrometry: An Overview. Cells (2020) 9(9):1986. doi: 10.3390/cells9091986
62. Zhu R, Zacharias L, Wooding KM, Peng W, Mechref Y. Glycoprotein Enrichment Analytical Techniques: Advantages and Disadvantages. Methods Enzymol (2017) 585:397–429. doi: 10.1016/bs.mie.2016.11.009
63. Fruton JS, Bergmann M. The Specificity of Pepsin Action. Science (1938) 87(2268):557. doi: 10.1126/science.87.2268.557
64. Rob T, Gill PK, Golemi-Kotra D, Wilson DJ. An Electrospray Ms-Coupled Microfluidic Device for Sub-Second Hydrogen/Deuterium Exchange Pulse-Labelling Reveals Allosteric Effects in Enzyme Inhibition. Lab Chip (2013) 13(13):2528–32. doi: 10.1039/c3lc00007a
65. Wilson DJ, Konermann L. A Capillary Mixer With Adjustable Reaction Chamber Volume for Millisecond Time-Resolved Studies by Electrospray Mass Spectrometry. Anal Chem (2003) 75(23):6408–14. doi: 10.1021/ac0346757
66. Grauslund LR, Calvaresi V, Pansegrau W, Norais N, Rand KD. Epitope and Paratope Mapping by Hdx-Ms Combined With Spr Elucidates the Difference in Bactericidal Activity of Two Anti-Nada Monoclonal Antibodies. J Am Soc Mass Spectrom (2021) 32(7):1575–82. doi: 10.1021/jasms.0c00431
67. Nguyen D, Mayne L, Phillips MC, Walter Englander S. Reference Parameters for Protein Hydrogen Exchange Rates. J Am Soc Mass Spectrom (2018) 29(9):1936–9. doi: 10.1007/s13361-018-2021-z
68. Bai Y, Milne JS, Mayne L, Englander SW. Primary Structure Effects on Peptide Group Hydrogen Exchange. Proteins (1993) 17(1):75–86. doi: 10.1002/prot.340170110
69. Cieplak-Rotowska MK, Tarnowski K, Rubin M, Fabian MR, Sonenberg N, Dadlez M, et al. Structural Dynamics of the Gw182 Silencing Domain Including Its Rna Recognition Motif (Rrm) Revealed by Hydrogen-Deuterium Exchange Mass Spectrometry. J Am Soc Mass Spectrom (2018) 29(1):158–73. doi: 10.1007/s13361-017-1830-9
70. Zhu S, Shala A, Bezginov A, Sljoka A, Audette G, Wilson DJ. Hyperphosphorylation of Intrinsically Disordered Tau Protein Induces an Amyloidogenic Shift in Its Conformational Ensemble. PLoS One (2015) 10(3):e0120416. doi: 10.1371/journal.pone.0120416
71. Kan Z-Y, Walters BT, Mayne L, Englander SW. Protein Hydrogen Exchange at Residue Resolution by Proteolytic Fragmentation Mass Spectrometry Analysis. Proc Natl Acad Sci (2013) 110(41):16438. doi: 10.1073/pnas.1315532110
72. Saltzberg DJ, Broughton HB, Pellarin R, Chalmers MJ, Espada A, Dodge JA, et al. A Residue-Resolved Bayesian Approach to Quantitative Interpretation of Hydrogen–Deuterium Exchange From Mass Spectrometry: Application to Characterizing Protein–Ligand Interactions. J Phys Chem B (2017) 121(15):3493–501. doi: 10.1021/acs.jpcb.6b09358
73. Keppel TR, Weis DD. Mapping Residual Structure in Intrinsically Disordered Proteins at Residue Resolution Using Millisecond Hydrogen/Deuterium Exchange and Residue Averaging. J Am Soc Mass Spectrom (2015) 26(4):547–54. doi: 10.1007/s13361-014-1033-6
74. Althaus E, Canzar S, Ehrler C, Emmett MR, Karrenbauer A, Marshall AG, et al. Computing H/D-Exchange Rates of Single Residues From Data of Proteolytic Fragments. BMC Bioinf (2010) 11(1):424. doi: 10.1186/1471-2105-11-424
75. Babić D, Kazazić S, Smith DM. Resolution of Protein Hydrogen/Deuterium Exchange by Fitting Amide Exchange Probabilities to the Peptide Isotopic Envelopes. Rapid Commun Mass Spectromet (2019) 33(15):1248–57. doi: 10.1002/rcm.8460
76. Zhang Z. Complete Extraction of Protein Dynamics Information in Hydrogen/Deuterium Exchange Mass Spectrometry Data. Anal Chem (2020) 92(9):6486–94. doi: 10.1021/acs.analchem.9b05724
77. Hamuro Y, SY E. Determination of Backbone Amide Hydrogen Exchange Rates of Cytochrome C Using Partially Scrambled Electron Transfer Dissociation Data. J Am Soc Mass Spectrom (2018) 29(5):989–1001. doi: 10.1007/s13361-018-1892-3
78. Hamuro Y. Determination of Equine Cytochrome C Backbone Amide Hydrogen/Deuterium Exchange Rates by Mass Spectrometry Using a Wider Time Window and Isotope Envelope. J Am Soc Mass Spectrom (2017) 28(3):486–97. doi: 10.1021/jasms.8b05485
79. Gessner C, Steinchen W, Bédard S, J. Skinner J, Woods VL, Walsh TJ, et al. Computational Method Allowing Hydrogen-Deuterium Exchange Mass Spectrometry at Single Amide Resolution. Sci Rep (2017) 7(1):3789. doi: 10.1038/s41598-017-03922-3
80. Skinner SP, Radou G, Tuma R, Houwing-Duistermaat JJ, Paci E. Estimating Constraints for Protection Factors From Hdx-Ms Data. Biophys J (2019) 116(7):1194–203. doi: 10.1016/j.bpj.2019.02.024
81. Bai Y, Sosnick TR, Mayne L, Englander SW. Protein Folding Intermediates: Native-State Hydrogen Exchange. Science (1995) 269(5221):192–7. doi: 10.1126/science.7618079
82. Liu T, Pantazatos D, Li S, Hamuro Y, Hilser VJ, Woods VL. Quantitative Assessment of Protein Structural Models by Comparison of H/D Exchange Ms Data With Exchange Behavior Accurately Predicted by Dxcorex. J Am Soc Mass Spectrom (2012) 23(1):43–56. doi: 10.1021/jasms.8b04128
83. Best RB, Vendruscolo M. Structural Interpretation of Hydrogen Exchange Protection Factors in Proteins: Characterization of the Native State Fluctuations of Ci2. Structure (2006) 14(1):97–106. doi: 10.1016/j.str.2005.09.012
84. Nikita VD, Oxana VG. Prediction of Residue Status to Be Protected or Not Protected From Hy-Drogen Exchange Using Amino Acid Sequence Only. Open Biochem J (2008) 2:77–80. doi: 10.2174/1874091x00802010077
85. Dovidchenko NV, Lobanov MY, Garbuzynskiy SO, Galzitskaya OV. Prediction of Amino Acid Residues Protected From Hydrogen-Deuterium Exchange in a Protein Chain. Biochem Biokhimiia (2009) 74(8):888–97. doi: 10.1134/s0006297909080100
86. Craig PO, Lätzer J, Weinkam P, Hoffman RMB, Ferreiro DU, Komives EA, et al. Prediction of Native-State Hydrogen Exchange From Perfectly Funneled Energy Landscapes. J Am Chem Soc (2011) 133(43):17463–72. doi: 10.1021/ja207506z
87. Petruk AA, Defelipe LA, Rodríguez Limardo RG, Bucci H, Marti MA, Turjanski AG. Molecular Dynamics Simulations Provide Atomistic Insight Into Hydrogen Exchange Mass Spectrometry Experiments. J Chem Theory Comput (2013) 9(1):658–69. doi: 10.1021/ct300519v
88. Sljoka A, Wilson D. Probing Protein Ensemble Rigidity and Hydrogen-Deuterium Exchange. Phys Biol (2013) 10(5):56013. doi: 10.1088/1478-3975/10/5/056013
89. Persson F, Halle B. How Amide Hydrogens Exchange in Native Proteins. Proc Natl Acad Sci USA (2015) 112(33):10383–8. doi: 10.1073/pnas.1506079112
90. Vertrees J, Barritt P, Whitten S, Hilser VJ. Corex/Best Server: A Web Browser-Based Program That Calculates Regional Stability Variations Within Protein Structures. Bioinformatics (2005) 21(15):3318–9. doi: 10.1093/bioinformatics/bti520
91. Vendruscolo M, Paci E, Dobson CM, Karplus M. Rare Fluctuations of Native Proteins Sampled by Equilibrium Hydrogen Exchange. J Am Chem Soc (2003) 125(51):15686–7. doi: 10.1021/ja036523z
92. Borysik AJ. Simulated Isotope Exchange Patterns Enable Protein Structure Determination. Angewandte Chemie Int Edit (2017) 56(32):9396–9. doi: 10.1002/anie.201704604
93. Bradshaw RT, Marinelli F, Faraldo-Gómez JD, Forrest LR. Interpretation of Hdx Data by Maximum-Entropy Reweighting of Simulated Structural Ensembles. Biophys J (2020) 118(7):1649–64. doi: 10.1016/j.bpj.2020.02.005
94. Harris MJ, Raghavan D, Borysik AJ. Quantitative Evaluation of Native Protein Folds and Assemblies by Hydrogen Deuterium Exchange Mass Spectrometry (Hdx-Ms). J Am Soc Mass Spectrom (2019) 30(1):58–66. doi: 10.1007/s13361-018-2070-3
95. Claesen J, Politis A. Poppet: A New Method to Predict the Protection Factor of Backbone Amide Hydrogens. J Am Soc Mass Spectrom (2019) 30(1):67–76. doi: 10.1007/s13361-018-2068-x
96. Mohammadiarani H, Shaw VS, Neubig RR, Vashisth H. Interpreting Hydrogen–Deuterium Exchange Events in Proteins Using Atomistic Simulations: Case Studies on Regulators of G-Protein Signaling Proteins. J Phys Chem B (2018) 122(40):9314–23. doi: 10.1021/acs.jpcb.8b07494
97. Marcsisin SR, Engen JR. Hydrogen Exchange Mass Spectrometry: What Is It and What Can It Tell Us? Anal Bioanal Chem (2010) 397(3):967–72. doi: 10.1007/s00216-010-3556-4
98. McAllister RG, Konermann L. Challenges in the Interpretation of Protein H/D Exchange Data: A Molecular Dynamics Simulation Perspective. Biochemistry (2015) 54(16):2683–92. doi: 10.1021/acs.biochem.5b00215
99. Skinner JJ, Lim WK, Bédard S, Black BE, Englander SW. Protein Hydrogen Exchange: Testing Current Models. Protein Sci (2012) 21(7):987–95. doi: 10.1002/pro.2082
100. Gershoni JM, Roitburd-Berman A, Siman-Tov DD, Tarnovitski Freund N, Weiss Y. Epitope Mapping: The First Step in Developing Epitope-Based Vaccines. BioDrugs (2007) 21(3):145–56. doi: 10.2165/00063030-200721030-00002
101. Sivasubramanian A, Sircar A, Chaudhury S, Gray JJ. Toward High-Resolution Homology Modeling of Antibody Fv Regions and Application to Antibody-Antigen Docking. Proteins (2009) 74(2):497–514. doi: 10.1002/prot.22309
102. Chaudhury S, Berrondo M, Weitzner BD, Muthu P, Bergman H, Gray JJ. Benchmarking and Analysis of Protein Docking Performance in Rosetta V3.2. PLoS One (2011) 6(8):e22477–e. doi: 10.1371/journal.pone.0022477
103. Chaudhury S, Gray JJ. Conformer Selection and Induced Fit in Flexible Backbone Protein–Protein Docking Using Computational and Nmr Ensembles. J Mol Biol (2008) 381(4):1068–87. doi: 10.1016/j.jmb.2008.05.042
104. Wang C, Bradley P, Baker D. Protein–Protein Docking With Backbone Flexibility. J Mol Biol (2007) 373(2):503–19. doi: 10.1016/j.jmb.2007.07.050
105. Mandell JG, Roberts VA, Pique ME, Kotlovyi V, Mitchell JC, Nelson E, et al. Protein Docking Using Continuum Electrostatics and Geometric Fit. Protein Engineer Design Select (2001) 14(2):105–13. doi: 10.1093/protein/14.2.105
106. Kastritis PL, Rodrigues JPGLM, Bonvin AMJJ. Haddock2p2i: A Biophysical Model for Predicting the Binding Affinity of Protein–Protein Interaction Inhibitors. J Chem Inf Model (2014) 54(3):826–36. doi: 10.1021/ci4005332
107. Pierce BG, Wiehe K, Hwang H, Kim B-H, Vreven T, Weng Z. Zdock Server: Interactive Docking Prediction of Protein–Protein Complexes and Symmetric Multimers. Bioinformatics (2014) 30(12):1771–3. doi: 10.1093/bioinformatics/btu097
108. Comeau SR, Gatchell DW, Vajda S, Camacho CJ. Cluspro: An Automated Docking and Discrimination Method for the Prediction of Protein Complexes. Bioinformatics (2004) 20(1):45–50. doi: 10.1093/bioinformatics/btg371
109. Schneidman-Duhovny D, Inbar Y, Nussinov R, Wolfson HJ. Patchdock and Symmdock: Servers for Rigid and Symmetric Docking. Nucleic Acids Res (2005) 33(suppl_2):W363–W7. doi: 10.1093/nar/gki481
110. Gabb HA, Jackson RM, Sternberg MJE. Modelling Protein Docking Using Shape Complementarity, Electrostatics and Biochemical Information11edited by J. Thornton J Mol Biol (1997) 272(1):106–20. doi: 10.1006/jmbi.1997.1203
111. Bender BJ, Cisneros A, Duran AM, Finn JA, Fu D, Lokits AD, et al. Protocols for Molecular Modeling With Rosetta3 and Rosettascripts. Biochemistry (2016) 55(34):4748–63. doi: 10.1021/acs.biochem.6b00444
112. Gray JJ, Moughon S, Wang C, Schueler-Furman O, Kuhlman B, Rohl CA, et al. Protein–Protein Docking With Simultaneous Optimization of Rigid-Body Displacement and Side-Chain Conformations. J Mol Biol (2003) 331(1):281–99. doi: 10.1016/S0022-2836(03)00670-3
113. Weitzner BD, Jeliazkov JR, Lyskov S, Marze N, Kuroda D, Frick R, et al. Modeling and Docking of Antibody Structures With Rosetta. Nat Protoc (2017) 12(2):401–16. doi: 10.1038/nprot.2016.180
114. Sela-Culang I, Alon S, Ofran Y. A Systematic Comparison of Free and Bound Antibodies Reveals Binding-Related Conformational Changes. J Immunol (2012) 189(10):4890–9. doi: 10.4049/jimmunol.1201493
115. Guest JD, Vreven T, Zhou J, Moal I, Jeliazkov JR, Gray JJ, et al. An Expanded Benchmark for Antibody-Antigen Docking and Affinity Prediction Reveals Insights Into Antibody Recognition Determinants. Structure (2021) 29(6):606–21. doi: 10.1016/j.str.2021.01.005
116. Sircar A, Gray JJ. Snugdock: Paratope Structural Optimization During Antibody-Antigen Docking Compensates for Errors in Antibody Homology Models. PLoS Comput Biol (2010) 6(1):e1000644–e. doi: 10.1371/journal.pcbi.1000644
117. Seffernick JT, Lindert S. Hybrid Methods for Combined Experimental and Computational Determination of Protein Structure. J Chem Phys (2020) 153(24):240901. doi: 10.1063/5.0026025
118. Schoeder CT, Schmitz S, Adolf-Bryfogle J, Sevy AM, Finn JA, Sauer MF, et al. Modeling Immunity With Rosetta: Methods for Antibody and Antigen Design. Biochemistry (2021) 60(11):825–46. doi: 10.1021/acs.biochem.0c00912
119. Sivasubramanian A, Chao G, Pressler HM, Wittrup KD, Gray JJ. Structural Model of the Mab 806-Egfr Complex Using Computational Docking Followed by Computational and Experimental Mutagenesis. Structure (2006) 14(3):401–14. doi: 10.1016/j.str.2005.11.022
120. Thornburg NJ, Nannemann DP, Blum DL, Belser JA, Tumpey TM, Deshpande S, et al. Human Antibodies That Neutralize Respiratory Droplet Transmissible H5n1 Influenza Viruses. J Clin Invest (2013) 123(10):4405–9. doi: 10.1172/JCI69377
121. Hauri S, Khakzad H, Happonen L, Teleman J, Malmström J, Malmström L. Rapid Determination of Quaternary Protein Structures in Complex Biological Samples. Nat Commun (2019) 10(1):192. doi: 10.1038/s41467-018-07986-1
122. Kuenze G, Bonneau R, Leman JK, Meiler J. Integrative Protein Modeling in Rosettanmr From Sparse Paramagnetic Restraints. Structure (2019) 27(11):1721–34.e5. doi: 10.1016/j.str.2019.08.012
123. Marze NA, Jeliazkov JR, Roy Burman SS, Boyken SE, DiMaio F, Gray JJ. Modeling Oblong Proteins and Water-Mediated Interfaces With Rosettadock in Capri Rounds 28–35. Prot: Struct Function Bioinf (2017) 85(3):479–86. doi: 10.1002/prot.25168
124. Pu C, Yan G, Shi J, Li R. Assessing the Performance of Docking Scoring Function, Fep, Mm-Gbsa, and Qm/Mm-Gbsa Approaches on a Series of Plk1 Inhibitors. Medchemcomm (2017) 8(7):1452–8. doi: 10.1039/c7md00184c
125. Roberts VA, Pique ME, Hsu S, Li S. Combining H/D Exchange Mass Spectrometry and Computational Docking to Derive the Structure of Protein–Protein Complexes. Biochemistry (2017) 56(48):6329–42. doi: 10.1021/acs.biochem.7b00643
126. Komolov KE, Du Y, Duc NM, Betz RM, Rodrigues JPGLM, Leib RD, et al. Structural and Functional Analysis of a β2- Adrenergic Receptor Complex With Grk5. Cell (2017) 169(3):407–21.e16. doi: 10.1016/j.cell.2017.03.047
127. Lin S-J, Chen Y-F, Hsu K-C, Chen Y-L, Ko T-P, Lo C-F, et al. Structural Insights to the Heterotetrameric Interaction Between the Vibrio Parahaemolyticus Piravp and Pirbvp Toxins and Activation of the Cry-Like Pore-Forming Domain. Toxins (2019) 11(4):233. doi: 10.3390/toxins11040233
128. Huang L, So P-K, Yao Z-P. Protein Dynamics Revealed by Hydrogen/Deuterium Exchange Mass Spectrometry: Correlation Between Experiments and Simulation. Rapid Commun Mass Spectromet (2019) 33(S3):83–9. doi: 10.1002/rcm.8307
129. Houde D, Berkowitz SA, Engen JR. The Utility of Hydrogen/Deuterium Exchange Mass Spectrometry in Biopharmaceutical Comparability Studies. J Pharm Sci (2011) 100(6):2071–86. doi: 10.1002/jps.22432
130. Marzolf DR, Seffernick JT, Lindert S. Protein Structure Prediction From Nmr Hydrogen–Deuterium Exchange Data. J Chem Theory Comput (2021) 17(4):2619–29. doi: 10.1021/acs.jctc.1c00077
131. Kim SS, Seffernick JT, Lindert S. Accurately Predicting Disordered Regions of Proteins Using Rosetta Residuedisorder Application. J Phys Chem B (2018) 122(14):3920–30. doi: 10.1021/acs.jpcb.8b01763
132. Seffernick JT, Ren H, Kim SS, Lindert S. Measuring Intrinsic Disorder and Tracking Conformational Transitions Using Rosetta Residuedisorder. J Phys Chem B (2019) 123(33):7103–12. doi: 10.1021/acs.jpcb.9b04333
133. Pancsa R, Varadi M, Tompa P, Vranken WF. Start2fold: A Database of Hydrogen/Deuterium Exchange Data on Protein Folding and Stability. Nucleic Acids Res (2015) 44(D1):D429–D34. doi: 10.1093/nar/gkv1185
134. James EI, Murphree TA, Vorauer C, Engen JR, Guttman M. Advances in Hydrogen/Deuterium Exchange Mass Spectrometry and the Pursuit of Challenging Biological Systems. Chem Rev (2021) 122(8):7562–623. doi: 10.1021/acs.chemrev.1c00279
135. Engen JR, Botzanowski T, Peterle D, Georgescauld F, Wales TE. Developments in Hydrogen/Deuterium Exchange Mass Spectrometry. Anal Chem (2021) 93(1):567–82. doi: 10.1021/acs.analchem.0c04281
Keywords: hydrogen-deuterium exchange mass spectrometry (HDX-MS), antibody-antigen interaction, epitope-paratope identification, protein-protein docking, structure modeling, integrative structural biology
Citation: Tran MH, Schoeder CT, Schey KL and Meiler J (2022) Computational Structure Prediction for Antibody-Antigen Complexes From Hydrogen-Deuterium Exchange Mass Spectrometry: Challenges and Outlook. Front. Immunol. 13:859964. doi: 10.3389/fimmu.2022.859964
Received: 22 January 2022; Accepted: 22 April 2022;
Published: 26 May 2022.
Edited by:
Pui-Kin So, Hong Kong Polytechnic University, Hong Kong SAR, ChinaReviewed by:
Eric J. Sundberg, Emory University, United StatesKannan Sankar, Novartis Institutes for BioMedical Research, United States
Copyright © 2022 Tran, Schoeder, Schey and Meiler. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jens Meiler, amVuc0BtZWlsZXJsYWIub3Jn; Kevin L. Schey, ay5zY2hleUB2YW5kZXJiaWx0LmVkdQ==