Solvated interaction energy: from small-molecule to antibody drug design

Purisima, Enrico O.; Corbeil, Christopher R.; Gaudreault, Francis; Wei, Wanlei; Deprez, Christophe; Sulea, Traian

doi:10.3389/fmolb.2023.1210576

REVIEW article

Front. Mol. Biosci. , 07 June 2023

Sec. Protein Biochemistry for Basic and Applied Sciences

Volume 10 - 2023 | https://doi.org/10.3389/fmolb.2023.1210576

This article is part of the Research Topic Computational Methods for Protein Characterization: In Memoriam of Donald Abraham View all 6 articles

Solvated interaction energy: from small-molecule to antibody drug design

Enrico O. Purisima

Christopher R. Corbeil

Francis Gaudreault

Wanlei Wei

Christophe Deprez

Traian Sulea*

Human Health Therapeutics Research Centre, National Research Council Canada, Montreal, QC, Canada

Scoring functions are ubiquitous in structure-based drug design as an aid to predicting binding modes and estimating binding affinities. Ideally, a scoring function should be broadly applicable, obviating the need to recalibrate and refit its parameters for every new target and class of ligands. Traditionally, drugs have been small molecules, but in recent years biologics, particularly antibodies, have become an increasingly important if not dominant class of therapeutics. This makes the goal of having a transferable scoring function, i.e., one that spans the range of small-molecule to protein ligands, even more challenging. One such broadly applicable scoring function is the Solvated Interaction Energy (SIE), which has been developed and applied in our lab for the last 15 years, leading to several important applications. This physics-based method arose from efforts to understand the physics governing binding events, with particular care given to the role played by solvation. SIE has been used by us and many independent labs worldwide for virtual screening and discovery of novel small-molecule binders or optimization of known drugs. Moreover, without any retraining, it is found to be transferrable to predictions of antibody-antigen relative binding affinities and as accurate as functions trained on protein-protein binding affinities. SIE has been incorporated in conjunction with other scoring functions into ADAPT (Assisted Design of Antibody and Protein Therapeutics), our platform for affinity modulation of antibodies. Application of ADAPT resulted in the optimization of several antibodies with 10-to-100-fold improvements in binding affinity. Further applications included broadening the specificity of a single-domain antibody to be cross-reactive with virus variants of both SARS-CoV-1 and SARS-CoV-2, and the design of safer antibodies by engineering of a pH switch to make them more selective towards acidic tumors while sparing normal tissues at physiological pH.

1 Introduction

Structure-based drug design depends on computational methods for predicting binding modes and estimating binding affinities. These typically rely on scoring functions that can be classified into four categories: empirical, knowledge-based, physics-based and, more recently, artificial intelligence-based, encompassing descriptor-based machine learning to deep learning approaches (Gohlke and Klebe, 2002; Gilson and Zhou, 2007; Liu and Wang, 2015; Geng et al., 2019; Dhakal et al., 2022). For the last 15 years, our group has been developing and applying Solvated Interaction Energy (SIE), a structure-based scoring function for predicting intermolecular binding affinities in aqueous solution (Naim et al., 2007; Cui et al., 2008; Sulea et al., 2011; Sulea and Purisima, 2012b; Sulea et al., 2016; Vivcharuk et al., 2017). Falling within the physics-based category, scoring functions such as SIE continue to be attractive as their results are more readily interpretable due to the connection to the underlying physics enabling a rationally-driven modulation of binding affinities. As its name suggests, solvation is an important component of SIE given its major role in binding and is captured through a continuum solvation model.

In this overview, we describe the development of the SIE scoring function and its applications. A notable characteristic of this scoring function is its versatility. Although initially developed in the context of predicting binding affinities for protein-ligand interactions involving small molecules, SIE has found much broader applicability. We give examples of this versatility as we survey applications to docking and virtual screening for both small-molecule ligands and biologics. A demanding test of the usefulness and performance of any scoring function is its application to a wide variety of biological systems as well as its use in the hands of users from other laboratories aside from that of the developers. To assess those, we analyzed the usage and performance of SIE as reported in close to 400 publications.

2 The SIE scoring function approach

2.1 Original calibration of SIE

The overarching theme for the development of the SIE scoring function was to leverage existing work in force fields while keeping the number of fitted parameters to a minimum to guard against overfitting. Leveraging the works of AMBER (Case et al., 2005; Hornak et al., 2006) and GAFF (Wang et al., 2004) for the van der Waals and electrostatics interactions allowed SIE to use well-established sets of basic parameters for these terms. In addition, solvation terms from a continuum solvation model were supplemented. The linear combination of these terms gave rise to a functional form of SIE with five parameters that were fitted to reproduce experimental binding affinities (Eq. 1) (Naim et al., 2007). It should be noted that only 3 parameters (ρ, D_in and γ) affect the actual correlation with experiment, e.g., the relative ranking of affinities.

Δ G_{b i n d} (ρ, D_{i n}, α, γ, C) = α [{E_{v d W} + E}_{c o u l} (D_{i n}) + Δ G_{b i n d}^{R} (ρ, D_{i n}) + γ Δ M S A (ρ)] + C (1)

E_vdW and E_coul are the intermolecular van der Waals and Coulomb energies using the AMBER/GAFF force field. The Coulomb energy depends on the parameter D_in, the solute interior dielectric constant, as does the change in solvation reaction field energy, ΔG^R_bind. The ΔMSA term is the change in molecular surface area, and its contribution to the nonpolar solvation energy is proportional to area with a coefficient of γ. Both ΔG^R_bind and ΔMSA depend on the choice of radii used to define the solute-solvent dielectric boundary. These are set to the AMBER/GAFF Lennard-Jones radii linearly scaled by a factor ρ, one of the fitting parameters. The coefficient α is a global scaling factor and C a translation constant to bring the SIE score to the same magnitude as the experimental values of a training set.

The five parameters were fitted by minimizing the mean absolute deviation of the predicted versus experimental binding free energies for 11 targets comprising 99 protein-ligand complexes (Naim et al., 2007). These targets had six or more representative protein-ligand crystal structures with corresponding published binding affinity data. The published fitted parameters in use since 2008 are α = 0.1048, D_in = 2.25, ρ = 1.1, γ = 0.0129 kcal/(mol·Å²) and C = −2.89 kcal/mol (Cui et al., 2008). Note that these are slightly modified from the original 2007 fitted parameters (Naim et al., 2007). Overall, the correlation yielded a mean absolute deviation of about 1.4 kcal/mol in binding free energy for the 99 complexes.

We noted that the global proportionality coefficient α significantly scales down the sum of the various energy terms. We interpreted this as roughly capturing entropy-enthalpy compensation (Sharp, 2001; Reynolds and Holloway, 2011), i.e., the more negative ΔH is, the greater the −TΔS cost of binding will be. Gilson and coworkers have also observed a strong correlation between changes in computed configurational entropy, ΔS, and changes in potential energy plus solvation, Δ(U + W), upon binding in their studies of host-guest complexes (Chang and Gilson, 2004; Chen et al., 2004). One can roughly think of this as the tendency of stronger interactions to narrow the energy well of the complex, increasing the entropic cost of binding. In their study, Gilson and coworkers found that the entropic cost, −TΔS, cancels out about 90% of Δ(U + W). That degree of compensation is similar to the scaling by α of the interaction energy plus solvation in SIE. Thus, although SIE does not explicitly include an entropy term (aside from the solvation entropy included in the nonpolar surface area term), some of the entropic effects are implicitly contained in the α scaling factor.

2.2 Incorporation of solvation models

The solvation contribution is a critical term in the SIE score. This is incorporated using a continuum solvation model with the electrostatic component calculated using a boundary element solution (Purisima and Nilar, 1995; Purisima, 1998) of the Poisson equation. Our implementation of the boundary element method (BEM) for continuum electrostatics was noteworthy in that it was one of the earliest implementations of BEM that was computationally efficient enough to be applied to macromolecular systems (Purisima and Nilar, 1995). The nonpolar contribution was proportional to the molecular surface area. Over time it was realized that the usual continuum solvation model based on a table of atomic radii for a set of atom types and the use of just a surface area term for all of the nonpolar contributions had serious deficiencies. In particular, it could not capture the observed charge asymmetry of reaction field energies (Purisima and Sulea, 2009). In addition, the simple surface area model missed some of the nuances of solute-solvent van der Waals interactions. To address these deficiencies, the FiSH (First Shell Hydration) solvation model was developed (Corbeil et al., 2010). This makes the Born radii dependent not just on the atom type but also on the local electrostatic potential felt at the dielectric boundary. In addition, a continuum van der Waals model was incorporated that had two terms, one for the interaction with the first shell of water around the solute, and a second one for the interaction with more distant water molecules. Both the standard and FiSH solvation models are available within the SIE program.

2.3 Single-structure and MD-trajectory modes

SIE can be applied to a single conformation of a complex or it can be applied to snapshots of a molecular dynamics (MD) simulation from which SIE averages are calculated (Cui et al., 2008). A software package, sietraj, consisting of scripts and executables used to process a single conformation or an AMBER-generated MD trajectory is downloadable from https://mm.nrc-cnrc.gc.ca/sietraj/ (Sulea and Purisima, 2012b). Virtual alanine scanning of selected residues using an MD trajectory can be done as well. Calculating an average SIE from snapshots of an MD trajectory allows sampling conformations around an energy minimum and has the potential advantage of reducing the bias that may come from a single static conformation. However, for virtual screening applications, scoring a single conformation from a well-prepared energy-minimized structure can give satisfactory results and sufficient enrichment.

2.4 Relation to other physics-based methods

The different physics-based scoring functions in use vary in terms of the van der Waals parameters, partial charges, treatment of solvation and conformational sampling. Most have a simplified solvation term for computational speed. Of the more commonly used ones, MM-PBSA (Massova and Kollman, 2000; Rastelli et al., 2010) is the most similar to SIE. In fact, the two methods are sometimes used side by side to corroborate the results of each method. Just like SIE, it uses the AMBER/GAFF force field for van der Waals parameters and partial charges. It also models solvation effects with a high-quality continuum solvation model. MM-PBSA does have the further ability to incorporate ionic strength, which SIE does not have. Another difference between the two methods is that MM-PBSA has an explicit entropic term based on a normal mode calculation, i.e., the curvature at the bottom of the multi-dimensional potential energy well of an energy-minimized snapshot. However, this can make it significantly more expensive than SIE when analyzing an MD trajectory. Interestingly, although MM-PBSA has a more detailed accounting of entropy, MM-PBSA greatly over-estimates the magnitude of the absolute binding free energy while SIE scores are typically closer in magnitude to the measured binding affinities.

Most of the computational time of SIE is spent on the boundary element solution (BEM) of the Poisson equation for the continuum electrostatics solvation model. The SIE score requires three calculations–the solvation free energy of the complex and of each partner in the free state. As an example of computational cost, for a complex of two proteins each about 120 residues, these three calculations altogether take about 8 CPU seconds on a single core (Intel Xeon Silver 4116).

More detailed but computationally expensive physics-based methods are alchemical free energy methods (Chodera et al., 2011; Mey et al., 2020). These methods calculate the relative free energy change by gradually transforming one molecular entity into another and require extensive molecular dynamics sampling. Although in principle more rigorous than end-point methods such as SIE, in the blind tests (Skillman, 2012; Muddana et al., 2014; Gaieb et al., 2018) of predicting binding affinities mentioned in Section 3.2, they have not shown a clear advantage over the simpler end-point methods in terms of accuracy.

3 Small-molecule drug design

Intended for applications in small-molecule drug design, the original training of the SIE function was carried on small-molecule–protein binding data. The robustness of the trained SIE function in terms of accuracy and transferability was demonstrated by its many prospective and retrospective applications in laboratories worldwide, compiled and analyzed in Section 5. We also unbiasedly tested and benchmarked SIE during the past 15 years in community-wide challenges, including SAMPL (Statistical Assessment of the Modeling of Proteins and Ligands; https://www.samplchallenges.org/), CSAR (Community Structure-Activity Resource; http://www.csardock.org/) and D3R-GR (Drug Design Data Resource–Grand Challenge; https://drugdesigndata.org/about/grand-challenge) (Table 1). We not only tested SIE’s accuracy to predict binding affinities, but also its underlying solvation models as well as its derivative applications to ligand docking and virtual screening.

TABLE 1

TABLE 1. Testing of SIE and related methods in community-wide challenges.

3.1 Solvation free energy

The prediction accuracy and transferability of the BEM and FiSH solvation models underpinning SIE was tested early on. The first stringent test was predicting hydration free energies on the challenging drug-like SAMPL-1 blind data set of 63 highly polyfunctional organic compounds (Sulea et al., 2009). Surprisingly, we found that the BEM electrostatic-only solvation model afforded smaller absolute errors and was more transferable than a more complex continuum electrostatics-dispersion (CED) solvation model that added continuum nonpolar solvation terms parametrized by atom types. Indeed, even if the prospective predictions of the more complex CED model were highly correlated to experiment (R² = 0.82), they had systematic errors, mainly associated to compounds with multiple hydrogen bonds. This apparent shortcoming prompted the evolution of the CED solvation model into the FiSH solvation model, by separating the first shell of hydration and parametrizing both electrostatic and non-electrostatic terms based on explicit-solvent simulation data (Corbeil et al., 2010; Sulea et al., 2010). Testing of the FiSH model on the obscured set of 23 highly diverse and polyfunctional compounds of the SAMPL-2 challenge indicated improved accuracy and transferability relative to its CED predecessor model, with mean unsigned errors (MUE) more consistently below 2 kcal/mol (Purisima et al., 2010). However, accuracy was still varied across functional classes, calling for more detailed parametrization zooming on particular chemical classes and polyfunctional compounds. This was demonstrated in SAMPL-3, which challenged solvation predictors with 36 poly-chlorinated analogs (Sulea and Purisima, 2012a). A stark difference was observed between the accuracies of FiSH solvation predictions for aliphatic poly-Cl (R² = 0.52; MUE = 0.66 kcal/mol) and aromatic poly-Cl (R² = 0.05; MUE = 3.43 kcal/mol) compounds. Recalibration of the aromatic Cl parameters on explicit-solvent simulation data improved FiSH model predictions.

3.2 Binding affinity

Most prospective testing of SIE was dedicated to binding affinity predictions. In its first blind test (SAMPL-1), the original standard SIE parametrization achieved reasonable predictions (R² = 0.36; MUE = 0.92 kcal/mol) on the JNK3 data set consisting of 49 diverse ligands, each with its own co-crystal structure with the kinase, in addition to 10 docked models of known inactive analogs (Sulea et al., 2012). Absolute binding affinities were also predicted within the actual range while the inactives were separated reasonably well from the actives, indicating applicability to virtual screening.

SIE underwent stringent testing in SAMPL-3/4 (Skillman, 2012; Muddana et al., 2014), which blindly challenged the SIE’s applicability domain with three extreme scenarios: i) weak-affinity fragment-sized ligands binding to a protein target (trypsin); ii) high-affinity guest ligands binding to small targets (hosts or cages); and iii) ligands exhibiting a very narrow dynamic range below 2 kcal/mol for binding to a protein target (HIV-I) (Sulea et al., 2012; Hogues et al., 2014). Importantly, affinity predictions were made on computationally docked ligands, for which the Wilma-SIE method (Section 3.3) was employed. SIE provided affinity predictions with an MUE of 2.24 kcal/mol for the trypsin–fragments set, which were significantly improved by incorporating the newer FiSH solvation model (MUE 0.98 kcal/mol). This was also found on in the HIV-I set. SIE predictions in the host-guest systems were acceptable (R² 0.5–0.7) but suffered from an overestimation in absolute terms, which was corrected by rescaling the entropy-related factor, α, which may depend on the rigidity of the target molecule. Even with experimentally solved binding modes, SIE predictions lacked correlation with experimental affinities in the HIV-I set having binding affinities within 2 kcal/mol range; nonetheless, SIE was able to correctly signal the narrowness of the data range. Another conclusion from the HIV-I test set was that using a common protein structure for all ligands can reduce the noise.

SIE testing in the CSAR-2013/14 (Carlson, 2016) and D3R-GC2 (Gaieb et al., 2018) blind challenges allowed assessment of performance and transferability across a wider range of protein systems representative of real-life applications (Hogues et al., 2016; Hogues et al., 2018b). Affinity ranking of congeneric ligands after cross-docking was reasonably achieved in the SBP, Syk and TrmD systems, with Spearman rank-order correlation coefficients (S) ∼0.6. Poor ranking of FXA ligands was possibly due to protein domains not included in the calculations. Ligand preparation in the Syk set underscored the critical role of correct assignment of protonation states to the SIE performance. Including the FiSH model improved cross-docking but worsened affinity predictions, which pointed to a need for further fine-tuning of this newer solvation model. The FRX set from the D3R-GC2 blind challenge posed the formidable task of predicting ligand binding affinity to a highly flexible receptor. A possible cause for the difficulty of SIE function to predict binding affinities in this scenario is the internal energy strain arising from conformational differences in the receptor across complexes, which may need to be properly incorporated into SIE for flexible targets.

The most extensive testing of SIE purely for affinity prediction was done on the CSAR-2010 scoring set (Dunbar et al., 2011) consisting of high-resolution co-crystal structures for 343 protein-ligand complexes with high-quality binding affinity data spanning 18 kcal/mol and highly diverse protein targets (Sulea et al., 2011). SIE predicted binding affinities for the curated CSAR-NRC-HiQ dataset that were well in the range of experimental values (MUE = 1.98 kcal/mol; R² = 0.38). Predictions were found to be very sensitive to the assignment of protonation and tautomeric states in the complex, and to the treatment of metal ions near the protein-ligand interface. Retraining of the SIE function on this large and diverse set gave marginal improvements with small changes in optimal parameters and was not warranted.

3.3 Docking

A natural extension of the SIE function is docking, i.e., ranking binding modes (poses) of a ligand to a protein target. To this end, SIE was integrated into the exhaustive docking program Wilma (Sulea et al., 2012; Hogues et al., 2014; Hogues et al., 2016). Briefly, Wilma uses a brute-force, exhaustive searching approach where interaction modes with the rigid protein of all the discrete rotational and translational states of ligand conformations are enumerated, scored, clustered and ranked using a simple fast-scoring function. A few hundreds top-ranked poses produced by Wilma are then energy-minimized and rescored by SIE. The goal is to provide an accurate docking solution as the top-1 SIE-scored pose. The same SIE parametrization used for affinity prediction is also employed for docking, allowing consistency between pose ranking for each ligand and affinity ranking between different ligands.

Extensive testing of Wilma-SIE indicated that ligand docking can be achieved with high accuracy and is an easier task than binding affinity scoring. The power of Wilma-SIE in pose selection and cross-docking against multiple targets and ligand classes was unequivocally demonstrated in the CSAR-2013/14 blind challenge (Hogues et al., 2016). In all 24 pose-selection tests on 4 different protein targets, Wilma-SIE ranked the native pose as best among carefully generated sets of decoy conformations. Large score separations of native poses indicated robustness in pose scoring. Cross-dockings were also accomplished with high accuracies for various systems, with ligand median RMSD (mRMSD) values around 1 Å from the crystal structures. Both Wilma-SIE and Wilma-SIE + FiSH generated docking predictions among the best-performing submissions. In terms of consistency and system transferability, they were the only submissions with mRMSDs below 1.5 Å on every system. This level of performance was for top-1 poses ranked by SIE or SIE + FiSH over multiple target conformations. Using SIE + FiSH for pose scoring lead to somewhat better docking accuracy overall as well as for individual targets, with mRMSD of 0.6 Å by Wilma-SIE + FiSH in the Syk set.

Although the optimal regime of Wilma-SIE is for high-affinity ligands with low-to-moderate flexibility, the SAMPL-4 blind test on the HIV-I set demonstrated that Wilma-SIE can sometimes dock accurately even weak-affinity ligands (K_D > 0.1 mM) with high flexibility (>8 rotatable bonds) (Hogues et al., 2014). In D3R-GC2, the rigid-protein docking method Wilma-SIE faced the FXR target that exhibits significant backbone movement in response to ligand binding (Hogues et al., 2018b). Use of the conformational ensembles from publicly available structures of FXR allowed Wilma-SIE to predict poses with mRMSD of 1.4 Å on the set of 36 FXR diverse ligands, and rank amongst the best pose-prediction methods of the challenge. However, the success rate would have been much lower if only a single structure were used.

3.4 Virtual screening

With excellent scoring abilities for both binding affinity and pose selection, one of the most practical applications of SIE is virtual screening of compound libraries against a given target protein structure. Typical libraries of available multi-million drug-like compounds, e.g., ZINC (Sterling and Irwin, 2015), can be efficiently processed by Wilma-SIE, which is fully scalable and parallelizable on available computational resources. The first assessment of the SIE performance in virtual screening indicated excellent enrichments of true actives within decoy sets for estrogen receptor and thymidine kinase as screening targets, particularly in the latter more challenging system having weak binding affinities for the true binders (Naim et al., 2007). Wilma-SIE can be used for screening of not only drug-like but also fragment-like ligands, as demonstrated in the SAMPL-3 blind challenge on trypsin screening, where it achieved a good enrichment of the 20 true actives amongst 500 fragment-like ligand library with an AUC-ROC of ∼0.7 (Sulea et al., 2012). The early enrichment performance was particularly good, with 50% of true actives recovered with false-positive rates of 15% for Wilma-SIE and 3% for Wilma-SIE + FiSH. The SAMPL-4 blind test showed that Wilma-SIE is not suited for detection of promiscuous weak and flexible ligands, although even in such difficult cases it can lead to better-than-random virtual screening results (Hogues et al., 2014).

4 Biotherapeutics design

4.1 Antibody-antigen affinity ranking

With the rise of monoclonal antibodies (mAbs) as a promising class of biotherapeutics, the SIE function was evaluated for its ability to predict protein-protein binding affinities. An immediate application is antibody optimization, with a long-term goal towards de novo protein engineering. A first study assessed the transferability of the original SIE parametrization to predict changes in antibody-antigen binding affinities (Sulea et al., 2016). To this end, we assembled Single-Point Mutant Antibody Binding (SiPMAB), a dataset of 212 antibody mutants from 7 systems having high-resolution crystal structures of parental antibodies and high-quality binding affinity measurements. The SIE function coupled with a protocol limited to sampling only the mutated side chain was able to reasonably predict relative binding affinities (S ∼0.6) without any reparameterization of the original SIE function trained on small-molecule binding. Binding affinity ranking performance was maintained for each of the 7 systems and other subsets including non-alanine and charge-altering mutations. Performance was further enhanced using consensus ranking over multiple scoring functions alongside SIE, such as FoldX (Guerois et al., 2002) and Rosetta (Kortemme and Baker, 2002; Ó Conchúir et al., 2015). The consensus scores were obtained by converting the scores from the various scoring functions into normalized z-scores and producing an average z-score for each mutant. This facilitated combining the disparate magnitudes and scales of the different scoring functions.

4.2 Affinity maturation of antibodies

Traditional experimental approaches such as library display and screening are incapable of thoroughly exploring the vast mutational space available to a typical antibody complementarity determining region (CDR; ∼60 residues). One meaningful way to systematically explore and prioritize this space is to examine single-point mutants first and then combine validated hot spots into multiple-point mutants. A cost-effective protocol is to leverage the speed of in silico screening followed by experimental validation of only a small number of predicted hits in each mutation round. To this end, we developed Assisted Design of Antibody and Protein Therapeutics (ADAPT), a platform that interleaves structure-based virtual screening mutagenesis with experimental testing in order to optimize the binding affinity of a biologic (antibody) to its target (antigen) (Vivcharuk et al., 2017). The ADAPT-based affinity maturation eliminates false-positive predictions in two ways: i) early experimental validation of top-scored virtual hits, mainly at the single mutation stage, and ii) consensus affinity scoring over SIE and other popular scoring functions like FoldX and Rosetta. The platform is also designed to preserve protein folding upon mutation by using a computational filter.

Prospective applications of ADAPT affinity maturation in real-life projects (Table 2) led to 10–100-fold improvements in the dissociation constant (K_D) for several Fab fragments of mAbs that originally bound their antigens with 0.05–50 nM affinities (Vivcharuk et al., 2017). ADAPT has also been applied successfully to improve the binding affinity of a single-domain antibody (sdAb) against Clostridium difficile toxin A by 10-fold, which led to improved functional efficacy and thermal stability of the optimized sdAb (Sulea et al., 2018). To achieve affinity improvements via ADAPT, only about 30–50 single to triple mutants need to be recombinantly produced and tested, a significant reduction of the aforementioned available mutational space.

TABLE 2

TABLE 2. Prospective applications of SIE via ADAPT to antibody engineering.

The utility of the consensus score is highlighted in Table 2 of Vivcharuk et al. (2017). For the bH1-VEGF system, only SIE had a good z-score for the G99D mutation. FoldX and Rosetta scored it poorly and ranked it 522 and 725, respectively. Due to SIE, the consensus z-score of this single mutant brought it within the top 50 consensus z-scores, making it to the list for experimental validation. In the bH1-HER2 system, only FoldX scored the I29R mutant well. SIE and Rosetta z-scores had them at ranks 107 and 153, respectively. However, the consensus z-score made it within the top 50 cutoff. In the Herceptin-HER2 system, only Rosetta had a good z-score for D102F, while the FoldX and SIE z-scores ranked it at 88 and 51, respectively. Again, the consensus z-score brought it within the cutoff for experimental consideration. All of these single mutants turned out to be components of the best triple mutants for their respective systems. They would have been missed for one or more systems had we relied on a single scoring function. The consensus scoring approach was also shown to be superior to each individual component scoring function on the SiPMAB data set in terms of AUC-ROC (Sulea et al., 2016).

In certain cases, it is beneficial to controllably weaken binding affinity, for example, to reduce toxicity of antibody-drug conjugates (ADCs) used in oncology. This approach takes advantage of the bivalency of mAbs and higher expression of antigens on tumor versus normal cells. ADAPT and SIE were employed to design a set of antibody mutants that evenly modulate binding affinity within a 4 kcal/mol range leading to identification of optimal ADCs with improved therapeutic windows (Zwaagstra et al., 2019).

ADAPT was used to broaden the specificity of an anti-SARS-CoV-1 sdAb that had only weak cross-reactivity with SARS-CoV-2 (Sulea et al., 2022). By applying ADAPT with the constraint of dual-affinity optimization simultaneously against coronaviruses from distinct phylogenetic clades, optimized sdAbs were found that neutralized the major variants of concern within the SARS-CoV-2 clade with superior pan-specificity and potency relative to the parental antibody.

A requirement for ADAPT is the availability of 3D structural data for the protein-protein interface subjected to affinity optimization, ideally from crystallographic experiments. Encouragingly, a recent study reports ADAPT-based affinity maturation of a weak-affinity anti-CD47 sdAb (K_D of 278 nM) by 87-fold based on an antigen-bound sdAb structure derived by homology modeling, molecular dynamics and protein-protein docking (Cheng et al., 2019).

4.3 Engineering pH-sensitive antibodies

For therapeutic applications in oncology, specific binding under the slightly acidic pH of solid tumors can reduce off-tumor binding and toxicity on normal cells living under physiological pH. Given that the average pKa of histidine in proteins is ∼6.4, virtual His screening of antibody CDR appears as a suitable approach to achieve pH-selective antigen engagement. To do this in ADAPT, SIE calculations are performed twice for each His mutant, in the protonated and neutral states, and then referenced the parental complex in both pHs. This approach was successfully applied to introduce pH-dependent binding in a variant of trastuzumab (Herceptin) binding the Her2 antigen overexpressed in breast cancer (Sulea et al., 2020). Designed antibody His mutants bound stronger to acidic cancer cells than to normal cells under physiological pH, and inhibited cell growth under acidic pH but not under physiological pH. In contrast, the parental antibody impacted tumor and normal cells similarly. In a larger-scale application, ADAPT and SIE could be used to retrofit the entire anticancer pipeline of antibodies with available 3D structures in complex with their onco-antigens (Wei et al., 2022). ADAPT could be similarly employed to engineer pH selectivity in the opposite direction for the design of recycling antibodies. In this scenario, overexpressed targets captured under physiological pH can be degraded more readily if they disengage from recycling antibodies in the slightly acidic endosomes inside the cells.

5 SIE in the wild

A literature analysis involving manual inspection of 377 distinct citations of 15 key methodological papers on SIE (Naim et al., 2007; Cui et al., 2008; Sulea et al., 2011; Sulea and Purisima, 2012b; Purisima and Hogues, 2012; Sulea et al., 2012; Hogues et al., 2014; Henry et al., 2016; Hogues et al., 2016; Sulea et al., 2016; Vivcharuk et al., 2017; Hogues et al., 2018a; Hogues et al., 2018b; Sulea et al., 2018; Sulea et al., 2020) (as of 21 December 2021) was undertaken (Supplementary Table S1). Only 36 of those citations are self-citations (10%). 50 of the 377 citations (13%) are for a different scoring function, called GBVI/WSA (Corbeil et al., 2012), calibrated on the SIE training dataset (Supplementary Table S2) and having a formalism similar to SIE but differing in its surface area calculations and force-field used. SIE has been used by hundreds of scientific groups around the globe (Supplementary Figure S1) with nearly 29% of citations from North-America, 31% from Asia, and 26% from Europe.

This literature was scrutinized to determine the use of SIE in research. In almost half of citations, SIE was used either prospectively (40 articles) to design new molecules or retrospectively (146 articles) to rationalize binding or a biological process. Predicted binding affinities were collected for instances in which experimental binding affinities were also reported. In total, two subsets of 275 and 150 data points for small-molecule and antibody complexes were collected from 65 to 5 articles, respectively (Supplementary Tables S3-S4). The small-molecule subset combines both prospective and retrospective data given the difficulty of discerning the two in some articles. The antibody subset combines all prospective data from internal studies in which SIE was applied to design novel mutants. As described earlier, SIE has also been tested extensively in community studies for small-molecule binding [SAMPL-1/3/4 (Sulea et al., 2012; Hogues et al., 2014), CSAR-2010/13/14 (Sulea et al., 2011; Hogues et al., 2016), D3R-GC2 (Hogues et al., 2018b)] as well as in-house studies of relative binding affinities for antibodies [SiPMAB (Sulea et al., 2016)]. From these studies, 975 and 212 data points were supplemented for small molecules and antibodies, respectively (Supplementary Tables S5-S6). To avoid skewing analyses towards these benchmark studies, these large datasets were treated separately.

Predicted binding affinities were plotted against experimental data for small molecules (Figure 1) and biologics (Figure 2). Overall, SIE performs well on the published data for 275 small molecules, achieving good ranking (S of 0.76) and MUE of 1.49 kcal/mol. Both values are close to the published results for SIE on the training set (S of 0.79; MUE of 1.38 kcal/mol). Out of the 65 articles, only 10 of them contained targets present in the SIE training set (Supplementary Table S7). Individual correlations in most systems are reasonable, demonstrating transferability of SIE across multiple targets, even for those not part of the training set.

FIGURE 1

FIGURE 1. Predicted versus experimental absolute binding affinities for small-molecules. Data from external articles (colored symbols) and community-wide challenges (gray symbols). The “Small Molecules” set combines all data from external articles. Number of compounds (N), Spearman rank-order correlation coefficient (S), Pearson correlation coefficient (R²) and mean unsigned error (MUE, kcal/mol) are listed for each set.

FIGURE 2

FIGURE 2. Predicted versus experimental relative binding affinities for biologics. Data from prospective studies (colored symbols) and in-house SiPMAB set (gray symbols). The “Biologics” set combines all data from literature. Statistical parameters as in Figure 1.

Aside from the overall binding affinities, analysis of the individual components of the SIE score can provide insights into the nature of the binding interactions. In a study of a Pan-BCR-ABL kinase inhibitor, the SIE components of the interaction of the inhibitor with the native and fourteen mutant BCR-ABL kinases revealed the relative importance of the electrostatic and van der Waals contributions to binding for the various mutants (Tanneeru and Guruprasad, 2013). SIE has also been used for virtual alanine scanning, systematically replacing selected residues with alanine and recomputing the predicted binding affinity. For example, it was used to identify which residues contribute the most to the binding of a peptide inhibitor to the MurA enzyme of Pseudomonas aeruginosa (Lima et al., 2017). Another application of SIE is in structural studies. In a study of putative binding modes of inhibitors to acetylcholinesterase, the SIE score in conjunction with qualitative structural analysis of the modeled structures was used to predict which binding mode was the most likely one (Galdeano et al., 2012). Drug resistance is a major concern in therapeutics. In a study of the drug resistance arising from HIV-Protease mutants, both SIE and MM-PBSA showed that the decrease in van der Waals interactions between the inhibitors and the protein was the driving force in conferring resistance (Wang et al., 2020). Polar interactions hardly contributed to drug resistance.

For biologics, 5 internal publications yielded 150 antibody mutants with both predicted and experimental binding affinities relative to the respective parental antibodies (Vivcharuk et al., 2017; Sulea et al., 2018; Zwaagstra et al., 2019; Sulea et al., 2020; Corbeil et al., 2021). When compared to the published results for small molecules, the quantitative prediction of relative binding affinity for antibodies is not as successful (S = 0.58, MUE = 0.90 kcal/mol). The overall correlation is mainly driven by three studies, Vivcharuk et al. (2017); Sulea et al. (2018); Zwaagstra et al. (2019) in which SIE was used to predict binding affinity changes mostly for single mutants relative to a parental antibody. They are similar in spirit to the SIE benchmark study on the SiPMAB set (Sulea et al., 2016), hence their comparable performances (Figure 2). The other two studies required introduction of additional degrees of freedom in the modeling approach, which may explain their poorer correlations. In Sulea et al. (2020) SIE was used to design mutants that selectively bound in the acidic tumor microenvironment, which necessitated predicting binding affinities for both neutral and acidic pH, which may have compounded prediction errors. Despite the low correlation observed, these predictions still proved useful (Section 4.3). In Corbeil et al. (2021) a mutational engineering endeavor was undertaken by attempting to redesign the entire CDR H3 loop of an antibody. The requirement of predicting the protein loop backbone conformation significantly increased affinity prediction errors. Protein flexibility and solvation are some of the areas which may require further development for improving predictions of relative protein-protein binding affinities.

6 Perspectives and conclusion

As with any scoring function, there is room for further improvement. A key component of the SIE scoring function is its solvation model, which has evolved in sophistication over time. Further refinement of the FiSH solvation model continues to be an active area of development. The original FiSH model was parameterized on small organic molecules. A possible refinement for applications to protein-protein interactions would be fine-tuning the parameters such as the born radii using molecular dynamics simulations of amino acids and short peptides with explicit water molecules as a reference. The scoring function also lacks an explicit conformational entropy term. Currently, an overall scaling factor is meant to capture entropy-enthalpy compensation, which can account for overall trends but is incapable of reproducing more granular details of entropic contributions to the binding free energy. Empirical entropic terms such as found, for example, in the FoldEF scoring function could be added. The conformational sampling as a by-product of the different methods used in the consensus approach does address flexibility to some extent, which may explain some of its improved performance.

Applications of machine learning and AI have exploded across almost all disciplines. The use of AI together with physics-based methods is a powerful combination. As discussed above, consensus scoring has been key in enhancing the robustness of ADAPT in antibody design. The incorporation of one or more AI-derived scoring functions (Li et al., 2021) as part of the consensus score could provide complementary information not well captured in the current physics-based functions. AI tools could also improve the SIE scoring function itself by optimizing the parameters in its solvation model. Moreover, since SIE is dependent on a 3D structure of the systems of interest, the increasing capability of AI structure prediction methods (Jumper et al., 2021; Akdel et al., 2022; Bryant et al., 2022) will have a collateral benefit in broadening the scope of molecular targets that SIE and other physics-based scoring functions can be applied to.

Despite its present limitations, SIE has been successfully applied across multiple systems by various research groups. Although initially developed for small-molecule affinity prediction, the literature we have surveyed highlighted its versatility as demonstrated in the variety of applications through the years that have ranged from small-molecule docking and virtual screening to the design of biologics. This wide applicability is remarkable given the parsimonious number of fitting parameters in the original calibration of the SIE scoring function as well as being relatively computationally inexpensive.

Author contributions

TS contributed to the conception of the main sections of the review. CC, FG, WW, and CD contributed to SIE in the wild by first identifying and reading the relevant literature using the SIE method, and then collecting and curating actual and calculated binding affinity data. CC produced the scatter plots in Figures 1, 2. TS wrote the first manuscript draft. CC and EP wrote sections of the manuscript. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2023.1210576/full#supplementary-material

References

Akdel, M., Pires, D. E. V., Pardo, E. P., Jänes, J., Zalevsky, A. O., Mészáros, B., et al. (2022). A structural biology community assessment of AlphaFold2 applications. Nat. Struct. Mol. Biol. 29, 1056–1067. doi:10.1038/s41594-022-00849-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Bryant, P., Pozzati, G., and Elofsson, A. (2022). Improved prediction of protein-protein interactions using AlphaFold2. Nat. Commun. 13, 1265. doi:10.1038/s41467-022-28865-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Carlson, H. A. (2016). Lessons learned over four benchmark exercises from the Community Structure-Activity Resource. J. Chem. Inf. Model. 56, 951–954. doi:10.1021/acs.jcim.6b00182

PubMed Abstract | CrossRef Full Text | Google Scholar

Case, D. A., Cheatham, T. E., Darden, T., Gohlke, H., Luo, R., Merz, K. M., et al. (2005). The Amber biomolecular simulation programs. J. Comput. Chem. 26, 1668–1688. doi:10.1002/jcc.20290

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, C. E., and Gilson, M. K. (2004). Free energy, entropy, and induced fit in host-guest recognition: Calculations with the second-generation mining minima algorithm. J. Amer. Chem. Soc. 126, 13156–13164. doi:10.1021/ja047115d

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, W., Chang, C. E., and Gilson, M. K. (2004). Calculation of cyclodextrin binding affinities: Energy, entropy, and implications for drug design. Biophys. J. 87, 3035–3049. doi:10.1529/biophysj.104.049494

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, X., Wang, J., Kang, G., Hu, M., Yuan, B., Zhang, Y., et al. (2019). Homology modeling-based in silico affinity maturation improves the affinity of a nanobody. Int. J. Mol. Sci. 20, 4187. doi:10.3390/ijms20174187

PubMed Abstract | CrossRef Full Text | Google Scholar

Chodera, J. D., Mobley, D. L., Shirts, M. R., Dixon, R. W., Branson, K., and Pande, V. S. (2011). Alchemical free energy methods for drug discovery: Progress and challenges. Curr. Opin. Struct. Biol. 21, 150–160. doi:10.1016/j.sbi.2011.01.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Corbeil, C. R., Manenda, M. S., Sulea, T., Baardsnes, J., Picard, M. E., Hogues, H., et al. (2021). Redesigning an antibody H3 loop by virtual screening of a small library of human germline-derived sequences. Sci. Rep. 11, 21362. doi:10.1038/s41598-021-00669-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Corbeil, C. R., Sulea, T., and Purisima, E. O. (2010). Rapid prediction of solvation free energy. 2. The first-shell hydration (FiSH) continuum model. J. Chem. Theory Comput. 6, 1622–1637. doi:10.1021/ct9006037

PubMed Abstract | CrossRef Full Text | Google Scholar

Corbeil, C. R., Williams, C. I., and Labute, P. (2012). Variability in docking success rates due to dataset preparation. J. Comput. Aided Mol. Des. 26, 775–786. doi:10.1007/s10822-012-9570-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Cui, Q., Sulea, T., Schrag, J. D., Munger, C., Hung, M.-N., Naim, M., et al. (2008). Molecular dynamics-solvated interaction energy studies of protein-protein interactions: The MP1-p14 scaffolding complex. J. Mol. Biol. 379, 787–802. doi:10.1016/j.jmb.2008.04.035

PubMed Abstract | CrossRef Full Text | Google Scholar

Dhakal, A., Mckay, C., Tanner, J. J., and Cheng, J. (2022). Artificial intelligence in the prediction of protein-ligand interactions: Recent advances and future directions. Brief. Bioinform. 23, bbab476. doi:10.1093/bib/bbab476

PubMed Abstract | CrossRef Full Text | Google Scholar

Dunbar, J. B., Smith, R. D., Yang, C.-Y., Ung, P. M.-U., Lexa, K. W., Khazanov, N. A., et al. (2011). CSAR benchmark exercise of 2010: Selection of the protein-ligand complexes. J. Chem. Inf. Model. 51, 2036–2046. doi:10.1021/ci200082t

PubMed Abstract | CrossRef Full Text | Google Scholar

Gaieb, Z., Liu, S., Gathiaka, S., Chiu, M., Yang, H., Shao, C., et al. (2018). D3R Grand challenge 2: Blind prediction of protein-ligand poses, affinity rankings, and relative binding free energies. J. Comput.-Aided Mol. Des. 32, 1–20. doi:10.1007/s10822-017-0088-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Galdeano, C., Viayna, E., Sola, I., Formosa, X., Camps, P., Badia, A., et al. (2012). Huprine–tacrine heterodimers as anti-amyloidogenic compounds of potential interest against Alzheimer’s and prion diseases. J. Med. Chem. 55, 661–669. doi:10.1021/jm200840c

PubMed Abstract | CrossRef Full Text | Google Scholar

Geng, C., Xue, L. C., Roel-Touris, J., and Bonvin, A. M. (2019). Finding the DDG spot: Are predictors of binding affinity changes upon mutations in protein–protein interactions ready for it? WIREs Comput. Mol. Sci. 9, e1410. doi:10.1002/wcms.1410

CrossRef Full Text | Google Scholar

Gilson, M. K., and Zhou, H. X. (2007). Calculation of protein-ligand binding affinities. Annu. Rev. Biophys. Biomol. Struct. 36, 21–42. doi:10.1146/annurev.biophys.36.040306.132550

PubMed Abstract | CrossRef Full Text | Google Scholar

Gohlke, H., and Klebe, G. (2002). Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Angew. Chem. Int. Ed. Engl. 41, 2644–2676. doi:10.1002/1521-3773(20020802)41:15<2644::AID-ANIE2644>3.0.CO;2-O

PubMed Abstract | CrossRef Full Text | Google Scholar

Guerois, R., Nielsen, J. E., and Serrano, L. (2002). Predicting changes in the stability of proteins and protein complexes: A study of more than 1000 mutations. J. Mol. Biol. 320, 369–387. doi:10.1016/S0022-2836(02)00442-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Henry, K. A., Sulea, T., Faassen, H. V., Hussack, G., Purisima, E. O., Mackenzie, C. R., et al. (2016). A rational engineering strategy for designing protein A-binding camelid single-domain antibodies. PLoS ONE 11, e0163113. doi:10.1371/journal.pone.0163113

PubMed Abstract | CrossRef Full Text | Google Scholar

Hogues, H., Gaudreault, F., Corbeil, C. R., Deprez, C., Sulea, T., and Purisima, E. O. (2018a). ProPOSE: Direct exhaustive protein–protein docking with side chain flexibility. J. Chem. Theory Comput. 14, 4938–4947. doi:10.1021/acs.jctc.8b00225

PubMed Abstract | CrossRef Full Text | Google Scholar

Hogues, H., Sulea, T., Gaudreault, F., Corbeil, C. R., and Purisima, E. O. (2018b). Binding pose and affinity prediction in the 2016 D3R Grand Challenge 2 using the Wilma-SIE method. J. Comput. Aided Mol. Des. 32, 143–150. doi:10.1007/s10822-017-0071-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Hogues, H., Sulea, T., and Purisima, E. O. (2016). Evaluation of the Wilma-SIE virtual screening method in Community Structure-Activity Resource 2013 and 2014 blind challenges. J. Chem. Inf. Model. 56, 955–964. doi:10.1021/acs.jcim.5b00278

PubMed Abstract | CrossRef Full Text | Google Scholar

Hogues, H., Sulea, T., and Purisima, E. O. (2014). Exhaustive docking and solvated interaction energy scoring: Lessons learned from the SAMPL4 challenge. J. Comput. Aided Mol. Des. 28, 417–427. doi:10.1007/s10822-014-9715-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Hornak, V., Abel, R., Okur, A., Strockbine, B., Roitberg, A., and Simmerling, C. (2006). Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins 65, 712–725. doi:10.1002/prot.21123

PubMed Abstract | CrossRef Full Text | Google Scholar

Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Kortemme, T., and Baker, D. (2002). A simple physical model for binding energy hot spots in protein-protein complexes. Proc. Nat. Acad. Sci. U.S.A. 99, 14116–14121. doi:10.1073/pnas.202485799

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., Sze, K.-H., Lu, G., and Ballester, P. J. (2021). Machine-learning scoring functions for structure-based virtual screening. WIREs Comput. Mol. Sci. 11, e1478. doi:10.1002/wcms.1478

CrossRef Full Text | Google Scholar

Lima, A. H., Dos Santos, A. M., Alves, C. N., and Lameira, J. (2017). Computed insight into a peptide inhibitor preventing the induced fit mechanism of MurA enzyme from Pseudomonas aeruginosa. Chem. Biol. Drug Des. 89, 599–607. doi:10.1111/cbdd.12882

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., and Wang, R. (2015). Classification of current scoring functions. J. Chem. Inf. Model. 55, 475–482. doi:10.1021/ci500731a

PubMed Abstract | CrossRef Full Text | Google Scholar

Massova, I., and Kollman, P. A. (2000). Combined molecular mechanical and continuum solvent approach (MM-PBSA/GBSA) to predict ligand binding. Perspect. Drug Discov. Des. 18, 113–135. doi:10.1023/a:1008763014207

CrossRef Full Text | Google Scholar

Mey, A. S. J. S., Allen, B. K., Macdonald, H. E. B., Chodera, J. D., Hahn, D. F., Kuhn, M., et al. (2020). Best practices for alchemical free energy calculations [article v1.0]. Living J. Comp. Mol. Sci. 2, 18378. doi:10.33011/livecoms.2.1.18378

CrossRef Full Text | Google Scholar

Muddana, H. S., Fenley, A. T., Mobley, D. L., and Gilson, M. K. (2014). The SAMPL4 host–guest blind prediction challenge: An overview. J. Comput. Aided Mol. Des. 28, 305–317. doi:10.1007/s10822-014-9735-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Naim, M., Bhat, S., Rankin, K. N., Dennis, S., Chowdhury, S. F., Siddiqi, I., et al. (2007). Solvated interaction energy (SIE) for scoring protein-ligand binding affinities. 1. Exploring the parameter space. J. Chem. Inf. Model. 47, 122–133. doi:10.1021/ci600406v

PubMed Abstract | CrossRef Full Text | Google Scholar

Ó Conchúir, S., Barlow, K. A., Pache, R. A., Ollikainen, N., Kundert, K., O'Meara, M. J., et al. (2015). A web Resource for standardized benchmark datasets, metrics, and Rosetta protocols for macromolecular modeling and design. PLoS ONE 10, e0130433. doi:10.1371/journal.pone.0130433

PubMed Abstract | CrossRef Full Text | Google Scholar

Purisima, E. O., Corbeil, C. R., and Sulea, T. (2010). Rapid prediction of solvation free energy. 3. Application to the SAMPL2 challenge. J. Comput. Aided Mol. Des. 24, 373–383. doi:10.1007/s10822-010-9341-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Purisima, E. O. (1998). Fast summation boundary element method for calculating solvation free energies of macromolecules. J. Comput. Chem. 19, 1494–1504. doi:10.1002/(sici)1096-987x(199810)19:13<1494::aid-jcc6>3.0.co;2-l

CrossRef Full Text | Google Scholar

Purisima, E. O., and Hogues, H. (2012). Protein–ligand binding free energies from exhaustive docking. J. Phys. Chem. B 116, 6872–6879. doi:10.1021/jp212646s

PubMed Abstract | CrossRef Full Text | Google Scholar

Purisima, E. O., and Nilar, S. H. (1995). A simple yet accurate boundary element method for continuum dielectric calculations. J. Comput. Chem. 16, 681–689. doi:10.1002/jcc.540160604

CrossRef Full Text | Google Scholar

Purisima, E. O., and Sulea, T. (2009). Restoring charge asymmetry in continuum electrostatics calculations of hydration free energies. J. Phys. Chem. B 113, 8206–8209. doi:10.1021/jp9020799

PubMed Abstract | CrossRef Full Text | Google Scholar

Rastelli, G., Rio, A. D., Degliesposti, G., and Sgobba, M. (2010). Fast and accurate predictions of binding free energies using MM-PBSA and MM-GBSA. J. Comput. Chem. 31, 797–810. doi:10.1002/jcc.21372

PubMed Abstract | CrossRef Full Text | Google Scholar

Reynolds, C. H., and Holloway, M. K. (2011). Thermodynamics of ligand binding and efficiency. ACS Med. Chem. Lett. 2, 433–437. doi:10.1021/ml200010k

PubMed Abstract | CrossRef Full Text | Google Scholar

Sharp, K. (2001). Entropy-enthalpy compensation: Fact or artifact? Protein Sci. 10, 661–667. doi:10.1110/ps.37801

PubMed Abstract | CrossRef Full Text | Google Scholar

Skillman, A. G. (2012). SAMPL3: Blinded prediction of host–guest binding affinities, hydration free energies, and trypsin inhibitors. J. Comput. Aided Mol. Des. 26, 473–474. doi:10.1007/s10822-012-9580-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Sterling, T., and Irwin, J. J. (2015). ZINC 15-ligand discovery for everyone. J. Chem. Inf. Model. 55, 2324–2337. doi:10.1021/acs.jcim.5b00559

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Baardsnes, J., Stuible, M., Rohani, N., Tran, A., Parat, M., et al. (2022). Structure-based dual affinity optimization of a SARS-CoV-1/2 cross-reactive single-domain antibody. PLoS ONE 17, e0266250. doi:10.1371/journal.pone.0266250

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Corbeil, C. R., and Purisima, E. O. (2010). Rapid prediction of solvation free energy. 1. An extensive test of linear interaction energy (LIE). J. Chem. Theory Comput. 6, 1608–1621. doi:10.1021/ct9006025

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Cui, Q., and Purisima, E. O. (2011). Solvated interaction energy (SIE) for scoring protein–ligand binding affinities. 2. Benchmark in the CSAR-2010 scoring exercise. J. Chem. Inf. Model. 51, 2066–2081. doi:10.1021/ci2000242

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Hogues, H., and Purisima, E. O. (2012). Exhaustive search and solvated interaction energy (SIE) for virtual screening and affinity prediction. J. Comput. Aided Mol. Des. 26, 617–633. doi:10.1007/s10822-011-9529-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Hussack, G., Ryan, S., Tanha, J., and Purisima, E. O. (2018). Application of assisted design of antibody and protein therapeutics (ADAPT) improves efficacy of a Clostridium difficile toxin A single-domain antibody. Sci. Rep. 8, 2260. doi:10.1038/s41598-018-20599-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., and Purisima, E. O. (2012a). Predicting hydration free energies of polychlorinated aromatic compounds from the SAMPL-3 data set with FiSH and LIE models. J. Comput. Aided Mol. Des. 26, 661–667. doi:10.1007/s10822-011-9522-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., and Purisima, E. O. (2012b). The solvated interaction energy method for scoring binding affinities. Methods Mol. Biol. 819, 295–303. doi:10.1007/978-1-61779-465-0_19

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Rohani, N., Baardsnes, J., Corbeil, C. R., Deprez, C., Cepero-Donates, Y., et al. (2020). Structure-based engineering of pH-dependent antibody binding for selective targeting of solid-tumor microenvironment. mAbs 12, 1682866. doi:10.1080/19420862.2019.1682866

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Vivcharuk, V., Corbeil, C. R., Deprez, C., and Purisima, E. O. (2016). Assessment of solvated interaction energy function for ranking antibody–antigen binding affinities. J. Chem. Inf. Model. 56, 1292–1303. doi:10.1021/acs.jcim.6b00043

PubMed Abstract | CrossRef Full Text | Google Scholar

Sulea, T., Wanapun, D., Dennis, S., and Purisima, E. O. (2009). Prediction of SAMPL-1 hydration free energies using a continuum electrostatics-dispersion model. J. Phys. Chem. B 113, 4511–4520. doi:10.1021/jp8061477

PubMed Abstract | CrossRef Full Text | Google Scholar

Tanneeru, K., and Guruprasad, L. (2013). Ponatinib is a pan-BCR-ABL kinase inhibitor: MD simulations and SIE study. PLOS ONE 8, e78556. doi:10.1371/journal.pone.0078556

PubMed Abstract | CrossRef Full Text | Google Scholar

Vivcharuk, V., Baardsnes, J., Deprez, C., Sulea, T., Jaramillo, M., Corbeil, C. R., et al. (2017). Assisted design of antibody and protein therapeutics (ADAPT). PLoS ONE 12, e0181490. doi:10.1371/journal.pone.0181490

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A., and Case, D. A. (2004). Development and testing of a general Amber force field. J. Comput. Chem. 25, 1157–1174. doi:10.1002/jcc.20035

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, R.-G., Zhang, H.-X., and Zheng, Q.-C. (2020). Revealing the binding and drug resistance mechanism of amprenavir, indinavir, ritonavir, and nelfinavir complexed with HIV-1 protease due to double mutations G48T/L89M by molecular dynamics simulations and free energy analyses. Phys. Chem. Chem. Phys. 22, 4464–4480. doi:10.1039/c9cp06657h

PubMed Abstract | CrossRef Full Text | Google Scholar

Wei, W., Corbeil, C. R., Gaudreault, F., Deprez, C., Purisima, E. O., and Sulea, T. (2022). Antibody mutations favoring pH-dependent binding in solid tumor microenvironments: Insights from large-scale structure-based calculations. Proteins 90, 1538–1546. doi:10.1002/prot.26340

PubMed Abstract | CrossRef Full Text | Google Scholar

Zwaagstra, J. C., Sulea, T., Baardsnes, J., Radinovic, S., Cepero-Donates, Y., Robert, A., et al. (2019). Binding and functional profiling of antibody mutants guides selection of optimal candidates as antibody drug conjugates. PLoS ONE 14, e0226593. doi:10.1371/journal.pone.0226593

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: binding affinity, solvation, protein-ligand interaction, scoring function, antibody affinity maturation

Citation: Purisima EO, Corbeil CR, Gaudreault F, Wei W, Deprez C and Sulea T (2023) Solvated interaction energy: from small-molecule to antibody drug design. Front. Mol. Biosci. 10:1210576. doi: 10.3389/fmolb.2023.1210576

Received: 22 April 2023; Accepted: 26 May 2023;
Published: 07 June 2023.

Edited by:

Francesca Spyrakis, University of Turin, Italy

Reviewed by:

Jian Wang, The Pennsylvania State University, United States
Enrique Marcos, Spanish National Research Council (CSIC), Spain

Copyright © 2023 Purisima, Corbeil, Gaudreault, Wei, Deprez and Sulea. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Traian Sulea, dHJhaWFuLnN1bGVhQG5yYy1jbnJjLmdjLmNh

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Solvated interaction energy: from small-molecule to antibody drug design

1 Introduction

2 The SIE scoring function approach

2.1 Original calibration of SIE

2.2 Incorporation of solvation models

2.3 Single-structure and MD-trajectory modes

2.4 Relation to other physics-based methods

3 Small-molecule drug design

3.1 Solvation free energy

3.2 Binding affinity

3.3 Docking

3.4 Virtual screening

4 Biotherapeutics design

4.1 Antibody-antigen affinity ranking

4.2 Affinity maturation of antibodies

4.3 Engineering pH-sensitive antibodies

5 SIE in the wild

6 Perspectives and conclusion

Author contributions

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good