PERSPECTIVE article

Front. Chem., 16 August 2019

Sec. Medicinal and Pharmaceutical Chemistry

Volume 7 - 2019 | https://doi.org/10.3389/fchem.2019.00564

Pushing the Ligand Efficiency Metrics: Relative Group Contribution (RGC) Model as a Helpful Strategy to Promote a Fragment “Rescue” Effect

  • 1. Grupo de Diseño de Productos y Procesos (GDPP), School of Chemical Engineering, Universidad de los Andes, Bogotá, Colombia

  • 2. Laboratorio de Fisiología Molecular, Instituto Nacional de Salud, Bogotá, Colombia

Abstract

The ligand efficiency (LE) indexes have long been used as decision-making criteria in drug discovery and development. However, in the context of fragment-based drug design (FBDD), these metrics often exhibit a strong emphasis toward the selection of highly efficient “core” fragments for potential optimization, which are not usually considered as parts of a larger molecule with a size typical for a drug. In this study, we present a relative group contribution (RGC) model intended to predict the efficiency of a drug-sized compound in terms of its component fragments. This model could be useful not only in rapidly predicting all the possible combinations of promising fragments from an earlier hit discovery stage, but also in enabling a relatively low-LE fragment to become part of a drug-sized compound as long as it is “rescued” by other high-LE fragments.

Introduction

Ligand efficiency metrics have been applied as a decision-making strategy nearly universally accepted during the last two decades (Hopkins et al., 2014). They are intended to compare the quality of hits and leads during hit to lead and lead optimization phases of drug discovery (Cavalluzzi et al., 2017). Among these metrics, the ligand efficiency (LE), firstly described by Hopkins et al. (2004), continues to be the most widely used index to discriminate between promising molecules and those which are not (Reynolds, 2015). LE is defined by the following formula:

where ΔG = –RTInKd, N (also known as HAC or heavy atom count) represents the number of heavy, non-hydrogen atoms, and Kd corresponds to the equilibrium dissociation constant.

As shown in Equation (1), LE normalizes the potency by size, specifically representing the average contribution ΔG (Gibbs free energy) per heavy atom. LE is typically used in FBDD as cut-off criterion to retrieve just high-LE fragments in a screening process (Murray and Verdonk, 2006; Schultes et al., 2010). Intriguingly, because LE usually considers fragments as independent chemical entities, some other metrics have emerged to consider the change in affinity as a fragment is developed into a larger, high-affinity drug-sized compound. One prime example is the group efficiency (GE), described in 2008 (Verdonk and Rees, 2008), which allows measuring the contribution to the binding efficiency of a particular group of atoms added to an existing lead molecule:

where ΔΔG = ΔG(B)-ΔG(A) and ΔN = (N(B)-N(A); in other words, ΔΔG is equal to the difference between the Gibbs free energy of the existing molecule (or fragment) “A” and the new combined molecule “B” and ΔN corresponds to the difference between the number of non-hydrogen atoms of molecules “A” and “B.” However, GE is based on a pairwise comparison of structurally closely related compounds and, hence, it is frequently applied for optimization of a high-efficient fragment (Hopkins et al., 2014). Therefore, a rapid and simple method for comparing efficiency of different fragments as part of a whole, including if they are dissimilar to each other (or if they occupy different pockets in the target molecule), needs to be developed.

Several independent studies in the past two decades have indicated an overemphasis on potency by the pharmaceutical industry (Albert et al., 2007; Hopkins et al., 2014). Still, other factors such as chemical novelty (Medina-Franco et al., 2014), selectivity fine-tuning (Costantino and Barlocco, 2018), structural alerts avoidance (Jasial et al., 2017), and synthetic accessibility (Fukunishi et al., 2014) are increasingly playing a key role in drug design. Considering these non-mutually exclusive events, we hypothesize that fragments that not necessarily exhibit a high efficiency level during a screening procedure, either virtual or experimental, would still have the possibility of taking part in a complete drug-sized compound.

In this study, we propose that a relative group contribution (RGC) model based on the efficiency of its component fragments may estimate the efficiency of a drug-sized compound. This model calculates the minimum efficiency required for unknown fragments by considering the efficiency of those already known, which facilitates a rapid elucidation of the best combinations of fragments. Likewise, this model facilitates that fragments with a relatively low efficiency may not necessarily be eliminated at an early stage of the screening process and, consequently, may become eventually represented as chemical moieties within the final candidate compound -a phenomenon herein referred to as fragment “rescue” effect.

Theoretical Framework of the RGC Model

The proposed model is based on three main assumptions:

  • The efficiency of an entire molecule may be estimated as the weighted root mean square (WMRS) of the efficiency of its component fragments. The rational for using this type of mean is intended to (1) cope with the negative values of ΔG and (2) consider effectively the potentially different number of non-hydrogen atoms (N) for each component fragment.

  • The efficiency of each fragment (regarded as the entire ratio and not just the quotient) is, in principle, dependent on each other, excepting in cases of two fragments when the N for them is equal to each other (and then their weight is equivalent), or in cases when three or more fragments are involved.

  • The efficiency of each fragment is directly calculated from the ΔG resulting in its direct interaction with a specific location (i.e., binding site or pocket) in a particular receptor.

Our hypothesis assumes, according to its first principle, that the WRMS of the LEs of the fragments composing an entire molecule () corresponds to the actual LE of this latter (LET) (A comprehensive list of mathematical terms is shown in Supplementary Material). Because this mean is intended to be proportionally similar to the real, total LE of an entire molecule (LET), we refer herein to it as the apparent total LE ():

For clarity of the RGC concept, we consider first the as a simple arithmetic mean:

where LEi corresponds to the LE of the component fragments (LE1, LE2, etc.), and x refers to the number of fragments composing the molecule. Therefore, once LE is expressed in terms of the Equation (1):

where ΔGi is the change in Gibbs free energy for each composing fragment (ΔG1, ΔG2, etc.) up to a maximum number of fragments x and Ni correspond to the number of non-hydrogen atoms of each fragment. Similarly, ΔGT and NT represent the change in Gibbs free energy and number of non-hydrogen atoms for the entire molecule, respectively. Should be remembered that (5) is based on an arithmetic mean and hence it assumes that N is equal among all composing fragments, so that it would just be applicable in this specific scenario.

If we consider the Equation (5) for a molecule composed by a single fragment:

we can observe that correspond to the LE value of the unique component fragment, namely LE1, which supports a scenario where the component fragment is also the entire “final” molecule. However, expressing (5) for a molecule composed by two fragments:

we could notice that, in contrast to the one-fragment case, the existence of more than one compound allows for the solution of the equation in terms of a particular fragment:

The last equation poses a simple but important principle: Starting from an “ideal” , a particular low-LE fragment can be successfully chosen or “rescued” by one or more high-LE fragments. Now, if we consider a three-fragment case:

Interestingly, for this case, even if we assume in this example that we know LE1, it is still possible to consider a LE value for the unknown fragments LE2 and LE3 grouping them together into a single term:

where the LE delta (LEδ) corresponds to a transient, “ideal” value intended to be equal for all the fragments which individual LE is still unknown. Therefore, as we will discuss below, this value will be modified as long as new LE values are known for fragments, independently of their position.

Likewise, assuming also that we just know LE1, we could express for the two-fragment case:

which would indicate that:

This result suggests, as we also discuss below, that a LEδ is expected to equal the LE value of a last fragment to be known (LEu), independently of the number of fragments and their position. This behavior corresponds to a subtractive average (SA). Remarkably, although the nature of LEδ is somewhat similar to the cumulative average (CA) or moving average (MA), the number of fragments with unknown LE is continuously decreasing and LEδ does not “run” within a predetermined window size.

Finally, assuming also that we just know LE1, we could express for the one-fragment case:

indicating that LEδ can only be calculated if there are at least two starting fragments and, more importantly, it is especially useful in cases of three or more of them.

At this point, think of the as a whole for an undetermined series of fragments:

If we assume again that we just know LE1, it is possible to rearrange using LEδ:

where (x-1) corresponds to the coefficient of the LEδ value independently of the number of starting fragments as shown in Equations (10, 11, 13). Hence, if we resolve for LEδ:

Now, if we take into account that the number 1 in this equation actually corresponds to the number of known LE values of fragments or a, we can observe that:

What means that the more (x-a) tends to 1, the more LEδ tends to LEu, just as we saw previously in Equation (12). Finally, considering the formula for LEδ in terms of an undetermined number of fragments with different known and unknown LE values:

where the first summation term indicates the “ideal” sum of LE values for the existing fragments (LEi) as if they would have the same value, and the second summation term refers to the “real” sum of all fragments which LE value is already known (LEj). On the other hand, if a = o, LE0 would not proceed as a real value (and by extension the second summation term). Therefore, in this specific case a consequence would be that:

Now, after having explained the basic concepts of RGC and LEδ, we could express in terms of the WMRS according to our hypothesis:

where LEi corresponds to the LE of each component fragment, wi refers to the weight of each fragment (depending on the N of each one) and x refers to the number of fragments.

Likewise, our formula for LEδ would be:

The additional term in this equation, namely Wδ, corresponds to the “ideal” weight of all fragments with unknown LE value, as if they would have the same value. Just as with LEδ, this parameter is expected to change with every new LE value of fragment known, until the value (weight) corresponding to the last fragment with unknown LE value is adopted.

Fragment Selection and LET Prediction by the RGC Model

As presented in a hypothetical example (Figure 1), three central premises may be elucidated for the RGC model by selecting hit compounds, starting from a cut-off LE value:

  • A low-efficiency fragment (i.e., with a LE < ) can be rescued IF there exists high-efficiency fragments (i.e., with a LE > ) in ALL the other positions (i.e., binding sites or pockets) of the target molecule, usually a protein. If this condition is not satisfied, the fragment could be automatically rejected from the set of possible combinations of fragments for an entire molecule.

  • If there are no high-efficiency fragments in all the other positions simultaneously, a molecular fragment with low-efficiency fragment can still be rescued IF at least one fragment in any other position with an efficiency high enough to reach exists, in average with the first, the . This implies, therefore, that once a fragment is rescued by one or more high-efficiency fragments in other positions, any fragment in subsequent unexplored positions is just required to have a mid-efficiency (with a LE ≅ LEδ ideal) as a minimum. Additionally, each time LEδ changes, a differential “pushing” over the LE of unknown fragments occurs in terms of ΔG and/or N, which could be potentially modified to achieve an acceptable LE value and could therefore be selected.

  • Even if a low-efficiency fragment is not rescued after implementing the strategies stated in Equation (1, 2), a potential rescue could still take place exploring a more diverse library sample of fragments, assuming that (1) you are dealing with a number of fragments well-below under the maximum theoretical chemical space for them (a population of about 107 molecules) and (2) the fragment is not an outlier compared to other fragments intended to combine.

Figure 1

In addition, and according to our preliminary results, we found that values predicted by this model were consistent with the LET values calculated experimentally from a set of 16 drug-sized molecules taken from scientific literature (Figure 2, Table 1S).

Figure 2

Discussion

The present study pretends to propose the RGC model as an innovative and effective approach to apply in drug design. This model and, especially, the fragment “rescue” effect that is conceptually implicit, offer an alternative for the long-standing FBDD paradigm of designing compounds merely based on the intrinsic binding energy of fragments, facilitating the introduction of other decision–making criteria that are becoming increasingly common.

If the principles of the RGC model are considered together, it is possible to elucidate two major advantages. First, we count on a limited amount of data and in order to more clearly reveal any trends, appears to increase as much as the LET, and there appears to be no dramatic shift toward higher efficiencies for particular fragments or protein targets. Secondly, since a particular fragment could be directly rejected early in the process and there are many fragments by pocket in a typical FBDD campaign, this model might dramatically reduce the computational and synthetic costs, respectively (which is especially true in cases of three or more pockets).

The RGC model is, however, not free of inherent shortcomings. As a LE-derived metric, all fragments are assumed to maintain equal orientations both individually and as part of a larger chemical compound (Zartler and Shapiro, 2008), and phenomena such as hot spots (Zerbe et al., 2012; Rathi et al., 2017) or synergy (also called “super-additivity”) (Hebeisen et al., 2008; Nazaré et al., 2012) are not directly considered. Likewise, because its average-based nature, ΔG of each fragment is normalized not only at the number of non-hydrogen atoms but also on the number of component fragments. Therefore, the less accurate (or more extreme) ΔG values for fragments in each position are, the greater the difference expected between the and the LET. However, we believe that the impact of these hurdles could be minimized, improving the confidence of any potential fragment “rescue,” if additional energy terms derived from rigid body barrier (ΔGrigid), linker binding (ΔGbinding) or strain (ΔGstrain) (Murray and Verdonk, 2006; Cherry and Mitchell, 2008) are included and the ΔG values are both accurate and above a reasonable cut-off value.

A final examination about the implications of this work leads us to assert that the fragment “rescue” phenomenon is far from being new: it has already occurred and continues to occur, but usually during a lead optimization instead an earlier hit discovery phase. The rationale for this statement lies in two central facts. First, we currently know that the LE value of a compound tends to decrease during optimization process (Bembenek et al., 2009), while both its lipophilicity and its MW (and, hence, the number of non-hydrogen atoms -N) tends to increase (Ferenczy and Keseru, 2016). Second, the energy of supramolecular interactions is widely known to be largely different depending on the chemical moiety involved and, thus, ionic and hydrogen bonds are expected to account for a larger part of the drug-receptor binding energy (i.e., its ΔG) compared with hydrophobic interactions (Ermondi and Caron, 2006). Therefore, it is decidedly inviting to believe that, in many drug discovery initiatives, both the increase in LE and the decrease in MW and lipophilicity observed during lead optimization-could be explained by the addition of large and hydrophobic chemical moieties such as those cyclic aliphatic, which have a much smaller ΔG/N ratio compared to other fragments. Interestingly, these aliphatic moieties have been recently suggested by some authors to be more “developable” compared its aromatic analogs, which could be an additional factor behind this phenomenon (Lovering et al., 2009).

The RGC model presented in this study is based on the assumption that the LE of a drug-sized molecule may be estimated using the relative contribution of each component fragment. We believe this model could serve as a complementary benchmark for medicinal chemists in experimental or virtual fragment-based screening campaigns. Likewise, we consider that the RGC model could be implemented with other metrics based on either LE or a potency/size ratio and could be eventually adjusted to consider not only “linking” but also “growing” or “merging” as alternative fragment elaboration strategies.

Statements

Data availability statement

The raw data supporting the conclusions of this manuscript will be made available by the authors, without undue reservation, to any qualified researcher.

Author contributions

AV planned and performed the entire in silico work presented in this study, contributed to the analysis and interpretation of data, and assisted the writing, editing, and submission of this manuscript. AG made substantial contributions to the data analysis, critical revision for important intellectual content, and document editing. All authors have read and approved the final manuscript.

Funding

We acknowledge funds from the Colombian Department of Science, Technology and Innovation COLCIENCIAS Grant No. 727.

Acknowledgments

The authors gratefully acknowledge Professor Marco de Vivo from Istituto Italiano di Tecnologia (IIT), in Genoa (Italy) for helpful discussions and recommendations.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2019.00564/full#supplementary-material

References

  • 1

    AlbertJ. S.BlombergN.BreezeA.BrownA.BurrowsJ.EdwardsP.et al. (2007). An integrated approach to fragment-based lead generation: philosophy, strategy and case studies from AstraZeneca's Drug Discovery Programmes. Curr. Top. Med. Chem.7, 16001629. 10.2174/156802607782341091

  • 2

    BembenekS. D.ToungeB. A.ReynoldsC. H. (2009). Ligand efficiency and fragment-based drug discovery. Drug Discov. Today14, 278283. 10.1016/j.drudis.2008.11.007

  • 3

    CavalluzziM. M.MangiatordiG. F.NicolottiO.LentiniG. (2017). Ligand efficiency metrics in drug discovery: the pros and cons from a practical perspective. Expert Opin. Drug Discov.12, 10871104. 10.1080/17460441.2017.1365056

  • 4

    CherryM.MitchellT. (2008). Introduction to fragment-based drug discovery, in Fragment-Based Drug Discovery, eds ZartlerE. R.ShapiroM. J.132. 10.1002/9780470721551.ch1

  • 5

    CostantinoL.BarloccoD. (2018). Designing Approaches to Multitarget Drugs, in Drug Selectivity: An Evolving Concept in Medicinal Chemistry, 161205.

  • 6

    ErmondiG.CaronG. (2006). Recognition forces in ligand-protein complexes: blending information from different sources. Biochem. Pharmacol.72, 16331645. 10.1016/j.bcp.2006.05.022

  • 7

    FerenczyG. G.KeseruG. M. (2016). Ligand efficiency metrics and their use in fragment optimizations, in Fragment-Based Drug Discovery Lessons and Outlook, eds ErlansonD. A.JahnkeW. (Wiley-VCH Verlag GmbH & Co. KGaA), 7598. 10.1002/9783527683604.ch04

  • 8

    FukunishiY.KurosawaT.MikamiY.NakamuraH. (2014). Prediction of synthetic accessibility based on commercially available compound databases. J. Chem. Inf. Model.54, 32593267. 10.1021/ci500568d

  • 9

    HebeisenP.KuhnB.KohlerP.GublerM.HuberW.KitasE.et al. (2008). Allosteric FBPase inhibitors gain 10(5) times in potency when simultaneously binding two neighboring AMP sites. Bioorganic Med. Chem. Lett.18, 47084712. 10.1016/j.bmcl.2008.06.103

  • 10

    HopkinsA. L.GroomC. R.AlexA. (2004). Ligand efficiency: a useful metric for lead selection. Drug Discov. Today9, 430431. 10.1016/S1359-6446(04)03069-7

  • 11

    HopkinsA. L.Keser,üG. M.LeesonP. D.ReesD. C.ReynoldsC. H. (2014). The role of ligand efficiency metrics in drug discovery. Nat. Rev. Drug Discov.13, 105121. 10.1038/nrd4163

  • 12

    JasialS.HuY.BajorathJ. (2017). How frequently are pan-assay interference compounds active? Large-scale analysis of screening data reveals diverse activity profiles, low global hit frequency, and many consistently inactive compounds. J. Med. Chem. 60, 38793886. 10.1021/acs.jmedchem.7b00154

  • 13

    LoveringF.BikkerJ.HumbletC. (2009). Escape from flatland: increasing saturation as an approach to improving clinical success. J. Med. Chem.52, 67526756. 10.1021/jm901241e

  • 14

    Medina-FrancoJ. L.Martinez-MayorgaK.MeuriceN. (2014). Balancing novelty with confined chemical space in modern drug discovery. Expert Opin. Drug Discov.9, 151165. 10.1517/17460441.2014.872624

  • 15

    MurrayC. W.VerdonkM. L. (2006). Entropic consequences of linking ligands, in Fragment-Based Approaches in Drug Discovery, eds MannholdR.KubinyiH.FolkersG.JahnkeW.ErlansonD. A. (Weinheim: Wiley-VCG Verlag GmbH & Co. KGaA), 5566. 10.1002/3527608761.ch3

  • 16

    NazaréM.MatterH.WillD. W.WagnerM.UrmannM.CzechJ.et al. (2012). Fragment deconstruction of small, potent factor xa inhibitors: exploring the superadditivity energetics of fragment linking in protein-ligand complexes. Angew. Chemie Int. Ed.51, 905911. 10.1002/anie.201107091

  • 17

    RathiP. C.LudlowR. F.HallR. J.MurrayC. W.MortensonP. N.VerdonkM. L. (2017). Predicting “Hot” and “Warm” Spots for Fragment Binding. J. Med. Chem.60, 40364046. 10.1021/acs.jmedchem.7b00366

  • 18

    ReynoldsC. H. (2015). Ligand efficiency metrics: why all the fuss?Future Med. Chem.7, 13631365. 10.4155/fmc.15.70

  • 19

    SchultesS.De GraafC.HaaksmaE. E. J.De EschI. J. P.LeursR.KrämerO. (2010). Ligand efficiency as a guide in fragment hit selection and optimization. Drug Discov. Today Technol.7, 157162. 10.1016/j.ddtec.2010.11.003

  • 20

    VerdonkM. L.ReesD. C. (2008). Group efficiency: a guideline for hits-to-leads chemistry. Chem. Med. Chem. 3, 11791180. 10.1002/cmdc.200800132

  • 21

    ZartlerE. R.ShapiroM. J. (eds.). (2008). Designing a fragment process to fit your needs, in Fragment-Based Drug Discovery (John Wiley & Sons, Ltd.), 1537. 10.1002/9780470721551.ch2

  • 22

    ZerbeB. S.HallD. R.VajdaS.WhittyA.KozakovD. (2012). Relationship between hot spot residues and ligand binding hot spots in protein-protein interfaces. J. Chem. Inf. Model.52, 22362244. 10.1021/ci300175u

Summary

Keywords

ligand efficiency metrics, fragment-based screening, property-based design, drug discovery, protein-ligand interactions, structure-activity relationship, fragment library

Citation

Vásquez AF and González Barrios A (2019) Pushing the Ligand Efficiency Metrics: Relative Group Contribution (RGC) Model as a Helpful Strategy to Promote a Fragment “Rescue” Effect. Front. Chem. 7:564. doi: 10.3389/fchem.2019.00564

Received

19 May 2019

Accepted

24 July 2019

Published

16 August 2019

Volume

7 - 2019

Edited by

Jose L. Medina-Franco, National Autonomous University of Mexico, Mexico

Reviewed by

Sergio Hidalgo Figueroa, Instituto Potosino de Investigación Científica y Tecnológica (IPICYT), Mexico; Salvatore Guccione, University of Catania, Italy

Updates

Copyright

*Correspondence: Andrés González Barrios

This article was submitted to Medicinal and Pharmaceutical Chemistry, a section of the journal Frontiers in Chemistry

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics