- 1Human Health Therapeutics Research Centre, National Research Council Canada, Montreal, QC, Canada
- 2Computational Protein Design and Modeling, Computational Science, Moderna Therapeutics, Cambridge, MA, United States
- 3Research Center of Drug and Vaccine Development, National Institute of Infectious Diseases, Tokyo, Japan
Editorial on the Research Topic
Progress and challenges in computational structure-based design and development of biologic drugs
Biotherapeutics have emerged as a major class of pharmaceuticals, encompassing monoclonal antibodies, recombinant human proteins and enzymes, fusion proteins, antibody drug conjugates, multi-specific formats, peptides, and vaccines. These modalities serve a wide range of therapeutic areas, including immune-oncology, inflammation, cardiovascular, metabolic, infectious, and rare diseases (DeFrancesco, 2019; Kang and Jung, 2020; Lu et al., 2020; Kaplon et al., 2023). Recent advancements in structure determination, structure prediction, bioanalytical characterization, and machine learning have established in silico approaches as a key toolbox employed in the biologic drug discovery and development pipelines (Fischman and Ofran, 2018; Norman et al., 2020; Fernandez-Quintero et al., 2023). Additionally, physics-based molecular modeling and simulation methods, along with empirical linear models, have matured to routine implementation during biotherapeutic drug candidate selection and optimization. However, the accuracy of these predictions can be improved. Further refinements will be welcomed, particularly towards binding affinity predictions and developability assessments.
At the same time, with the fast-paced infusion of artificial intelligence in various research areas that impact daily life, we are witnessing a new chapter being written in biological drug design (Kim et al., 2023). A wide range of nonlinear models, from machine learning to unsupervised deep neural networks and language models, is now emerging. These models, fueled by still modest but expanding biological and structural datasets, are complementing classical methods (Baek et al., 2021; Jumper et al., 2021; Kryshtafovych et al., 2021; Bennett et al., 2023; Lin et al., 2023; Wodak et al., 2023). There are also increasing attempts to integrate advanced computational methods with next-generation sequencing, either from synthetic or natural libraries, to enhance the efficient hit-to-lead optimization and de novo discovery of tight binders with favorable developability profiles (Makowski et al., 2021; Mason et al., 2021; Hie et al., 2023; Parkinson et al., 2023). While past structure-based efforts were focused on optimizing existing antibody candidates, the emerging trend is the hopeful possibility of de novo discovery of antibody-based and other biologic drugs via in silico methods. This opens prospects for extensive application of computational techniques in biologic drug discovery and development.
This Research Topic comes at an opportune time when the accuracy limits are being pushed for classical methods and foundations are being established for machine learning methods. These complementary tools are poised to enhance the entire biologics drug development pipeline, from the molecular design and property optimization to large-scale manufacturability. In this Research Topic, readers will find a breadth of computational approaches, ranging from established 3D structure and physics-based techniques to innovative explorations in sequence-based non-linear machine learning models.
An overview of the current state and opportunities for synergistic use of computation and experimentation in this field is provided by Bauer et al. The authors described their vision of Biopharmaceutical Informatics and discuss already available computational methods at each stage of the biologic drug design-discovery-optimization-development pipeline. The authors have provided useful cues on how best to apply these in silico methods and how to combine them with experimental approaches to maximize the odds and efficiency of arriving at biologics that are both effective and developable.
Fundamental understandings of molecular properties of the drug candidates and their targets are essential to advance both biologic discovery and development. Di Rienzo et al. focused on discerning the rules that define antibody-antigen recognition as a fundamental step in the rational design and engineering of functional antibodies with desired properties. Their novel method, which is based on the 3D Zernike polynomials to generate shape and electrostatic descriptors capturing both global and local protein surface physicochemical properties, accurately classified types of antibody-antigen interfaces solely based on paratope surface information. Fernandez-Quintero et al. took a deep dive into seemingly similar interfaces between the various Ig-folded domains that make up a monoclonal antibody structure. Using classical MD simulations and analyses, they revealed and compared contact maps that can be used to inform selection of favorable point mutations for the design of bispecific antibodies. In their case study, Paul et al. described the well-recognized yet inadequately understood trade-off between binding affinity and thermal stability, which can have significant implications during the lead candidate optimization stage. Using classical force-field methods, molecular dynamics, and amino-acid hydropathy, they observed affinity-stability correlations and patterns in key pairs of residues called hotspots.
Novel tools are also reviewed in this Research Topic. For example, Jaszczyszyn et al. assembled a timely review of recent advances of deep-learning based tools for structural modeling the variable regions of antibodies. In addition to cataloguing underlying algorithms and benchmarking their performance, the authors offered their perspective on how the emerging high accuracy of antibody paratope modeling can influence the field of biologics drug discovery. Engelberger et al. provided the energy breakdown guided protein design (ENDURE) tool for accurately assessing energetic contributions from individual and combinatorial mutations to the overall protein stability. An interesting feature is the residue depth analysis which enables tracking the energetic contributions of mutations occurring in different spatial layers of the protein structure. Spoendlin et al. introduced the second iteration of their structural profiling of antibodies to cluster by epitope (SPACE) tool, which builds upon the recent progress in machine learning antibody structure prediction and a novel clustering protocol. It improved data coverage and identified even more diverse clusters of antibodies that bind to the same epitope. These tools are expected to further advance rational design of biotherapeutics.
Proof-of-concept studies illustrating novel screening campaigns that combine computational design with experimental data are also presented. Arras et al. combined next-generation sequencing of semi-immune/semi-synthetic libraries built on a humanized VHH framework with machine learning, data processing, and model building for simultaneous optimization of affinity and developability. The proposed typical early drug discovery methodology generated diverse and potent VHH hits against NKp46 protein without requiring further humanization and developability optimization, thereby accelerating drug discovery. Gaudreault et al. focused on protein-protein docking with flexible side chains while retaining rigid protein backbone to discover novel binders against predefined target epitopes. Their approach was applied to randomized libraries of surface mutations introduced in a rigid protein scaffold called DARPin, leading to the design and experimental validation of an enriched small set of hits against a predefined epitope on the BCL-W target protein.
The literature in this field is growing rapidly. Our Research Topic does not cover all computational aspects of biologic drug discovery. Nonetheless, the articles compiled here hopefully offer timely snapshots of key components along a biologic drug’s discovery, design, and development.
Author contributions
TS: Writing–original draft, Writing–review and editing. SK: Writing–original draft, Writing–review and editing. DK: Writing–original draft, Writing–review and editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. DK acknowledges support from Japan Agency for Medical Research and Development (JP23wm0325047).
Acknowledgments
The editors would like to thank all the authors and expert reviewers who have participated in the preparation and evaluation of manuscripts presented in this Research Topic.
Conflict of interest
SK is an employee of Moderna Therapeutics.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Baek, M., DiMaio, F., Anishchenko, I., Dauparas, J., Ovchinnikov, S., Lee, G. R., et al. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science 373, 871–876. doi:10.1126/science.abj8754
Bennett, N. R., Coventry, B., Goreshnik, I., Huang, B., Allen, A., Vafeados, D., et al. (2023). Improving de novo protein binder design with deep learning. Nat. Commun. 14, 2625. doi:10.1038/s41467-023-38328-5
DeFrancesco, L. (2019). Drug pipeline 1Q19. Nat. Biotechnol. 37, 579–580. doi:10.1038/s41587-019-0146-7
Fernandez-Quintero, M. L., Ljungars, A., Waibl, F., Greiff, V., Andersen, J. T., Gjolberg, T. T., et al. (2023). Assessing developability early in the discovery process for novel biologics. MAbs 15, 2171248. doi:10.1080/19420862.2023.2171248
Fischman, S., and Ofran, Y. (2018). Computational design of antibodies. Curr. Opin. Struct. Biol. 51, 156–162. doi:10.1016/j.sbi.2018.04.007
Hie, B. L., Shanker, V. R., Xu, D., Bruun, T. U. J., Weidenbacher, P. A., Tang, S., et al. (2023). Efficient evolution of human antibodies from general protein language models. Nat. Biotechnol. 2023, 1763. doi:10.1038/s41587-023-01763-2
Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2
Kang, T. H., and Jung, S. T. (2020). Reprogramming the constant region of immunoglobulin G subclasses for enhanced therapeutic potency against cancer. Biomolecules 10, 382. doi:10.3390/biom10030382
Kaplon, H., Crescioli, S., Chenoweth, A., Visweswaraiah, J., and Reichert, J. M. (2023). Antibodies to watch in 2023. MAbs 15, 2153410. doi:10.1080/19420862.2022.2153410
Kim, J., McFee, M., Fang, Q., Abdin, O., and Kim, P. M. (2023). Computational and artificial intelligence-based methods for antibody development. Trends Pharmacol. Sci. 44, 175–189. doi:10.1016/j.tips.2022.12.005
Kryshtafovych, A., Schwede, T., Topf, M., Fidelis, K., and Moult, J. (2021). Critical assessment of methods of protein structure prediction (CASP)-Round XIV. Proteins 89, 1607–1617. doi:10.1002/prot.26237
Lin, Z., Akin, H., Rao, R., Hie, B., Zhu, Z., Lu, W., et al. (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379, 1123–1130. doi:10.1126/science.ade2574
Lu, R. M., Hwang, Y. C., Liu, I. J., Lee, C. C., Tsai, H. Z., Li, H. J., et al. (2020). Development of therapeutic antibodies for the treatment of diseases. J. Biomed. Sci. 27, 1. doi:10.1186/s12929-019-0592-z
Makowski, E. K., Wu, L., Gupta, P., and Tessier, P. M. (2021). Discovery-stage identification of drug-like antibodies using emerging experimental and computational methods. MAbs 13, 1895540. doi:10.1080/19420862.2021.1895540
Mason, D. M., Friedensohn, S., Weber, C. R., Jordi, C., Wagner, B., Meng, S. M., et al. (2021). Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning. Nat. Biomed. Eng. 5, 600–612. doi:10.1038/s41551-021-00699-9
Norman, R. A., Ambrosetti, F., Bonvin, A., Colwell, L. J., Kelm, S., Kumar, S., et al. (2020). Computational approaches to therapeutic antibody design: established methods and emerging trends. Brief. Bioinform. 21, 1549–1567. doi:10.1093/bib/bbz095
Parkinson, J., Hard, R., and Wang, W. (2023). The RESP AI model accelerates the identification of tight-binding antibodies. Nat. Commun. 14, 454. doi:10.1038/s41467-023-36028-8
Keywords: binding affinity, specificity, therapeutic antibody, developability, in silico method, molecular modeling, molecular simulations, machine learning
Citation: Sulea T, Kumar S and Kuroda D (2024) Editorial: Progress and challenges in computational structure-based design and development of biologic drugs. Front. Mol. Biosci. 11:1360267. doi: 10.3389/fmolb.2024.1360267
Received: 22 December 2023; Accepted: 01 February 2024;
Published: 07 February 2024.
Edited and reviewed by:
Didier Devaurs, University of Edinburgh, United KingdomCopyright © 2024 Sulea, Kumar and Kuroda. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Traian Sulea, dHJhaWFuLnN1bGVhQG5yYy1jbnJjLmdjLmNh; Sandeep Kumar, c2FuZGVlcC5rdW1hckBtb2Rlcm5hdHguY29t; Daisuke Kuroda, ZGt1cm9kYUBuaWlkLmdvLmpw