- 1Signaling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, Germany
- 2Institute of Biology II, University of Freiburg, Freiburg, Germany
- 3Institute of Biology III, University of Freiburg, Freiburg, Germany
Several platforms exist to perform molecular docking to computationally predict binders to a specific protein target from a library of ligands. The reverse, that is, docking a single ligand to various protein targets, can currently be done by very few web servers, which limits the search to a small set of pre-selected human proteins. However, the possibility to in silico predict which targets a compound identified in a high-throughput drug screen bind would help optimize and reduce the costs of the experimental workflow needed to reveal the molecular mechanism of action of a ligand. Here, we present ReverseDock, a blind docking web server based on AutoDock Vina specifically designed to allow users with no computational expertise to dock a ligand to 100 protein structures of their choice. ReverseDock increases the number and type of proteins a ligand can be docked to, making the task of in silico docking of a ligand to entire families of proteins straightforward. We envision ReverseDock will support researchers by providing the possibility to apply inverse docking computations using web browser. ReverseDock is available at: https://reversedock.biologie.uni-freiburg.de/
Introduction
Drug design efforts have benefited from the recent expansion of the small-molecule chemical space (Polishchuk et al., 2013). However, ligands chosen for a specific target protein may inadvertently inhibit other proteins within a particular pathway (Xie et al., 2011), or a ligand may bind multiple proteins from distinct pathways (Zhou et al., 2019). Several computational approaches are being developed to identify the target protein of a given ligand (Moumbock et al., 2019; Galati et al., 2021). Among these strategies, in cases where both the ligand and protein are novel, the utility of reverse docking protocols becomes evident, as machine learning and similarity-based screenings require previously known structures of protein–ligand pairs. Despite the increasing number of crystal structures, a significant proportion of potential protein–ligand complex structures remain uncharacterized. Consequently, reverse docking protocols emerge as a promising tool to bridge this gap.
While a few web servers exist for docking a ligand to multiple proteins, such as Acid (Wang et al., 2019), TarFisDock (Li et al., 2006) (offline as of 17.11.2022), and idTarget (Wang et al., 2012) (offline as of 17.11.2022), all of these servers restrict the analysis to pre-selected human drug target proteins (809, 698, and 2091, respectively). This limitation poses a challenge for users wishing to explore a list of proteins based on their interests. Conversely, Webina (Kochnev et al., 2020) and SeamDock (Murail et al., 2021) web servers permit users to submit their preferred ligands and proteins; however, this is limited to a single ligand–protein pair. As a result, a gap currently exists in the availability of a docking service capable of docking a given ligand to multiple user-submitted proteins. AutoDock Vina is one of the most commonly used open-source molecular docking software designed for the accurate prediction of protein–ligand interactions (Trott and Olson, 2010). It employs a hybrid search algorithm by combining genetic algorithms with a scoring function based on empirical binding affinity data (Trott and Olson, 2010). The hybrid search algorithm allows AutoDock Vina to efficiently explore the conformational space of ligands within a protein’s binding site, identifying energetically favorable binding modes and estimating binding affinities (Eberhardt et al., 2021). AutoDock Vina’s computational efficiency, combined with its ability to predict binding interactions with high accuracy, has made it an indispensable tool in virtual screening, lead optimization, and structure-based drug design. Furthermore, AutoDock Vina has two orders of magnitude speed and better docking pose accuracy compared to AutoDock 4, making it an ideal tool for high-throughput virtual screening applications (Chang et al., 2010; Nguyen et al., 2020). To bring the full power of docking to the experimental community, we developed ReverseDock, an AutoDock Vina-based, easy-to-use blind docking web server allowing users to freely select multiple protein targets for docking their ligand of interest. Furthermore, we demonstrate that among randomly selected proteins, ReverseDock is able to successfully capture the ranking and docking poses of ligands.
Methods
Preparation of docking files
The user can upload a ligand (.mol2) along with their preferred Protein Data Bank (PDB) structures (up to 100 structures, with less than 1,000 amino acids to minimize the risk of job failures due to an extensive search space). PDB files are first processed to remove nucleic acids, and then PDBFixer software (https://github.com/caiyingchun/pdbfixer) is applied to add missing amino acids, replace non-standard amino acids, remove heterogen atoms like water, and add missing heavy atoms. Finally, protonation of the ligand at pH 7 is achieved using Open Babel (O’Boyle et al., 2011), while proteins are protonated using the PROPKA method (Olsson et al., 2011) through pdb2pqr (Dolinsky et al., 2007).
Docking of the submitted ligand by AutoDock Vina
For both input and output purposes, AutoDock Vina (Trott and Olson, 2010) employs the PDBQT (PDB with charges and atom types) molecular structure file format, which includes information about the ligand’s structure, atom types, charges, and torsional degrees of freedom. PDBQT files also contain ROOT, ENDROOT, BRANCH, and ENDBRANCH keywords that are recognized by AutoDock Vina, which establish the torsion tree of the submitted ligand .mol2 file. As such, various rotamers of the submitted ligand can be generated during docking simulations. In ReverseDock, AutoDock Vina required PDBQT files are generated by using the MGLTools software (Morris et al., 2009). The search space for docking is defined as a box with an edge 30 Å larger than that of the target protein in order to avoid steric restrictions on the ligand’s possible binding positions to the target. As recommended by previous studies for the converged docking poses with AutoDock Vina (Agarwal and Smith, 2023), a fixed exhaustiveness score of 64 has been selected for all docking calculations. Exhaustiveness determines the number of iterations and poses that AutoDock Vina will explore during the docking process (Agarwal and Smith, 2023). A higher exhaustiveness value indicates that the software will explore a larger number of possible binding orientations and conformations for the ligand within the binding site. Despite its computational cost, this is beneficial to increase the likelihood of finding the optimal binding pose and improve the accuracy of the predicted binding affinity between the ligand and the protein. In ReverseDock, flexible ligand docking is applied for the docking of each ligand to the submitted proteins by AutoDock Vina (Trott and Olson, 2010).
Presentation of results
The results are displayed in a table, which are sorted by the calculated binding energy in kcal/mol. Each individual top docking ligand pose can be downloaded in PDBQT format. Additionally, each protein–ligand complex can be viewed in a 3D mode for quick analysis with the option to take a snapshot image of the docking pose.
Web server development
ReverseDock employs a microservice architecture that enables flexible scaling. For instance, docking simulations can be distributed across multiple interconnected computers. Services are tasked with docking using AutoDock Vina (Eberhardt et al., 2021), and preparing receptors and ligands for docking using MGLTools (Olsson et al., 2011). The entire process concludes with e-mail dispatch. All services are scripted in Python. Communication between services occurs via the AMQP protocol. Submissions are queued in a manner that optimizes resource usage; a submission can initiate computation without waiting for a prior submission to complete, provided resources are available. The outward-facing API is coded in .NET 6.0, adhering to the Controller–Service–Repository pattern. For persistent data storage, MongoDB is employed, and Redis functions as temporary caching. The front end is crafted using TypeScript, React, and Blueprint, with NGLViewer (Morris et al., 2009) deployed for 3D molecular representation.
Upon submitting a .mol2 file, followed by up to 100 .pdb files or UniProt IDs for retrieving AlphaFold structures, should the user choose to proceed, tasks are disseminated through AMQP for relevant services to consume. Computation commences once resources are at hand, with interim results exhibited on the webpage. Upon processing all targets, an email notification is sent if the user has supplied one during submission.
Demonstration cases
To evaluate the ranking and docking pose accuracy of ReverseDock, we created a list of random protein structures consisting of the following PDB IDs: 1udt, 2oyu, 3g6z, 3pbl, 2nnq, 3kba, 1uyg, 2uz3, 2hzi, 4ldo, 2i0e, 1sqt, 3m2w, 2oj9, 3erd, 3f9m, 1w7x, 2bgs, 2azr, and 2ica. Next, we extracted ligand .mol2 files from these PDB files and applied cross-docking calculations on ReverseDock. The results indicate that ReverseDock is able to identify the correct binding site in 75% (16/20) of the cases and can rank the corresponding protein–ligand complex in the top three positions in 50% (10/20) of the cases (Table 1), demonstrating that our tool can be used for target enrichment purposes of a given ligand. In addition to successful ranking, the best docking positions are also found to have an RMSD smaller than 3Å compared to the crystal structure in 55% (11/20) of the cases. A detailed inspection of incorrect binding site predictions indicates that buried, relatively large, or small ligands are not correctly identified.
TABLE 1. Cross-docking results are obtained by docking each ligand from the indicated PDB structure to all other PDB structures using ReverseDock. The ranking of the target PDB and the ligand RMSD of the first docking pose compared to the crystal structure is provided.
Discussion
As demonstrated by the cross-docking results presented previously, through the utilization of AutoDock Vina with a pre-defined box size 30 Å larger than the target protein’s box size and an exhaustiveness score of 64 (Agarwal and Smith, 2023), ReverseDock can rank docking energies and reproduce the docking pose of the previously identified protein–ligand complex structures.
To ensure the quality and reliability of ReverseDock outputs, it is essential to address potential caveats and pitfalls that could impact the accuracy of the results. The scoring function utilized by Autodock Vina to estimate binding energies comes with limitations. Users should exercise caution when interpreting binding energies, as they may not always precisely reflect experimental results. Furthermore, the accuracy of docking simulations is dependent on the precision of the generated conformations and the extent of conformational space sampling.
Various strategies can be applied to evaluate the ReverseDock results. Visual inspection via molecular visualization software would be helpful in assessing the alignment of predicted binding poses with previously identified protein–ligand complexes. Comparing predicted binding sites with references from experimental structures or literature could aid in assessing the consistency and accuracy of predictions. Employing consensus scoring by using alternative docking tools can enhance confidence, particularly when multiple tools validate a specific binding mode. While binding energies may not be directly comparable to experimental data, comparing relative energies within a ligand set offers insights into relative affinities. Validation against existing data on analogous protein–ligand systems would also be helpful in assessing the quality of the predictions.
Ultimately, the integration of computational predictions with experimental validations, such as binding assays and advanced structure determination techniques such as X-ray crystallography or NMR spectroscopy, is recommended to establish the reliability and relevance of docking results.
We believe that our tool would be valuable for experimental researchers aiming to conduct reverse docking protocols to identify the target of a given ligand.
Data availability statement
The complete source code of ReverseDock as well as the input data used for demonstration cases can be accessed at: https://github.com/orgs/ReverseDock/repositories.
Author contributions
FK implemented the web server with the contributions of KV. BD gave conceptual feedback and contributed to writing the manuscript. MÖ conceived, designed, and supervised the study and wrote the manuscript with input from FK. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Program (Grant agreement No. 101002044 to BD) and by the Excellence Initiatives of the German Federal and State Governments BIOSS (Centre for Biological Signalling Studies; EXC-294) and CIBSS (Centre for Integrative Biological Signalling Studies; EXC-2189).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Agarwal, R., and Smith, J. C. (2023). Speed vs accuracy: effect on ligand pose accuracy of varying box size and exhaustiveness in AutoDock Vina. Mol. Inf. 42 (2), 2200188. doi:10.1002/minf.202200188
Chang, M. W., Ayeni, C., Breuer, S., and Torbett, B. E. (2010). Virtual screening for HIV protease inhibitors: A comparison of AutoDock 4 and Vina. PLOS ONE 5 (8), e11955. doi:10.1371/journal.pone.0011955
Dolinsky, T. J., Czodrowski, P., Li, H., Nielsen, J. E., Jensen, J. H., and Klebe, G., (2007). PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res. 35, W522–W525. Web Server issue). doi:10.1093/nar/gkm276
Eberhardt, J., Santos-Martins, D., Tillack, A. F., and Forli, S. (2021). AutoDock Vina 1.2.0: new docking methods, expanded force field, and Python bindings. J. Chem. Inf. Model. 61 (8), 3891–3898. doi:10.1021/acs.jcim.1c00203
Galati, S., Di Stefano, M., Martinelli, E., Poli, G., and Tuccinardi, T. (2021). Recent advances in in silico target fishing. Mol. Basel Switz. 26 (17), 5124. doi:10.3390/molecules26175124
Kochnev, Y., Hellemann, E., Cassidy, K. C., and Durrant, J. D. (2020). Webina: an open-source library and web app that runs AutoDock Vina entirely in the web browser. Bioinformatics 36 (16), 4513–4515. doi:10.1093/bioinformatics/btaa579
Li, H., Gao, Z., Kang, L., Zhang, H., Yang, K., and Yu, K., (2006). TarFisDock: A web server for identifying drug targets with docking approach. Nucleic Acids Res. 34 (2), W219–W224. doi:10.1093/nar/gkl114
Morris, G. M., Huey, R., Lindstrom, W., Sanner, M. F., Belew, R. K., and Goodsell, D. S., (2009). AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem. 30 (16), 2785–2791. doi:10.1002/jcc.21256
Moumbock, A. F. A., Li, J., Mishra, P., Gao, M., and Günther, S. (2019). Current computational methods for predicting protein interactions of natural products. Comput. Struct. Biotechnol. J. 17, 1367–1376. doi:10.1016/j.csbj.2019.08.008
Murail, S., de Vries, S. J., Rey, J., Moroy, G., and Tufféry, P. (2021). SeamDock: an interactive and collaborative online docking resource to assist small compound molecular docking. Front. Mol. Biosci. 8, 716466. doi:10.3389/fmolb.2021.716466
Nguyen, N. T., Nguyen, T. H., Pham, T. N. H., Huy, N. T., Bay, M. V., and Pham, M. Q., (2020). Autodock Vina adopts more accurate binding poses but Autodock4 forms better binding affinity. J. Chem. Inf. Model. 60 (1), 204–211. doi:10.1021/acs.jcim.9b00778
O’Boyle, N. M., Banck, M., James, C. A., Morley, C., Vandermeersch, T., and Hutchison, G. R. (2011). Open babel: an open chemical toolbox. J. Cheminformatics 3 (1), 33. doi:10.1186/1758-2946-3-33
Olsson, M. H. M., Søndergaard, C. R., Rostkowski, M., and Jensen, J. H. (2011). PROPKA3: consistent treatment of internal and surface residues in empirical PKa predictions. J. Chem. Theory Comput. 7 (2), 525–537. doi:10.1021/ct100578z
Polishchuk, P. G., Madzhidov, T. I., and Varnek, A. (2013). Estimation of the size of drug-like chemical space based on GDB-17 data. J. Comput. Aided Mol. Des. 27 (8), 675–679. doi:10.1007/s10822-013-9672-4
Trott, O., and Olson, A. J. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31 (2), 455–461. doi:10.1002/jcc.21334
Wang, F., Wu, F.-X., Li, C.-Z., Jia, C.-Y., Su, S.-W., and Hao, G.-F., (2019). Acid: A free tool for drug repurposing using consensus inverse docking strategy. J. Cheminformatics 11 (1), 73. doi:10.1186/s13321-019-0394-z
Wang, J.-C., Chu, P.-Y., Chen, C.-M., and Lin, J.-H. (2012). IdTarget: A web server for identifying protein targets of small chemical molecules with robust scoring functions and a divide-and-conquer docking approach. Nucleic Acids Res. 40 (W1), W393–W399. doi:10.1093/nar/gks496
Xie, L., Xie, L., and Bourne, P. E. (2011). Structure-based systems biology for analyzing off-target binding. Curr. Opin. Struct. Biol. 21 (2), 189–199. doi:10.1016/j.sbi.2011.01.004
Keywords: AutoDock Vina, web server, drug design, off-target analysis, blind docking
Citation: Krause F, Voigt K, Di Ventura B and Öztürk MA (2023) ReverseDock: a web server for blind docking of a single ligand to multiple protein targets using AutoDock Vina . Front. Mol. Biosci. 10:1243970. doi: 10.3389/fmolb.2023.1243970
Received: 21 June 2023; Accepted: 25 September 2023;
Published: 10 October 2023.
Edited by:
Ozlem Tastan Bishop, Rhodes University, South AfricaReviewed by:
Ozge Sensoy, Istanbul Medipol University, TürkiyeMahmoud A. A. Ibrahim, Minia University, Egypt
Copyright © 2023 Krause, Voigt, Di Ventura and Öztürk. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mehmet Ali Öztürk, bWVobWV0Lm9lenR1ZXJrQGJpb3NzLnVuaS1mcmVpYnVyZy5kZQ==