Machine Learning in Computer-Aided Drug Design

  • 3,461

    Total downloads

  • 20k

    Total views and downloads

About this Research Topic

Submission closed

Background

Machine learning (ML) became an unavoidable tool in contemporary science and technology. Its ability to offer solutions to hard, multidimensional problems makes it an attractive tool for practitioners in diverse fields, from social engineering to structural biology and molecular dynamics, and to complete information games such as chess or Go. The disruptive power of recent advances in deep learning (DL) such as attention networks, or denoising diffusion probabilistic models (DDPM) changed the field of molecular and structural biology and offered a solution to long-standing protein folding problem. However, ML poses a serious problem and that is the dissociation of practical results achieved with ML in a given field from theory. ML is based on solid mathematical work, and new improvements rigorously follow thorough research protocols, but the key problem stays largely unsolved, and that is how rules which trained ML algorithms (neural DL networks in particular) use to make classifications are organized. We understand the algorithms used to optimize the decision rules, and we can analyze the weights distributions in hidden layers, but we cannot clearly decipher, in human-readable terms, how those algorithms emphasize features important for classification (deterministic chaos in latent space). We are, therefore, in possession of a powerful system with non-transferable knowledge (in a sense of non-transferable artistic ability; an artist can be trained, but the ability cannot be simply copied). In some instances, that is not an issue, for example, if the underlying theoretical foundations are well known (e.g., in computational physics/mechanics), and the major task is to produce a more efficient algorithm than the one based on classical theory. For those tasks, the ML algorithms showed to be meaningful, efficient, and useful.

The inability of ML to offer simple, human-readable interpretations of the rules for converting multi-dimensional input data stream into output classes makes ML a closed system, an inaccessible black box, a sort of Deux-ex-Machina, and the movements in latent space that stay largely hidden from human observers (the latent space can be perceived as a space of “hidden variables” in Quantum mechanics). One can argue that this is just an issue of linguistics or epistemology, but we find it crucial if we aim to fully understand biological or physical processes. Conversely, the ability of ML protocols to efficiently extract key features from a given data set makes them vulnerable to input data sampling bias. In scientific terms, the ML algorithms easily find the local minima given the input stream, but the global minima may be out of reach due to limited sampling. This issue is especially emphasized in natural sciences, where sampling bias is almost inevitable. All this shows that we may end up where we have started, with observable phenomena with no clear explanation. That may lead to a saturation effect of the explanatory ability of science, with weak feedback for knowledge improvement.

ML lacks the overall knowledge of the world, i.e., artificial general intelligence (AGI) still does not exist, and the ability to access general knowledge was instrumental to researchers’ ability to cross the sampling gap and guided them toward correct explanations (the Copernican revolution in astronomy, given observational data in the XVI century, would not be possible if Nicolas Copernicus did not possess a wider understanding and knowledge of the world).

In computer-aided-drug-design ML has been used to recognize drug binding sites, binding modes, and conformations, to speed up costly MD calculations (QM/MM in particular), and to optimize potential hits. The aim of practitioners of ML was to reduce the exorbitant costs of drug development and shorten the development and, correspondingly, lengthen the applicability of patent rights. ML can help in that regard, but the above comments still apply. Without a full compendium of cell signaling process, and still not fully resolved principles of molecular interactions and dynamics, machine learning’s ability to filter out unnecessary details from the input stream (experimentally obtained molecular structures, interactions, and clinical data), seems like a short-term success. What is necessary is to understand cause and effect both on the micro and macro level (cells, tissues, organs, individuals), together with effects of timescales and time relativity inside cells and tissues.

To address that we would like to invite practitioners of ML and drug hunters to submit manuscripts to this Research topic showing research that utilizes ML protocols/architectures but offers a detailed and comprehensive interpretation of observed phenomena. The topics that can be addressed with ML and that we are interested in are molecular dynamics (MD) acceleration techniques, implicit solvent improvements, small molecules force field optimizations, cryptic pocket discovery and their physical interpretation, allosteric effects discovery, and their application to undruggable targets. The toxicity in drug discovery is also a topic we would like to see addressed. We would also encourage the application of ML to the analysis of interaction frequencies between genome loci in the nucleus as statistical averages over cell populations, and their relation (and potential clinical use) to single cells analyses via 3C and fluorescence in situ hybridization techniques. The already established fractal and continuous polymer models of chromatin are ripe for deeper interpretations with the help of ML tools together with the experimental data on covalent modifications.

As a tool of choice for data analysis in experimental and observational sciences, ML helped produce a deluge of research papers, many with limited/questionable scientific merit. We would, therefore, avoid simple ML analysis of input data sets, or comparison of different ML algorithms (e.g., neural networks vs. random forests, etc.). Manuscripts that use machine learning, but at the same time offer solid theoretical interpretations of results are, on the other hand welcome. That may not only help drug discovery and molecular biology but also benefit the machine learning field as well, as it may shed light on the underlying processes in the latent space of variables.



Dr. Gunady is currently an employee of Illumina Inc; Dr.Perišić is an employee (Research Scientist) of Redesign Science. All other Topic Editors declare no competing interests.

Research Topic Research topic image

Keywords: machine learning, small molecule force field optimization, drug design

Important note: All contributions to this Research Topic must be within the scope of the section and journal to which they are submitted, as defined in their mission statements. Frontiers reserves the right to guide an out-of-scope manuscript to a more suitable section or journal at any stage of peer review.

Frequently asked questions

  • Frontiers' Research Topics are collaborative hubs built around an emerging theme.Defined, managed, and led by renowned researchers, they bring communities together around a shared area of interest to stimulate collaboration and innovation.

    Unlike section journals, which serve established specialty communities, Research Topics are pioneer hubs, responding to the evolving scientific landscape and catering to new communities.

  • The goal of Frontiers' publishing program is to empower research communities to actively steer the course of scientific publishing. Our program was implemented as a three-part unit with fixed field journals, flexible specialty sections, and dynamically emerging Research Topics, connecting communities of different sizes and maturity.

    Research Topics originate from the scientific community. Many of our Research Topics are suggested by existing editorial board members who have identified critical challenges or areas of interest in their field.

  • As an editor, Research Topics will help you build your journal, as well as your community, around emerging, cutting-edge research. As research trailblazers, Research Topics attract high-quality submissions from leading experts all over the world.

    A thriving Research Topic can potentially evolve into a new specialty section if there is sustained interest and a growing community around it.

  • Each Research Topic must be approved by the specialty chief editor, and it falls under the editorial oversight of our editorial boards, supported by our in-house research integrity team. The same standards and rigorous peer review processes apply to articles published as part of a Research Topic as for any other article we publish.

    In 2023, 80% of the Research Topics we published were edited or co-edited by our editorial board members, who are already familiar with their journal's scope, ethos, and publishing model. All other topics are guest edited by leaders in their field, each vetted and formally approved by the specialty chief editor.

  • Publishing your article within a Research Topic with other related articles increases its discoverability and visibility, which can lead to more views, downloads, and citations. Research Topics grow dynamically as more published articles are added, causing frequent revisiting, and further visibility.

    As Research Topics are multidisciplinary, they are cross-listed in several fields and section journals – increasing your reach even more and giving you the chance to expand your network and collaborate with researchers in different fields, all focusing on expanding knowledge around the same important topic.

    Our larger Research Topics are also converted into ebooks and receive social media promotion from our digital marketing team.

  • Frontiers offers multiple article types, but it will depend on the field and section journals in which the Research Topic will be featured. The available article types for a Research Topic will appear in the drop-down menu during the submission process.

    Check available article types here 

  • Yes, we would love to hear your ideas for a topic. Most of our Research Topics are community-led and suggested by researchers in the field. Our in-house editorial team will contact you to talk about your idea and whether you’d like to edit the topic. If you’re an early-stage researcher, we will offer you the opportunity to coordinate your topic, with the support of a senior researcher as the topic editor. 

    Suggest your topic here 

  • A team of guest editors (called topic editors) lead their Research Topic. This editorial team oversees the entire process, from the initial topic proposal to calls for participation, the peer review, and final publications.

    The team may also include topic coordinators, who help the topic editors send calls for participation, liaise with topic editors on abstracts, and support contributing authors. In some cases, they can also be assigned as reviewers.

  • As a topic editor (TE), you will take the lead on all editorial decisions for the Research Topic, starting with defining its scope. This allows you to curate research around a topic that interests you, bring together different perspectives from leading researchers across different fields and shape the future of your field. 

    You will choose your team of co-editors, curate a list of potential authors, send calls for participation and oversee the peer review process, accepting or recommending rejection for each manuscript submitted.

  • As a topic editor, you're supported at every stage by our in-house team. You will be assigned a single point of contact to help you on both editorial and technical matters. Your topic is managed through our user-friendly online platform, and the peer review process is supported by our industry-first AI review assistant (AIRA).

  • If you’re an early-stage researcher, we will offer you the opportunity to coordinate your topic, with the support of a senior researcher as the topic editor. This provides you with valuable editorial experience, improving your ability to critically evaluate research articles and enhancing your understanding of the quality standards and requirements for scientific publishing, as well as the opportunity to discover new research in your field, and expand your professional network.

  • Yes, certificates can be issued on request. We are happy to provide a certificate for your contribution to editing a successful Research Topic.

  • Research Topics thrive on collaboration and their multi-disciplinary approach around emerging, cutting-edge themes, attract leading researchers from all over the world.

  • As a topic editor, you can set the timeline for your Research Topic, and we will work with you at your pace. Typically, Research Topics are online and open for submissions within a few weeks and remain open for participation for 6 – 12 months. Individual articles within a Research Topic are published as soon as they are ready.

    Find out more about our Research Topics

  • Our fee support program ensures that all articles that pass peer review, including those published in Research Topics, can benefit from open access – regardless of the author's field or funding situation.

    Authors and institutions with insufficient funding can apply for a discount on their publishing fees. A fee support application form is available on our website.

  • In line with our mission to promote healthy lives on a healthy planet, we do not provide printed materials. All our articles and ebooks are available under a CC-BY license, so you can share and print copies.