AUTHOR=Bonanno Etienne , Ebejer Jean-Paul TITLE=Applying Machine Learning to Ultrafast Shape Recognition in Ligand-Based Virtual Screening JOURNAL=Frontiers in Pharmacology VOLUME=10 YEAR=2020 URL=https://www.frontiersin.org/journals/pharmacology/articles/10.3389/fphar.2019.01675 DOI=10.3389/fphar.2019.01675 ISSN=1663-9812 ABSTRACT=
Ultrafast Shape Recognition (USR), along with its derivatives, are Ligand-Based Virtual Screening (LBVS) methods that condense 3-dimensional information about molecular shape, as well as other properties, into a small set of numeric descriptors. These can be used to efficiently compute a measure of similarity between pairs of molecules using a simple inverse Manhattan Distance metric. In this study we explore the use of suitable Machine Learning techniques that can be trained using USR descriptors, so as to improve the similarity detection of potential new leads. We use molecules from the Directory for Useful Decoys-Enhanced to construct machine learning models based on three different algorithms: Gaussian Mixture Models (GMMs), Isolation Forests and Artificial Neural Networks (ANNs). We train models based on full molecule conformer models, as well as the Lowest Energy Conformations (LECs) only. We also investigate the performance of our models when trained on smaller datasets so as to model virtual screening scenarios when only a small number of actives are known