AUTHOR=Liu Ruifeng , AbdulHameed Mohamed Diwan M. , Wallqvist Anders TITLE=Teaching an Old Dog New Tricks: Strategies That Improve Early Recognition in Similarity-Based Virtual Screening JOURNAL=Frontiers in Chemistry VOLUME=7 YEAR=2019 URL=https://www.frontiersin.org/journals/chemistry/articles/10.3389/fchem.2019.00701 DOI=10.3389/fchem.2019.00701 ISSN=2296-2646 ABSTRACT=
High throughput screening (HTS) is an important component of lead discovery, with virtual screening playing an increasingly important role. Both methods typically suffer from lack of sensitivity and specificity against their true biological targets. With ever-increasing screening libraries and virtual compound collections, it is now feasible to conduct follow-up experimental testing on only a small fraction of hits. In this context, advances in virtual screening that achieve enrichment of true actives among top-ranked compounds (“early recognition”) and, hence, reduce the number of hits to test, are highly desirable. The standard ligand-based virtual screening method for large compound libraries uses a molecular similarity search method that ranks the likelihood of a compound to be active against a drug target by its highest Tanimoto similarity to known active compounds. This approach assumes that the distributions of Tanimoto similarity values to all active compounds are identical (i.e., same mean and standard deviation)—an assumption shown to be invalid (Baldi and Nasr,