AUTHOR=Serafim Mateus Sá Magalhães , Pantaleão Simone Queiroz , da Silva Elany Barbosa , McKerrow James H. , O’Donoghue Anthony J. , Mota Bruno Eduardo Fernandes , Honorio Kathia Maria , Maltarollo Vinícius Gonçalves TITLE=The importance of good practices and false hits for QSAR-driven virtual screening real application: a SARS-CoV-2 main protease (Mpro) case study JOURNAL=Frontiers in Drug Discovery VOLUME=3 YEAR=2023 URL=https://www.frontiersin.org/journals/drug-discovery/articles/10.3389/fddsv.2023.1237655 DOI=10.3389/fddsv.2023.1237655 ISSN=2674-0338 ABSTRACT=
Computer-Aided Drug Design (CADD) approaches, such as those employing quantitative structure-activity relationship (QSAR) methods, are known for their ability to uncover novel data from large databases. These approaches can help alleviate the lack of biological and chemical data, but some predictions do not generate sufficient positive information to be useful for biological screenings. QSAR models are often employed to explain biological data of chemicals and to design new chemicals based on their predictions. In this review, we discuss the importance of data set size with a focus on false hits for QSAR approaches. We assess the challenges and reliability of an initial