AUTHOR=Idakwo Gabriel , Thangapandian Sundar , Luttrell Joseph , Zhou Zhaoxian , Zhang Chaoyang , Gong Ping 

TITLE=Deep Learning-Based Structure-Activity Relationship Modeling for Multi-Category Toxicity Classification: A Case Study of 10K Tox21 Chemicals With High-Throughput Cell-Based Androgen Receptor Bioassay Data

JOURNAL=Frontiers in Physiology

VOLUME=Volume 10 - 2019

YEAR=2019

URL=https://www.frontiersin.org/journals/physiology/articles/10.3389/fphys.2019.01044

DOI=10.3389/fphys.2019.01044

ISSN=1664-042X

ABSTRACT=<p>Deep learning (DL) has attracted the attention of computational toxicologists as it offers a potentially greater power for <italic>in silico</italic> predictive toxicology than existing shallow learning algorithms. However, contradicting reports have been documented. To further explore the advantages of DL over shallow learning, we conducted this case study using two cell-based androgen receptor (AR) activity datasets with 10K chemicals generated from the Tox21 program. A nested double-loop cross-validation approach was adopted along with a stratified sampling strategy for partitioning chemicals of multiple AR activity classes (i.e., agonist, antagonist, inactive, and inconclusive) at the same distribution rates amongst the training, validation and test subsets. Deep neural networks (DNN) and random forest (RF), representing deep and shallow learning algorithms, respectively, were chosen to carry out structure-activity relationship-based chemical toxicity prediction. Results suggest that DNN significantly outperformed RF (<italic>p</italic> &lt; 0.001, ANOVA) by 22–27% for four metrics (precision, recall, F-measure, and AUPRC) and by 11% for another (AUROC). Further in-depth analyses of chemical scaffolding shed insights on structural alerts for AR agonists/antagonists and inactive/inconclusive compounds, which may aid in future drug discovery and improvement of toxicity prediction modeling.</p>