AUTHOR=Costa Eduardo , Johnson Kamin J. , Walker Carl A. , O’Brien Jason M. TITLE=Transcriptomic point of departure determination: a comparison of distribution-based and gene set-based approaches JOURNAL=Frontiers in Genetics VOLUME=15 YEAR=2024 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2024.1374791 DOI=10.3389/fgene.2024.1374791 ISSN=1664-8021 ABSTRACT=

A key step in assessing the potential human and environmental health risks of industrial and agricultural chemicals is to determine the toxicity point of departure (POD), which is the highest dose level that causes no adverse effect. Transcriptomic POD (tPOD) values have been suggested to accurately estimate toxicity POD values. One step in the most common approach for tPOD determination involves mapping genes to annotated gene sets, a process that might lead to substantial information loss particularly in species with poor gene annotation. Alternatively, methods that calculate tPOD values directly from the distribution of individual gene POD values omit this mapping step. Using rat transcriptome data for 79 molecules obtained from Open TG-GATEs (Toxicogenomics Project Genomics Assisted Toxicity Evaluation System), the hypothesis was tested that methods based on the distribution of all individual gene POD values will give a similar tPOD value to that obtained via the gene set-based method. Gene set-based tPOD values using four different gene set structures were compared to tPOD values from five different individual gene distribution methods. Results revealed a high tPOD concordance for all methods tested, especially for molecules with at least 300 dose-responsive probesets: for 90% of those molecules, the tPOD values from all methods were within 4-fold of each other. In addition, random gene sets based upon the structure of biological knowledge-derived gene sets produced tPOD values with a median absolute fold change of 1.3–1.4 when compared to the original biological knowledge-derived gene set counterparts, suggesting that little biological information is used in the gene set-based tPOD generation approach. These findings indicate using individual gene distributions to calculate a tPOD is a viable and parsimonious alternative to using gene sets. Importantly, individual gene distribution-based tPOD methods do not require knowledge of biological organization and can be applied to any species including those with poorly annotated gene sets.