- 1Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- 2Institute of Computer Technology, Seoul National University, Seoul, Republic of Korea
- 3AIGENDRUG Co., Ltd., Seoul, Republic of Korea
- 4Interdisciplinary Program in Artificial Intelligence, Seoul National University, Seoul, Republic of Korea
- 5Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
Introduction: Drug-induced liver injury (DILI) has been investigated at the patient level. Analysis of gene perturbation at the cellular level can help better characterize biological mechanisms of hepatotoxicity. Despite accumulating drug-induced transcriptome data such as LINCS, analyzing such transcriptome data upon drug treatment is a challenging task because the perturbation of expression is dose and time dependent. In addition, the mechanisms of drug toxicity are known only as literature information, not in a computable form.
Methods: To address these challenges, we propose a Multi-Dimensional Transcriptomic Ruler (MDTR) that quantifies the degree of DILI at the transcriptome level. To translate transcriptome data to toxicity-related mechanisms, MDTR incorporates KEGG pathways as representatives of mechanisms, mapping transcriptome data to biological pathways and subsequently aggregating them for each of the five hepatotoxicity mechanisms. Given that a single mechanism involves multiple pathways, MDTR measures pathway-level perturbation by constructing a radial basis kernel-based toxicity space and measuring the Mahalanobis distance in the transcriptomic kernel space. Representing each mechanism as a dimension, MDTR is visualized in a radar chart, enabling an effective visual presentation of hepatotoxicity at transcriptomic level.
Results and Discussion: In experiments with the LINCS dataset, we show that MDTR outperforms existing methods for measuring the distance of transcriptome data when describing for dose-dependent drug perturbations. In addition, MDTR shows interpretability at the level of DILI mechanisms in terms of the distance, i.e., in a metric space. Furthermore, we provided a user-friendly and freely accessible website (http://biohealth.snu.ac.kr/software/MDTR), enabling users to easily measure DILI in drug-induced transcriptome data.
1 Introduction
Drug-induced hepatotoxicity (also known as drug-induced liver injury; DILI) is a serious issue for both drug development and patient safety (Kaplowitz, 2004; Wilke et al., 2007). Traditional studies of toxicity have used animal models. This approach is time-consuming and costly, and may not accurately predict human toxicity (Van Norman, 2019; Bracken, 2009). Meanwhile, in vitro bioassays offer a more direct insight into human biology, with lower costs and ethical concerns (Chapman et al., 2013; Van Norman, 2019). Because these bioassays enable high-throughput screening, there is a growing interest to use the bioassays for toxicity signature screening to analyze toxicity at the individual patient cases.
The increasing availability of large-scale chemical libraries and gene expression data has significantly advanced our ability to investigate drug-induced toxicities. Several public resources have been developed to facilitate high-throughput screening and toxicity profiling. Among these, Tox21, a collaborative US federal research program, focuses on developing in vitro assays to screen for potentially toxic chemicals. Tox21 has pioneered the use of medium-to high-throughput panel assays to test thousands of chemicals for potential toxicity (Andersen and Krewski, 2009; Krewski et al., 2010). ToxCast, another program led by the US Environmental Protection Agency, extends the capabilities of Tox21 by offering medium- and high-throughput screening data for a wide range of chemicals (Dix et al., 2007). However, high throughput panel assays cannot measure toxic effect of drug at the transcriptomic level, thus molecular level mechanism of action (MoA) of drug response cannot be analyzed. To overcome the limitation of the panel assays, other resources such as the Genomics of Drug Sensitivity in Cancer (GDSC, Garnett et al. (2012)) and the Cancer Cell Line Encyclopedia (CCLE, Barretina et al. (2012)) provide valuable gene expression data and drug response metrics, such as IC50 and AUC values, across various cancer cell lines. However, both GDSC and CCLE lack the gene expression data that reflect post-drug treatment states. To address these gaps, the Library of Integrated Network-Based Cellular Signatures (LINCS) serves as a comprehensive resource that provides in vitro gene expression data after drug treatment (Subramanian et al., 2017). LINCS allows for the analysis of drug-induced perturbations over time and across different doses in various cell lines.
1.1 Challenges
Understanding how a drug interacts and affects biological systems to induce hepatotoxicity with the drug-induced transcriptome data is a goal of this study. The perturbed biological mechanisms by dysregulated genes can be interpreted as a degree of hepatotoxicity at transcriptomic level. However, despite the abundance of drug response and gene expression data, measuring the degree of drug-induced hepatotoxicity faces three major challenges.
1. Data availability varies greatly depending on drug treatment conditions. For example, the LINCS dataset includes gene expression values for 12,328 genes across an average of 16 samples (with a standard deviation of 30) for varying dose and time point combinations (Supplementary Figure S1). It causes a high-dimensional low-sample issue and hinders accurate interpretation of the MoA of drugs.
2. Identifying toxic patterns or signatures among drug-treated expression data is challenging. In the current state of knowledge, hepatotoxic signatures at transcriptomic level are insufficient (Andrade et al., 2019), and these signatures are dose- and time-dependent, as exemplified by hormesis (Mattson, 2008). Furthermore, biological mechanisms related to hepatotoxicity are available only in the literature. If possible, we need computational method to translate drug-induced transcriptome data to known biological mechanisms related to hepatotoxicity.
3. Defining the boundaries of toxicity is complex. Cell survival is governed by maintaining homeostasis, which is influenced by various conditions such as temperature and oxygen levels (Chovatiya and Medzhitov, 2014). Any disruption beyond the homeostasis boundary can lead to cell death with cytotoxic effects. Consequently, the state of toxicity cannot be confined within a specific boundary between toxic and non-toxic labels.
1.2 Our approach
To address these challenges, we propose MDTR, a knowledge-guided Multi-Dimensional Transcriptomic Ruler for quantifying the degree of drug-induced hepatotoxicity at the transcriptomic level. We first compiled five biological mechanisms of hepatotoxicity from the recent literature (Andrade et al., 2019; Weaver et al., 2020; Han et al., 2020). To translate transcriptome data to the five hepatotoxicity mechanisms, MDTR incorporates KEGG pathways as representative of mechanisms, mapping transcriptome data to these pathways and subsequently aggregating them for each of the five hepatotoxicity mechanisms. This involves three steps: (1) Identifying the most-perturbed transcriptomic samples as outliers using Dual-SVDD. The rationale behind this assumption is that toxic signatures are distinct from all remaining transcriptomic samples. (2) Constructing a transcriptomic embedding space using the Radial Basis Function (RBF) kernel and defining a ruler in the kernel space. (3) Extending the ruler into a five-dimensional radar chart for interpreting hepatotoxicity based on the knowledge-guided biological mechanisms.
As a result, MDTR represents the degree of DILI as the distances that measure the perturbation of the biological mechanisms under the drug treated environment. MDTR distance outperforms existing methods, which measure distances between transcriptomic samples, in reflecting the dose-dependent effects of drug on liver injury. In addition to its quantitative capabilities, MDTR provides interpretive power for understanding the MoA. We also provide a website for calculating and visualizing the five-dimension radar chart using drug-treated transcriptomic data input by scientists.
2 Methods and materials
In this section, we introduce the details of MDTR, which aims to measure the degree of hepatotoxicity from drug-induced transcriptome data. MDTR consists of two steps: (1) Exploration of hepatotoxic signatures through data- and knowledge-driven view. (2) Calculation of the degree of hepatotoxicity by a knowledge-guided multi-dimensional ruler. Figure 1 illustrates the overview of MDTR.
Figure 1. The overview of MDTR. To measure the degree of liver toxicity at the transcriptome level, hepatotoxic signatures were explored through both (A) data- and (B) knowledge-driven views, and (C) a knowledge-guided multi-dimensional ruler MDTR was proposed. (A) Dual-SVDD workflow. For drug-treated transcriptome data of LINCS, Dual-SVDD generates dual-boundaries to identify the most perturbed, i.e., potential toxic samples. (B) Literature search workflow. Five major DILI mechanisms were proposed through literature searches, and the corresponding biological pathways were collected from KEGG. (C) MDTR workflow. Each biological pathway within a mechanism calculates a pathway-level distance in the Radial Basis Function (RBF) kernel space. These pathway-level distances are then aggregated to form the mechanism-level distance, which serves as an axis in the multi-dimensional ruler.
2.1 Exploration of hepatotoxic signatures at transcriptomic level
As mentioned in challenge #2, the signatures that can be utilized to measure drug-induced hepatotoxicity at the cellular level have not yet been clearly defined. To bridge the current gap in available data and knowledge, we explore hepatotoxic signatures through two approaches: (i) data-driven view: identification of potential toxic samples by Dual-SVDD. (ii) knowledge-driven view: literature mining of biological mechanisms related to drug-induced hepatotoxicity.
2.1.1 Data-driven view: identification of potential toxic samples by Dual-SVDD
Upon drug treatment, biological mechanisms are perturbed, leading to changes in gene expression levels. As mentioned in challenge #2, although the toxic signatures may not be clear, samples that are significantly influenced by the toxic drug, either by the treated time or dose, may show significant dysregulation in gene expression. Indeed, the perturbation signature of LINCS is known to be associated with cell viability linked to regulation of transcription factors such as apoptosis and proliferation (Szalai et al., 2019). In particular, toxic compounds can induce cell death signatures, suggesting the potential to predict MoA through drug-specific perturbation profiles (Niepel et al., 2017). Based on this understanding, we assumed that samples containing a toxic signature would show distinct perturbations among the toxic-treated transcriptome data, and we referred to these samples as “potentially toxic (PT)” samples. However, in the process of selecting PT samples, as mentioned in challenge #3, the processes of dysregulation due to toxicity are diverse, making it difficult to define them with a single, closed boundary. Therefore, we designed an approach of defining boundaries from relatively homogeneous effects and detecting PT samples as outliers compared to other drug-induced transcriptome samples.
To identify the PT samples, we propose a novel method called Dual-SVDD, which consists of two boundaries: the drug-effect boundary and the toxic-effect boundary. This method is based on the construction of a one-class boundary method, Support Vector Data Description (SVDD, Tax and Duin (2004)). Formally, given a set of samples
where
The samples with
when
Then, we additionally designed a toxic-effect boundary. Instead of DMSO-treated samples, we utilized a target dataset
Using the
2.1.2 Knowledge-driven view: biological mechanisms related to drug-induced liver injury
Through a comprehensive literature review, we curated commonly discussed biological mechanisms related to hepatotoxicity in existing studies (Andrade et al., 2019; Weaver et al., 2020; Han et al., 2020), and summarized them into five biological mechanisms: Oxidative stress, Immunological response, Altered lipid metabolism, Mitochondrial dysfunction, and Bile acids accumulation (Table 1).
Table 1. The five hepatotoxicity mechanisms used in the MDTR. The table shows the mechanisms of hepatotoxicity categorized into five major groups based on literature search, serving as the axis of MDTR. For each mechanisms, it shows the action in the liver and the number of KEGG pathways belonging to the mechanism.
These biological mechanisms are conceptually associated with hepatotoxicity but their applicability in a computational approach is not known. To address this, we explored biological pathways associated with the mechanisms and utilized the gene sets within those pathways. To identify pathways relevant to these mechanisms, we used the KEGG Pathway Search function, which conducts a keyword search against the KEGG pathway database (Kanehisa, 2002). As the search function performs partial matches on text descriptions or legends for multiple keywords, there is a possibility of false positives in the search results. For example, when “oxidative stress” is searched, the term ‘stress’ may detect other stress-related pathways. Moreover, even if keywords are included in the description, there may be portions that are far from the main function of the pathway. Thus, we manually curated the search results to ensure their relevance to the mechanism. Further details of the selection process for each mechanism and the selected pathway list are provided in Supplementary Methods section and Supplementary Tables S1-S5.
2.2 Calculation of the degree of hepatotoxicity by a knowledge-guided multi-dimensional ruler
Through the exploration of hepatotoxic signatures from both data-driven and knowledge-driven perspectives, we identified PT samples with a high likelihood of harboring drug-induced hepatotoxic signatures, as well as biological mechanisms associated with hepatotoxicity and corresponding pathways. Leveraging this valuable information, we introduced MDTR, quantifies the degree of hepatotoxicity as dysfunctions of biological mechanisms. The MDTR is represented as a five-dimensional radar chart, with each dimension corresponding to a biological mechanism, calculated through mechanism-level toxic distances.
Formally, given a biological mechanism
where
Given a pathway
where
The weight
where
2.3 Website for measuring the degree of toxicity through MDTR
The website (http://biohealth.snu.ac.kr/software/MDTR) is designed to measure the degree of liver toxicity of drugs by analyzing their gene expression data. The website uses MDTR to show the potential liver toxicity of drugs across the five mechanisms of hepatotoxicity.
Additionally, the website leverages the LINCS dataset to access gene expression data from 514 drugs across 70 different cell lines, specifically utilizing L1000 Level 5 data, which are generated by calculating z-scores relative to the controls. This provides a comprehensive assessment of potential hepatotoxicity at the transcriptomic level. The website is implemented using the Django 3.2 framework and Bootstrap 4.6 for the front-end. The interactive radar chart, representing the indicators of liver toxicity, is generated using Chart. js. Each axis of the radar chart represents an indicator of liver toxicity. The website is tested for compatibility and functionality on various web browsers, including Chrome, Microsoft Edge, Firefox, and Safari.
2.4 Preparation of drug hepatotoxicity information and drug-induced transcriptome data
Two databases, Drug Induced Liver Injury Rank (DILIrank, Chen et al. (2016)) and LiverTox (Hoofnagle, 2013), were used to gather information on drug-induced hepatotoxicity. DILIrank categorizes drugs into four classes based on their potential for causing hepatotoxicity, while LiverTox assigns likelihood scores indicating the extent of reported liver injury cases. For this study, drugs categorized as most-DILI concern in DILIrank, and A and B in LiverTox were considered hepatotoxic. Drugs categorized as no-DILI concern in DILIrank, and D and E in LiverTox were considered non-hepatotoxic. A total of 220 hepatotoxic drugs and 402 non-hepatotoxic drugs were used in the analysis.
Drug-treated samples from the LINCS database (accession number GSE92742) were obtained for investigating drug-induced hepatotoxicity at the transcriptomic level. For the gene expression matrix of these samples, we used Level 5 data, which are generated by calculating z-scores relative to the controls. Categorization of these samples into DMSO, non-toxic, and toxic groups was guided by annotations from DILIrank and LiverTox. A total of 20,529 drug-treated samples (6,405 toxic, 11,333 non-toxic and 2,791 DMSO-treated samples) were collected. From the LINCS dataset, among the 10,174 LINCS best inference genes, 4,693 genes related to drug-induced liver injury were selected based on the Comparative Toxicogenomics Database (CTD, Davis et al. (2021)) and enrichment analysis of Gene Ontology (GO) terms. Further description and discussion of drug hepatotoxicity information, transcriptome data collection, and the selection of gene sets are provided in the Supplementary Methods section.
3 Results and discussion
3.1 MDTR: a knowledge-guided representation for liver toxicity at transcriptomic level
Among the 1,896 drug-treated transcriptome samples, Dual-SVDD selected the most perturbed samples, the extreme point of MDTR. As a result, 64 potentially toxic (PT) samples were identified (Supplementary Table S6). We showed the utility of Dual-SVDD through its ability to distinguish toxic samples and the robustness of dual-boundaries (Supplementary Figure S2). In addition, to evaluate the reliability of the proposed measurement, we conducted self-validation by partitioning the total dataset and performing cross-check analyses (Supplementary Figure S3). As a result, our measurement method showed consistent distances regardless of the configuration of the data.
Based on the PT samples and the curated biological mechanisms, MDTR measures the degree of DILI. Figure 2A shows a distribution of maximum distances, representing the density distribution of maximum values on the MDTR of the samples. Notably, significant statistical differences were observed between the sample groups identified by Dual-SVDD. This indicates that the MDTR distance measured from PT samples meaningfully reflects the perturbation of gene expression affected by drug treatments. Among the drug-induced transcriptome data, Figure 2B illustrates examples of radar charts from MDTR. The red-colored radar chart (Figure 2B, right), representing the treated compound labels as structural toxicity as ‘Toxic’, shows greater distances in comparison to the other two examples. Interestingly, even when the chemical structural information is identical and labeled as ‘Non-toxic’, the MDTR results of the two drug-induced transcriptome samples (green vs. blue-colored) show obvious differences. This indicates the importance of analyzing drug hepatotoxicity at the transcriptome level.
Figure 2. The results of MDTR. (A) Maximum distance distribution on the ruler. The density distribution shows the maximum values on the MDTR of the samples divided according to the Dual-SVDD boundary. (B) Examples of multi-dimensional rulers using radar charts for drug-treated transcriptome samples.
We further investigated the capability of MDTR in identifying samples treated with hepatotoxic drugs. We screened top-rankded drugs for liver toxicity, labeled as ‘Most-DILI’ in DILIrank or assigned category ‘A’ in LiverTox, as well as drugs with an absence of reported toxicity, labeled as ‘No-DILI’ in DILIrank or category ‘E’ in LiverTox. Then, we used samples treated with the selected drugs in liver cell lines, including HEPG2, HUH7, and PHH. This resulted in the identification of 10 toxic and 43 non-toxic samples, respectively. When comparing the distances between the two sample groups, we observed larger distances for toxic samples across all mechanisms, with statistically significant differences found in all mechanisms except for Bile acids accumulation (Figure 3). Interestingly, similar results were observed when the experiment was conducted not only in liver cell lines but also in the entire cell lines belonging to the data used (Supplementary Figure S4). Therefore, we showed the effectiveness of the multi-dimensional ruler in stratifying toxic and non-toxic samples at the transcriptomic level.
Figure 3. Distance comparison between toxic and non-toxic samples. The bar plot shows the distances between samples of toxic drugs (red) and samples of drugs with unknown toxicity (blue) in liver cell lines.
3.2 Comparison of dose-dependent liver toxicity
Our research goal is not only to measure the five-dimensional toxic mechanisms in drug-treated transcriptomic samples, but also to facilitate the understanding of toxicity mechanisms using the multi-dimensional ruler in the form of the radar chart. We explored the drug treated environment, especially the sample distance according to the drug dose. For dose-dependence analysis, a total of 198 combinations of 33 drugs with five or more drug dose points in fixed cell line and time point were used. Under the assumption that toxicity increases with increasing drug dose (Schenker et al., 1999; Hoofnagle and Björnsson, 2019), the correlation between dose and distance was calculated, and then evaluated that higher correlation resulted in better results. We compared MDTR distance with three methods: (1) Mathematical distances (Cosine, Euclidean, and Mahalanobis) calculated from the expression values of all genes in LINCS, (2) Transcriptomic Signature Distance (TSD, Manatakis et al. (2020)), and (3) Pathway Activity Score Learning (PASL, Karagiannaki et al. (2023)) with mathematical distances similar to (1) (details in Supplementary Methods).
As shown in Table 2, MDTR outperformed the other distance methods, achieving 70% positive ratio. Additionally, MDTR consistently showed the highest positive correlation ratio even when the number of drug dose points was three or four (Supplementary Table S7). Notably, the Mahalanobis distance yielded better results than Cosine and Euclidean distances. In addition, distances calculated based on biological information (MDTR and PASL) exhibited better performances compared to using the entire genes without the prior knowledge.
Table 2. Performance comparison for dose-distance relationship. The table shows the ratio (in numbers) of samples exhibiting a positive correlation between dose and distance among the 198 samples for MDTR and three comparison methods. Bold values indicate the best results.
The main advantages of MDTR are that it not only provides distances indicating the degree of toxicity depending on the drug-treated environment but also allows interpretation of the mechanisms of liver toxicity. Among drugs that show a positive correlation with MDTR and dose, we conducted case studies for the following three drugs: Doxorubicin, Mitoxantrone, and Rosiglitazone.
Case 1. We investigated Doxorubicin, a chemotherapy medication widely used for the cancer treatments, including breast and bladder cancer, while also known for its hepatotoxicity (Prasanna et al., 2020). In the MCF7 cell line and at the 6-h time point, Doxorubicin exhibited dose-dependent toxicity signature (Figure 4,
Figure 4. The relationship between the distance on the multi-dimensional ruler and the dose. The box plots and radar charts show the change in distance based on the dose of drug treatment under fixed cell line and time point. (A) Doxorubicin at MCF7 cell line and 6-hour, (B) Mitoxantrone at PC3 cell line and 24-hour, and (C) Rosiglitazone at PC3 cell line and 24-hour. Except the zoomed radar chart, all radar charts exhibit a scale ranging from a minimum of 0 to a maximum of 1.
Case 2. We investigated the potential toxicity of Mitoxantrone, a cytotoxic agent and anthracenedione analogue of Doxorubicin (Fox, 2004), which exhibits similar metabolic pathways (Paciucci and Sklarin, 1986; Llesuy and Arnaiz, 1990; Rossato et al., 2014a). Mitoxantrone, despite being classified with a likelihood score of D in the LiverTox database, showed a positive correlation between distance and dose across all combinations of MCF7, PC3, and A549 cell lines and 6 and 24 h time points, similar to Doxorubicin (Supplementary Table S9). For example, Figure 4B illustrates the dose-dependent changes in distance for Mitoxantrone in the PC3 cell line at the 24-hour time point (
In particular, the toxicity of the sample showed a non-monotonic tendency that decreased in the early stage of dose escalation (from 0.1 to 0.5
Furthermore, we performed the MDTR on an unseen drug, Rosiglitazone, a drug that has a lack of medical reports concerning liver toxicity and was not utilized in the generation of dual-boundaries or the calculation of the ruler.
Case 3. We investigated the liver toxicity of Rosiglitazone, a thiazolidinedione drug used for the treatment of type 2 diabetes treatment but discontinued in many countries due to cardiovascular risks (Nissen and Wolski, 2007). While its counterpart Troglitazone has been withdrawn from the market due to severe hepatotoxicity (Lloyd et al., 2002; Kaplowitz, 2005), limited research exists on the hepatotoxic effects of Rosiglitazone. To address this gap, we employed the multi-dimensional ruler to evaluate its toxicity. In the PC3 cell line at the 24-h time point, Figure 4C demonstrates an increase in toxic mechanisms corresponding to higher drug doses. Specifically, significant variations were observed in Bile acids accumulation and Lipid metabolism in response to varying doses. These findings align with known issues associated with Rosiglitazone, such as disturbances in fatty acid and triglyceride metabolism (Gershell, 2005; Tan et al., 2005; Watkins et al., 2002), as well as intrahepatic cholestasis (Zhang et al., 2020).
Through comparative and case study results, we show that MDTR is an interpretable method that can capture both known and potentially mechanisms of toxicity, while exhibiting a dose-dependent distance to hepatotoxicity.
3.3 MoA interpretation: Oxidative stress as a case study
Oxidative stress, one of the axis of MDTR, arises from an imbalance between oxygen-reactive species (ROS) generation and accumulation (Figure 5A, Jaeschke et al. (2002), Jaeschke et al. (2012)). The accumulation of ROS, triggered by factors such as AMP-activated protein kinase (AMPK) activation (Kang et al., 2016; Steinberg and Hardie, 2022) and an imbalance in glutathione (GSH) and glutathione disulfide (GSSG) levels (DeLeve and Kaplowitz, 1991; Yuan and Kaplowitz, 2009), promotes inflammation, disrupts cellular energy regulation, depletes antioxidants, impairs protein handling, and induces hepatocellular apoptosis, ultimately leading to liver damage.
Figure 5. MoA of Oxidative stress (A) Conceptual illustration of liver injury caused by oxidative stress mechanism. (B) Co-expression network according to ROS pathway distance group. (C) GO biological process enrichment analysis according to oxidative stress mechanism distance group. The x-axis is represents -log (Adjusted p-value) and the dotted line represents -log (0.05).
To explore the biological implications of pathway-level distances within the mechanism, we focused on the Chemical carcinogenesis - ROS pathway (hsa05208, hereafter referred to as the ROS pathway), which is one of the six pathways chosen for the oxidative stress mechanism. We divided the samples evenly into four groups (Low, Middle1, Middle2, and High) based on each pathway distance. Then, we performed co-expression network analyses based on the expression values of samples within each group (Figure 5B). As a result, larger distances correspond to large co-expression network sizes, indicating that increased distance is associated with more interactions between genes and, simultaneously, a higher degree of gene dysregulation. For example, during the transition from the Low group to the Middle1 group, downregulation of MAPK8 (also known as JNK) may hinder the expression of the AP-1 transcription factor, potentially impacting the MAPK signaling pathway (Turner et al., 2014). Furthermore, as the network progressed from the Middle1 to Middle2 and High groups, upregulation of genes associated with PI3K signaling (e.g., PIK3CA or PIK3CD) and NADPH oxidase (e.g., NOX4) was observed, indicating ROS production (Koundouros and Poulogiannis, 2018). These findings validate the efficacy of the proposed pathway-level distance in capturing biological activity. Similar results were observed for the remaining pathways (Supplementary Figures S5-S9).
We next calculated the mechanism-level distance of oxidative stress as a weighted sum of the six related pathways, including the ROS pathway. To explore the association between oxidative stress signaling and the mechanism-level distance of samples, we performed GO enrichment analysis using commonly perturbed genes from the respective group. Figure 5C and Supplementary Figure S10 illustrate that as the distance increases, the GO terms related to oxidative stress, such as response to endoplasmic reticulum stress and cellular response to oxidative stress, exhibit more significant and abundant enrichment. These GO enrichment analysis results were consistently observed across all the mechanisms (Supplementary Figures S11-S14). Thus, our findings demonstrate that both pathway- and mechanism-level distance effectively reflect the activity level of the corresponding mechanism as the distance increases.
3.4 A web service for MDTR
Our model is implemented as a freely available website (http://biohealth.snu.ac.kr/software/MDTR). The website allows for our MDTR to be utilized from two perspectives (Supplementary Figure S15): (1) detection of hepatotoxicity from the user’s drug-induced transcriptome data, (2) investigation of hepatotoxicity through drug-induced transcriptome data in the LINCS database by selecting any combinations of drugs, tissues, and cell lines.
4 Conclusion
In this study, we introduced MDTR that quantifies the degree of liver toxicity in terms of five hepatotoxicity mechanisms by analyzing transcriptome data. An important contribution of this study is that it sought to translate transcriptome data to toxicity-related mechanisms between transcriptome data and DILI mechanisms. To understand the DILI mechanisms at the transcriptomic level, MDTR integrated the KEGG pathways, mapped the transcriptome data to the pathway-specific kernel space to measure the distance, and aggregated the pathway-level distance to measure the mechanism-level distance. MDTR explores complex genetic relationships in a non-linear RBF kernel space constructed from biological pathways, while at the same time having the explanatory power of DILI through a five-dimensional radar chart where integrated pathways are represented on one axis.
We showed that MDTR represents dose-dependent liver toxicity compared to existing models that measure distance or similarity of transcriptome data. In addition, through the case studies, we showed the ability of MDTR to interpret dose-dependent DILI not only for drugs with known liver toxicity but also for drugs with no reported liver toxicity. Furthermore, MDTR measures distances for different drug-treated environments, especially for different treatment doses, thereby capturing not only monotonic drug responses but also non-monotonic phenomena such as hormesis. Lastly, we provided a user-friendly and freely accessible website, enabling users to easily measure DILI in drug-induced transcriptome data. Therefore, MDTR serves as both an interpretable and computational method, addressing the limitations of existing studies that relied only on experimental and literature information to measure potential drug toxicity across various treatment environments.
While we have expanded the gene set based on the functional analysis of genes curated from the CTD, future studies may further enhance this gene set by including genes with correlated expression levels within the dataset. Moreover, although the current study involved a manual curation process to identify liver toxicity mechanisms and their associated biological pathways from the literature, we plan to conduct a more comprehensive analysis of liver toxicity by utilizing additional external resources, such as Ingenuity Pathway Analysis (IPA).
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Author contributions
IS: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing–original draft, Writing–review and editing. SaL: Conceptualization, Formal analysis, Investigation, Methodology, Writing–original draft, Writing–review and editing. DB: Conceptualization, Formal analysis, Investigation, Writing–original draft, Writing–review and editing. JY: Investigation, Software, Visualization, Writing–original draft, Writing–review and editing. SuL: Conceptualization, Funding acquisition, Resources, Writing–original draft, Writing–review and editing. SK: Conceptualization, Funding acquisition, Resources, Supervision, Writing–original draft, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by a grant (No. DY0002259501) from Ministry of food and Drug Safety, the Bio and Medical Technology Development Program of the National Research Foundation (NRF) funded by the Ministry of Science and ICT (NRF-2022M3E5F3085677), the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (RS-2022-NR067933 and No. NRF-2023R1A2C2006953), Institute of Information and communications Technology Planning and Evaluation (IITP) grant funded by the Korea government (MSIT) [NO. 2021-0-01343, Artificial Intelligence Graduate School Program (Seoul National University)], the ICT at Seoul National University, and AIGENDRUG Co. Ltd.
Conflict of interest
Authors DB, SuL, and SK were employed by AIGENDRUG Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2024.1398370/full#supplementary-material
References
Andersen, M. E., and Krewski, D. (2009). Toxicity testing in the 21st century: bringing the vision to life. Toxicol. Sci. 107, 324–330. doi:10.1093/toxsci/kfn255
Andrade, R. J., Chalasani, N., Björnsson, E. S., Suzuki, A., Kullak-Ublick, G. A., Watkins, P. B., et al. (2019). Drug-induced liver injury. Nat. Rev. Dis. Prim. 5, 58–22. doi:10.1038/s41572-019-0105-0
Barretina, J., Caponigro, G., Stransky, N., Venkatesan, K., Margolin, A. A., Kim, S., et al. (2012). The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607. doi:10.1038/nature11003
Bracken, M. B. (2009). Why animal studies are often poor predictors of human reactions to exposure. J. R. Soc. Med. 102, 120–122. doi:10.1258/jrsm.2008.08k033
Chapman, K. L., Holzgrefe, H., Black, L. E., Brown, M., Chellman, G., Copeman, C., et al. (2013). Pharmaceutical toxicology: designing studies to reduce animal use, while maximizing human translation. Regul. Toxicol. Pharmacol. 66, 88–103. doi:10.1016/j.yrtph.2013.03.001
Chen, M., Suzuki, A., Thakkar, S., Yu, K., Hu, C., and Tong, W. (2016). Dilirank: the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans. Drug Discov. Today 21, 648–653. doi:10.1016/j.drudis.2016.02.015
Chen, P., Li, D., Chen, Y., Sun, J., Fu, K., Guan, L., et al. (2017). p53-mediated regulation of bile acid disposition attenuates cholic acid-induced cholestasis in mice. Br. J. Pharmacol. 174, 4345–4361. doi:10.1111/bph.14035
Chovatiya, R., and Medzhitov, R. (2014). Stress, inflammation, and defense of homeostasis. Mol. Cell 54, 281–288. doi:10.1016/j.molcel.2014.03.030
Davis, A. P., Grondin, C. J., Johnson, R. J., Sciaky, D., Wiegers, J., Wiegers, T. C., et al. (2021). Comparative toxicogenomics database (ctd): update 2021. Nucleic acids Res. 49, D1138–D1143. doi:10.1093/nar/gkaa891
DeLeve, L. D., and Kaplowitz, N. (1991). Glutathione metabolism and its role in hepatotoxicity. Pharmacol. and Ther. 52, 287–305. doi:10.1016/0163-7258(91)90029-l
Dix, D. J., Houck, K. A., Martin, M. T., Richard, A. M., Setzer, R. W., and Kavlock, R. J. (2007). The toxcast program for prioritizing toxicity testing of environmental chemicals. Toxicol. Sci. 95, 5–12. doi:10.1093/toxsci/kfl103
Fox, E. J. (2004). Mechanism of action of mitoxantrone. Neurology 63, S15–S18. doi:10.1212/wnl.63.12_suppl_6.s15
Garnett, M. J., Edelman, E. J., Heidorn, S. J., Greenman, C. D., Dastur, A., Lau, K. W., et al. (2012). Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature 483, 570–575. doi:10.1038/nature11005
Han, H., Desert, R., Das, S., Song, Z., Athavale, D., Ge, X., et al. (2020). Danger signals in liver injury and restoration of homeostasis. J. hepatology 73, 933–951. doi:10.1016/j.jhep.2020.04.033
Hoofnagle, J. H. (2013). “Livertox: a website on drug-induced liver injury,” in Drug-induced liver disease (Elsevier), 725–732.
Hoofnagle, J. H., and Björnsson, E. S. (2019). Drug-induced liver injury—types and phenotypes. N. Engl. J. Med. 381, 264–273. doi:10.1056/NEJMra1816149
Jaeschke, H., Gores, G. J., Cederbaum, A. I., Hinson, J. A., Pessayre, D., and Lemasters, J. J. (2002). Mechanisms of hepatotoxicity. Toxicol. Sci. 65, 166–176. doi:10.1093/toxsci/65.2.166
Jaeschke, H., McGill, M. R., and Ramachandran, A. (2012). Oxidant stress, mitochondria, and cell death mechanisms in drug-induced liver injury: lessons learned from acetaminophen hepatotoxicity. Drug metab. Rev. 44, 88–106. doi:10.3109/03602532.2011.602688
Kanehisa, M. (2002). “The kegg database,” in In Silico’Simulation of biological processes: novartis foundation symposium. Editors G. R. Bock, and J. A. Goode (Wiley Online Library), 91–103.
Kang, S. W. S., Haydar, G., Taniane, C., Farrell, G., Arias, I. M., Lippincott-Schwartz, J., et al. (2016). Ampk activation prevents and reverses drug-induced mitochondrial and hepatocyte injury by promoting mitochondrial fusion and function. PloS one 11, e0165638. doi:10.1371/journal.pone.0165638
Kaplowitz, N. (2005). Idiosyncratic drug hepatotoxicity. Nat. Rev. Drug Discov. 4, 489–499. doi:10.1038/nrd1750
Karagiannaki, I., Gourlia, K., Lagani, V., Pantazis, Y., and Tsamardinos, I. (2023). Learning biologically-interpretable latent representations for gene expression data: pathway activity score learning algorithm. Mach. Learn. 112, 4257–4287. doi:10.1007/s10994-022-06158-z
Kharasch, E. D., and Novak, R. (1985). Mitoxantrone and ametantrone inhibit hydroperoxide-dependent initiation and propagation reactions in fatty acid peroxidation. J. Biol. Chem. 260, 10645–10652. doi:10.1016/s0021-9258(19)85133-5
Koundouros, N., and Poulogiannis, G. (2018). Phosphoinositide 3-kinase/akt signaling and redox metabolism in cancer. Front. Oncol. 8, 160. doi:10.3389/fonc.2018.00160
Krewski, D., Acosta, D., Andersen, M., Anderson, H., Bailar, J. C., Boekelheide, K., et al. (2010). Toxicity testing in the 21st century: a vision and a strategy. J. Toxicol. Environ. Health, Part B 13, 51–138. doi:10.1080/10937404.2010.483176
Llesuy, S. F., and Arnaiz, S. L. (1990). Hepatotoxicity of mitoxantrone and doxorubicin. Toxicology 63, 187–198. doi:10.1016/0300-483x(90)90042-f
Lloyd, S., Hayden, M. J., Sakai, Y., Fackett, A., Silber, P. M., Hewitt, N. J., et al. (2002). Differential in vitro hepatotoxicity of troglitazone and rosiglitazone among cryopreserved human hepatocytes from 37 donors. Chemico-biological Interact. 142, 57–71. doi:10.1016/s0009-2797(02)00054-6
Manatakis, D. V., VanDevender, A., and Manolakos, E. S. (2020). An information-theoretic approach for measuring the distance of organ tissue samples using their transcriptomic signatures. Bioinformatics 36, 5194–5204. doi:10.1093/bioinformatics/btaa654
Niepel, M., Hafner, M., Duan, Q., Wang, Z., Paull, E. O., Chung, M., et al. (2017). Common and cell-type specific responses to anti-cancer drugs revealed by high throughput transcript profiling. Nat. Commun. 8, 1186. doi:10.1038/s41467-017-01383-w
Nissen, S. E., and Wolski, K. (2007). Effect of rosiglitazone on the risk of myocardial infarction and death from cardiovascular causes. N. Engl. J. Med. 356, 2457–2471. doi:10.1056/NEJMoa072761
Paciucci, P. A., and Sklarin, N. T. (1986). Mitoxantrone and hepatic toxicity. Ann. Intern. Med. 105, 805–806. doi:10.7326/0003-4819-105-5-805_3
Prasanna, P. L., Renu, K., and Gopalakrishnan, A. V. (2020). New molecular and biochemical insights of doxorubicin-induced hepatotoxicity. Life Sci. 250, 117599. doi:10.1016/j.lfs.2020.117599
Rossato, L. G., Costa, V. M., Dallegrave, E., Arbo, M., Dinis-Oliveira, R. J., Santos-Silva, A., et al. (2014a). Cumulative mitoxantrone-induced haematological and hepatic adverse effects in a subchronic in vivo study. Basic and Clin. Pharmacol. and Toxicol. 114, 254–262. doi:10.1111/bcpt.12143
Rossato, L. G., Costa, V. M., Dallegrave, E., Arbo, M., Silva, R., Ferreira, R., et al. (2014b). Mitochondrial cumulative damage induced by mitoxantrone: late onset cardiac energetic impairment. Cardiovasc. Toxicol. 14, 30–40. doi:10.1007/s12012-013-9230-2
Schenker, S., Martin, R. R., and Hoyumpa, A. M. (1999). Antecedent liver disease and drug toxicity. J. hepatology 31, 1088–1097. doi:10.1016/S0168-8278(99)80325-0
Steinberg, G. R., and Hardie, D. G. (2022). New insights into activation and function of the ampk. Nat. Rev. Mol. Cell Biol. 24, 255–272. doi:10.1038/s41580-022-00547-x
Subramanian, A., Narayan, R., Corsello, S. M., Peck, D. D., Natoli, T. E., Lu, X., et al. (2017). A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 171, 1437–1452. doi:10.1016/j.cell.2017.10.049
Szalai, B., Subramanian, V., Holland, C. H., Alföldi, R., Puskás, L. G., and Saez-Rodriguez, J. (2019). Signatures of cell death and proliferation in perturbation transcriptomics data—from confounding factor to effective prediction. Nucleic Acids Res. 47, 10010–10026. doi:10.1093/nar/gkz805
Tan, G., Fielding, B., Currie, J., Humphreys, S., Desage, M., Frayn, K., et al. (2005). The effects of rosiglitazone on fatty acid and triglyceride metabolism in type 2 diabetes. Diabetologia 48, 83–95. doi:10.1007/s00125-004-1619-9
Tang, S., Li, S., Tang, B., Wang, X., Xiao, Y., and Cheke, R. A. (2023). Hormetic and synergistic effects of cancer treatments revealed by modelling combinations of radio-or chemotherapy with immunotherapy. BMC cancer 23, 1040. doi:10.1186/s12885-023-11542-6
Tax, D. M., and Duin, R. P. (2004). Support vector data description. Mach. Learn. 54, 45–66. doi:10.1023/b:mach.0000008084.60811.49
Turner, M. D., Nedjai, B., Hurst, T., and Pennington, D. J. (2014). Cytokines and chemokines: at the crossroads of cell signalling and inflammatory disease. Biochimica Biophysica Acta (BBA)-Molecular Cell Res. 1843, 2563–2582. doi:10.1016/j.bbamcr.2014.05.014
Van Norman, G. A. (2019). Limitations of animal studies for predicting toxicity in clinical trials: is it time to rethink our current approach? JACC Basic Transl. Sci. 4, 845–854. doi:10.1016/j.jacbts.2019.10.008
Wallace, K. B., Sardão, V. A., and Oliveira, P. J. (2020). Mitochondrial determinants of doxorubicin-induced cardiomyopathy. Circulation Res. 126, 926–941. doi:10.1161/CIRCRESAHA.119.314681
Watkins, S. M., Reifsnyder, P. R., Pan, H. J., German, J. B., and Leiter, E. H. (2002). Lipid metabolome-wide effects of the PPARgamma agonist rosiglitazone. J. lipid Res. 43, 1809–1817. doi:10.1194/jlr.m200169-jlr200
Weaver, R. J., Blomme, E. A., Chadwick, A. E., Copple, I. M., Gerets, H. H., Goldring, C. E., et al. (2020). Managing the challenge of drug-induced liver injury: a roadmap for the development and deployment of preclinical predictive models. Nat. Rev. Drug Discov. 19, 131–148. doi:10.1038/s41573-019-0048-x
Wilke, R. A., Lin, D. W., Roden, D. M., Watkins, P. B., Flockhart, D., Zineh, I., et al. (2007). Identifying genetic risk factors for serious adverse drug reactions: current progress and challenges. Nat. Rev. Drug Discov. 6, 904–916. doi:10.1038/nrd2423
Yuan, L., and Kaplowitz, N. (2009). Glutathione in liver diseases and hepatotoxicity. Mol. aspects Med. 30, 29–41. doi:10.1016/j.mam.2008.08.003
Keywords: drug-induced liver injury, one-class boundary, kernel distance, transcriptomic signature, degree of toxicity, liver toxicity
Citation: Sung I, Lee S, Bang D, Yi J, Lee S and Kim S (2025) MDTR: a knowledge-guided interpretable representation for quantifying liver toxicity at transcriptomic level. Front. Pharmacol. 15:1398370. doi: 10.3389/fphar.2024.1398370
Received: 09 March 2024; Accepted: 27 December 2024;
Published: 24 January 2025.
Edited by:
Ursula Gundert-Remy, Charité University Medicine Berlin, GermanyReviewed by:
Sudin Bhattacharya, Michigan State University, United StatesFei Chen, Chinese Academy of Sciences and China National Center for Bioinformation, China
Copyright © 2025 Sung, Lee, Bang, Yi, Lee and Kim. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sun Kim, c3Vua2ltLmJpb2luZm9Ac251LmFjLmty
†Present address: Inyoung Sung, BK21 FOUR Intelligence Computing, Seoul National University, Seoul, South Korea
Sangseon Lee, Department of Artificial Intelligence, Inha University, Incheon, South Korea
‡These authors have contributed equally to this work