XMR: an explainable multimodal neural network for drug response prediction

Wang, Zihao; Zhou, Yun; Zhang, Yu; Mo, Yu K.; Wang, Yijie

doi:10.3389/fbinf.2023.1164482

ORIGINAL RESEARCH article

Front. Bioinform., 02 August 2023

Sec. Computational BioImaging

Volume 3 - 2023 | https://doi.org/10.3389/fbinf.2023.1164482

XMR: an explainable multimodal neural network for drug response prediction

Zihao Wang¹^†

Yun Zhou²^†

Yu Zhang³^†

Yu K. Mo¹^†

Yijie Wang¹*

¹Department of Computer Science, Indiana University Bloomington, Bloomington, IN, United States
²Department of Environmental and Occupational Health, School of Public Health, Indiana University Bloomington, Bloomington, IN, United States
³Department of Epidemiology and Biostatistics, School of Public Health, Indiana University Bloomington, Bloomington, IN, United States

Introduction: Existing large-scale preclinical cancer drug response databases provide us with a great opportunity to identify and predict potentially effective drugs to combat cancers. Deep learning models built on these databases have been developed and applied to tackle the cancer drug-response prediction task. Their prediction has been demonstrated to significantly outperform traditional machine learning methods. However, due to the “black box” characteristic, biologically faithful explanations are hardly derived from these deep learning models. Interpretable deep learning models that rely on visible neural networks (VNNs) have been proposed to provide biological justification for the predicted outcomes. However, their performance does not meet the expectation to be applied in clinical practice.

Methods: In this paper, we develop an XMR model, an eXplainable Multimodal neural network for drug Response prediction. XMR is a new compact multimodal neural network consisting of two sub-networks: a visible neural network for learning genomic features and a graph neural network (GNN) for learning drugs’ structural features. Both sub-networks are integrated into a multimodal fusion layer to model the drug response for the given gene mutations and the drug’s molecular structures. Furthermore, a pruning approach is applied to provide better interpretations of the XMR model. We use five pathway hierarchies (cell cycle, DNA repair, diseases, signal transduction, and metabolism), which are obtained from the Reactome Pathway Database, as the architecture of VNN for our XMR model to predict drug responses of triple negative breast cancer.

Results: We find that our model outperforms other state-of-the-art interpretable deep learning models in terms of predictive performance. In addition, our model can provide biological insights into explaining drug responses for triple-negative breast cancer.

Discussion: Overall, combining both VNN and GNN in a multimodal fusion layer, XMR captures key genomic and molecular features and offers reasonable interpretability in biology, thereby better predicting drug responses in cancer patients. Our model would also benefit personalized cancer therapy in the future.

1 Introduction

Precision medicine is a key challenge in this century, with a focus on personalized cancer treatments. Precision medicine aims to design treatments specific to a patient’s molecular profile, improving outcomes. This relies on effectively using clinical, genomics, and other “omics” data to identify prognostic and predictive biomarkers. Another important task for precision oncology is to generate drug response profiles across drugs and cancer subtypes. Large-scale drug screening initiatives (Barretina et al., 2012; Yang et al., 2012; Basu et al., 2013; Seashore-Ludlow et al., 2015) have made data publicly available, enabling the identification of biomarkers and the development of predictive models like elastic net and random forest (Iorio et al., 2016). However, the task of predicting drug response is complex due to the genetic heterogeneity among cancer patients, which presents a major obstacle in determining therapeutic efficacy (Bedard et al., 2013; Dagogo-Jack and Shaw, 2018; Fittall and Van Loo, 2019; Lim and Ma, 2019; Ramón y Cajal et al., 2020). Despite advances in the field, there is still a need for further improvement in the accuracy and reliability of drug response models. Deep learning (DL) is well suited for drug response prediction, as it can handle large amounts of high-dimensional data and capture non-linear relationships in biological data better than other machine learning algorithms. DL has been successful in a variety of drug discovery tasks and may outperform traditional machine learning approaches in drug response prediction (Yuan et al., 2016; Luo et al., 2019; Sun et al., 2019; Jiao et al., 2020) despite being underexplored until recently.

A challenge in drug response prediction is to accurately represent both the genotype and chemical structures of drugs. However, most studies have focused on enhancing genotype representation while neglecting the chemical side, resulting in models with strong genotypic embedders and weak chemical embedders (Kuenzi et al., 2020; Huang X. et al., 2021). However, this imbalance can negatively impact performance as the chemical structure of drugs contains valuable information that requires a stronger embedder, while the genotypic information is prone to overfitting and, thus, needs a lighter architecture for better generalizability. To address this issue, in this paper, we develop an XMR model, an eXplainable Multimodal neural network for drug Response prediction. Our approach emphasizes the importance of having a powerful chemical embedder while keeping the genotypic embedder relatively lightweight. To achieve this, XMR is structured as a multimodal neural network with two sub-networks: a visible neural network (VNN) for capturing genomic features and a graph neural network (GNN) for learning the structural features of drugs. To enhance the generalizability of genotypic embedding, the VNN is further pruned to form a more compact structure.

In this study, we used XMR to construct cancer-specific models for triple-negative breast cancer (TNBC) to gain a deeper insight into its biological mechanisms. The XMR models were built based on five key pathways (cell cycle, DNA repair, diseases, signaling transduction, and metabolism). These models were trained on TNBC-specific data obtained from the Cancer Therapeutics Response Portal (CTRP) v2 (Seashore-Ludlow et al., 2015) and the Genomics of Drug Sensitivity in Cancer (GDSC) database (Yang et al., 2012). To demonstrate the effectiveness of XMR’s models, we compared their predictive accuracy to several state-of-the-art methods using validation samples. Our results showed that XMR outperforms these methods with significantly higher test accuracy. In addition, we evaluated the explainability of XMR. We found that our model was able to capture the commonly mutated TNBC-related genes, several critical pathways (e.g., G2/M checkpoint, PI3K/mTOR, and MAPK pathways), and novel drugs that would provide insights into TNBC treatment (e.g., dinaciclib, panobinostat, BI 2536, and AZD7762).

2 Materials and methods

2.1 Taxonomy of multimodal models

In this paper, we formulate the drug response prediction task as a multimodal learning task, utilizing two forms of information: the genotype represented by binary mutations and the chemical structure of drugs. The current research focusing on multimodal models is primarily developing models that can effectively combine and process information from multiple modalities such as audio, text, images, and video. We believe that the insights and results from developing multimodal models for vision and language tasks can be effectively applied to the task of predicting drug response.

A taxonomy of multimodal models is proposed based on two factors: 1) the task, which can be vision and language tasks, or the drug response prediction task, and 2) the expressiveness level of the two modalities in terms of dedicated parameters or computation. This results in four archetypes as shown in Figure 1.

FIGURE 1

FIGURE 1. Six categories of multimodal models, with the height of each rectangle indicating its comparative computational size. (A) Twin tower model in vision-and-language domain. (B) Shallow model in vision-and-language domain. (C) Vision-and-language model with heavy textual embedder. (D) Model with heavy genotype embedder. (E) Shallow model in drug response prediction domain. (F) Model with heavy chemical embedder.

The top three archetypes are vision-and-language models. CLIP (Radford et al., 2021) is a typical twin tower model, as shown in Figure 1A, as it employs separate but similarly expensive embedders for each modality. Despite CLIP’s remarkable zero-shot performance in image-to-text retrieval, its performance was not as strong as other vision-and-language downstream tasks. ViLT (Kim et al., 2021) is a (Figures 1A, B) shallower and computationally lighter model with shallow embedding layers for raw pixels and text tokens. Most computations of ViLT focus on modeling modality interactions. This simple architecture provides faster inference time, but it has a slow training process due to its light visual embedder. Its performance is also limited in many tasks. Most state-of-the-art models (Lu et al., 2019; Chen et al., 2020; Li et al., 2021) belong to the archetype shown in Figure 1C, with a visual embedder much heavier than the textual embedder. This type of model generally achieves the highest performance in various vision-and-language tasks. This demonstrates that most vision-and-language tasks necessitate a powerful visual feature extractor, i.e., a heavier visual embedder, with the textual embedder being relatively lightweight. Intuitively, in the field of drug response prediction, the features of drugs can be compared to visual features as the chemical structure holds rich information, similar to images. On the other hand, genotypic features can be related to textual features as they are both binary and discrete. Hence, our hypothesis is that a successful model for drug response prediction should have a heavy chemical embedder with a relatively light genotypic embedder.

The bottom three archetypes are for drug response prediction. DrugCell (Kuenzi et al., 2020) falls under the archetype shown in Figure 1D and utilizes a deep visible neural network to extract features from genotypes and a simple multilayer perceptron (MLP) to extract features from the Morgan fingerprint of drugs. The VNN maps the neurons of a deep neural network into potential molecular components and pathways in a biological structure, which is commonly used in various cancer studies (Ma et al., 2018; Wang et al., 2018; Kuenzi et al., 2020; Elmarakeby et al., 2021). However, it is often deep and substantial due to its utilization of biological networks. ParsVNN (Huang X. et al., 2021), which falls under the archetype shown in Figure 1E, improves upon DrugCell by using a sparse learning approach to learn a simplified VNN that only contains biological architectures most relevant to the prediction task. This results in ParsVNN having a better performance than DrugCell, which confirms our hypothesis of having a lighter genotypic embedder in drug response prediction models. Our proposed XMR, belonging to the archetype shown in Figure 1F, is the first model of its kind. It follows our hypothesis that a successful model should have a lightweight genotypic embedder and a relatively heavy chemical embedder. To implement this, XMR uses a deep graph neural network to extract more complex information from the chemical structure while following a design similar to ParsVNN for the genotypic embedder.

2.2 Model architecture

The model is structured as a multimodal neural network with two sub-networks: a visible neural network to capture genomic features and a graph neural network to learn the structural features of drugs (as illustrated in Figure 2). We followed the method described by Kuenzi et al. (2020) to build VNN embedding. Briefly, the VNN model establishes a connection between gene-level data and their associated phenotypic response in a cell. The VNN architecture resembles the hierarchical structure of cellular molecular subsystems, where artificial neurons represent molecular events and edges represent the connectivity among a series of related molecular events. The hierarchical structure of the VNN was created using pathways related to the cell cycle, DNA repair, diseases, signal transduction, and metabolism, respectively, as documented in the Reactome database (Fabregat et al., 2018). Each term in the pathway is represented by a hidden layer, and the hidden layers are interconnected precisely according to the molecular subsystems.

FIGURE 2

FIGURE 2. Overview of the proposed XMR architecture. A combination of genotypic and chemical embeddings, produced by a VNN (left) and a GNN (right), is concatenated and fed into a MLP layer for drug response prediction. The VNN architecture is represented by black arrows linking genes and yellow arrows representing molecular subsystems.

In more detail, the embedding for each term ν_i is composed of gene neurons, $ν_{i}^{gene} = {ν_{1}^{gene}, \dots, ν_{p}^{gene}}$ , which take the genes directly connected to this term as an input, and subsystem neurons, $ν_{i}^{sub} = {ν_{1}^{sub}, \dots, ν_{q}^{sub}}$ , which take the outputs of its child terms as an input. That is, $ν_{i} = [ν_{i}^{gene} : ν_{i}^{sub}]$ . Then, the final genotypic embedding is represented by the embedding of the root term, i.e., y_genotype = ν_root.

To learn chemical embedding, we used the GNN, which regards each atom of a compound as a node. The atoms can exchange information through their chemical bonds. The fundamental concept of the GNN is to iteratively gather information from the neighbors of each node (i.e., atom) so that each individual atom is aware of the molecular substructures surrounding it. To address the challenges of limited learning parameters and ineffective embedding learning due to the insufficient number of atom and bond types in the molecule, we followed the method described in Costa and De Grave (2010), which embeds compounds using r radius subgraphs, which are induced by neighboring vertices and edges within a radius of r from a vertex. In detail, a graph is represented as G = (V, E), where V is the set of vertices and E is the set of edges. In a molecule, v_i ∈ V represents the ith atom, and e_ij ∈ E represents the chemical bond between the ith and jth atoms. Given a graph G = (V, E), we represent a set of all neighboring vertex indices within a radius of r from the ith vertex as $v_{i}^{r}$ . Then, the r-radius subgraph for vertex v_i is defined as $G_{sub} (v_{i}, r) = (v_{i}^{r}, e_{i}^{r})$ , where $e_{i}^{r} = {e_{m n} \in E ∣ (m, n) \in v_{i}^{r} \times v_{i}^{r - 1}}$ . Additionally, the r-radius subgraph for edge e_ij is defined as $G_{sub} (e_{i j}, r) = (v_{i}^{r - 1} \cup v_{j}^{r - 1}, e_{i}^{r} \cap e_{j}^{r})$ . Each subgraph for the r-radius vertex and r-radius edge is then represented by a unique hidden vector.

Then, we describe the transition function for updating both the vertex and edge embeddings. Given a graph G and the initial embeddings of its vertices and edges, we represent the embedding of the ith vertex at time step t as $v_{i}^{(t)}$ . This embedding is updated using the following transition function:

v_{i}^{(t + 1)} = S i g m o i d (v_{i}^{(t)} + Σ_{j \in v_{i}^{r}} h_{i j}^{(t)}), (1)

where the sigmoid function is defined as $S i g m o i d (x) = \frac{1}{1 + e^{- x}}$ , $v_{i}^{r}$ is the set of indices of neighboring vertices of i, and $h_{i j}^{(t)}$ is the hidden neighborhood vector. This hidden vector is calculated by considering the neighboring vertex v_j and edge e_ij:

h_{i j}^{(t)} = R e L U (W [\begin{matrix} v_{j}^{(t)} \\ e_{i j}^{(t)} \end{matrix}] + b), (2)

where the ReLU function is defined as ReLU(x) = max(0, x), W is a weight matrix, b is a bias vector, and $e_{i j}^{(t)}$ is the edge embedding between the ith and jth vertices at time step t. By adding up the neighboring hidden vectors and iterating over time steps, the vertex embeddings can gradually accumulate more global information about the graph.

The procedure for updating edge embeddings is similar. Specifically, the edge embedding between the ith and jth vertices at time step t, $e_{i j}^{(t)}$ , is updated as follows:

e_{i j}^{(t + 1)} = S i g m o i d (e_{i j}^{(t)} + R e L U (W (v_{i}^{(t)} + v_{j}^{(t)}) + b)) . (3)

Thus, the final chemical embedding is obtained by taking the average of the vertex vectors obtained through the transition function, given the set $V = {v_{1}^{(t)}, v_{2}^{(t)}, \dots, v_{n}^{(t)}}$ , where n is the number of vertices in the molecular graph:

y_{drug} = \frac{1}{n} Σ_{i = 1}^{n} v_{i}^{(t)} . (4)

We then combine y_genotype and y_drug into a single vector, [y_genotype: y_drug], and input it into a multilayer perceptron to make the final prediction of the drug response.

To create a more compact genotypic embedder, we follow the approach described by Huang X. et al. (2021). This method aims to simplify the VNN architecture while retaining its ability to make accurate predictions. It is based on the idea that biological processes are complex and involve many components and that sparse coding can capture the most significant components that are more directly relevant to drug administration and treatment, compared to considering all potential processes. Starting from a VNN model, we treat each edge weight as a feature of the VNN and perform sparse learning to improve prediction accuracy and select important features. This helps eliminate redundant features and improve the explainability of the downstream analysis. To accomplish this, we utilize ℓ₀ norm regularization to prune edges between genes and subsystems and group LASSO regularization to remove edges between subsystems. The optimization problem is solved using the proximal alternating linearized minimization (PALM) algorithm (Bolte et al., 2014).

2.3 Explainability in XMR

When evaluating deep learning models, it is crucial to consider not only their prediction performance but also their ability to provide explanations. Explanations can come in two forms: global and local (Du et al., 2020). Global explanations offer a comprehensive understanding of how the model operates by examining its structure and parameters. Local explanations, on the other hand, focus on explaining why a specific prediction was made by the model by analyzing the causal relationship between the input and the prediction. Both types of explanations serve important purposes. Global explanations increase the transparency of deep learning models, while local explanations build trust in individual predictions. The XMR model focuses on global explainability, as it filters out the important pathways and genes that contribute the most to the prediction task for each cancer type and biological network. This provides insights into how XMR operates and offers guidance for building models for specific cancer types. Additionally, the model’s ability to predict drug response can be used to identify new drugs that may have a significant impact on a particular cancer type. We delve into the explainability provided by XMR in a later section. It is important to note that the explainability of the XMR model is not solely dependent on the VNN architecture but also on the overall architecture, since the model is trained end-to-end. The quality of the explainability is directly proportional to the model’s performance. The better the model, the more meaningful the guidance it can provide.

2.4 Dataset and splitting

We obtained the drug response data from GDSC (Yang et al., 2012) and the CTRP (Seashore-Ludlow et al., 2015). TNBC cell lines were selected according to the cell lines listed in Chavez et al. (2010) and Dai et al. (2017). A total of 22 TNBC cell lines were selected: BT20, BT549, CAL120, CAL148, CAL51, CAL851, DU4475, HCC1143, HCC1187, HCC1395, HCC1599, HCC1806, HCC1937, HCC2157, HCC38, HCC70, HDQP1, MDAMB157, MDAMB231, MDAMB436, MDAMB468, and MFM223. Those cell lines covered all the TNBC subtypes as described in Lehmann et al. (2011), including two basal-like (BL1 and BL2), an immunomodulatory (IM), a mesenchymal (M), a mesenchymal stem-like (MSL), and a luminal androgen receptor (LAR). The mutation status was collected from the DepMap portal (DepMap, 2022). A gene was selected if at least one of the chosen cell lines had a mutation on it. A total of 6,982 genes were identified. The mutation status of the gene was recorded as a binary variable and was either “1” for mutated or “0” for non-mutated. This procedure yielded 4,851 (cell line and drug) pairs for the final data, including 22 cell lines and 279 drugs. It was split into training and validation sets in an 8:2 proportion, resulting in 3,880 training samples and 971 validation samples. A separate test set was formed using all the (cell line and drug) pairs that were not present in the training and validation sets.

2.5 Construction of TNBC-specific XMR models

The VNN architecture in XMR was built based on five key biological networks (cell cycle, DNA repair, diseases, signaling transduction, and metabolism). Each term in the biological networks used in VNNs is comprised of a hidden layer with three neurons, while in the GNN, subgraphs with a radius of 2 are formed and represented by a hidden layer with 256 neurons. The drug response is measured using the area under the dose–response curve (AUC), where a lower AUC value indicates a more effective drug response, and normalization is carried out such that AUC = 0 represents complete cell death and AUC = 1 represents no effect. The prediction accuracy of XMR was evaluated using Spearman’s correlation between predicted and observed AUC values. The model was trained for 300 epochs with a batch size of 200, and a mean squared error loss was used, with an AdamW optimizer, with an initial learning rate of 0.005 and weight decay of 10^–5. The XMR model was implemented using the PyTorch library and trained on a GPU server with an NVIDIA Tesla V100 32 GB GPU and an Intel Xeon Gold 6248 CPU.

3 Results

3.1 Heavyweight chemical embedding is critical to model performance

Following the “Construction of TNBC-specific XMR models” section, we constructed five TNBC-specific XMR models and compared them with two state-of-the-art approaches: DrugCell (Kuenzi et al., 2020) and ParsVNN (Huang X. et al., 2021). The architecture of DrugCell is depicted in Figure 1D, and it utilizes the VNN architecture as the genotypic embedder and a MLP as the chemical embedder. ParsVNN was built based on DrugCell and made the VNN in DrugCell more compact via pruning the VNN architecture, resulting in the architecture shown in Figure 1E.

First, we observed that the TNBC-specific XMR models learned were highly compact, as illustrated in Figure 3. For instance, the XMR model constructed with the signal transduction biological network only had 13 terms and 121 genes remaining, which showed that approximately 96% of the terms and 88% of the genes were removed from the original VNN architecture. All the other TNBC-specific XMR models also had a limited number of terms and genes. This substantial reduction in the complexity of the genotypic embedder could enhance the generalizability of the VNN embedder.

FIGURE 3

FIGURE 3. Genotypic pathways after pruning. Each color represents a distinct hierarchy. The numbers near the root term describe the number of terms and genes left. (A) the cell cycle pathway. (B) DNA repair pathway. (C) Pathways for Growth factor receptor- and metabolism-mediated diseases. (D) Pathways for signaling transduction. (E) Pathways for metabolism.

We also compared the performance of the TNBC-specific XMR models with the rival methods in terms of accuracy on the validation set (Figure 4). The results showed that XMR outperformed the other two methods with a minimum advantage of 2.3%. Furthermore, we used the method proposed by Diedenhofen and Musch (2015) to test the hypothesis that the correlation (shown in Figure 4) obtained by our model is not larger than the correlation (shown in Figure 4) obtained by a competing method. We found that in all the tests we performed, the p-value is smaller than 0.01, indicating that the correlation obtained by our model is significantly larger than the correlation obtained by the competing methods. These results further support the hypothesis discussed in Section 2.1. We can see that both a simplified genotypic embedder and a heavyweight chemical embedder contribute to the performance.

FIGURE 4

FIGURE 4. Comparison of XMR’s performance with other competing methods for predicting drug response across five separate pathways. Spearman’s correlation (rho) between predicted and observed drug responses was used as an evaluation criterion.

To gain a deeper understanding of how the complexity of the genotypic embedder and the chemical embedder influences performance, we conducted an ablation study to evaluate the impact of the number of hidden neurons of each term in both embedders on performance. The results are displayed in Figure 5. Figure 5A illustrates the effect of the number of hidden neurons in the GNN on performance, and we can see that the performance increases with the number of hidden neurons in the GNN. The performance continues to improve even when the number of hidden neurons is increased to 512, indicating that a heavyweight chemical embedder is necessary. Figure 5B shows the impact of the number of hidden neurons in the VNN on performance, and we can see that the performance deteriorates when the number of hidden neurons grows from three to six, indicating that the VNN is highly susceptible to overfitting. Therefore, a pruning method would greatly benefit the VNN, leading to a more lightweight genotypic embedder. These phenomena also align with our hypothesis.

FIGURE 5

FIGURE 5. Impact of the number of hidden neurons on prediction accuracy. (A) the effect of the number of hidden neurons in the GNN on performance (B) the effect of the number of hidden neurons in the VNN on performance.

3.2 Interpretations of genes and pathways

To verify whether our XMR model can generate a reasonable gene-level explanation, we checked whether the commonly mutated genes in TNBC were preserved by our XMR model. We first extracted a series of commonly or frequently altered genes in TNBC (see Supplementary Material) from the literature on the PubMed database using the following search string: (triple-negative breast cancer OR TNBC) AND (commonly mutated genes OR highly frequently mutated genes). We found that our model identified 13 such genes out of 22 genes: TP53, PIK3CA, BRCA1/2, RB1, NOTCH2/3, BRAF, ERBB3, APC, STK11, KRAS, and NF1 (Philipovskiy et al., 2020; Kudelova et al., 2022; Li et al., 2022). We further conducted Fisher’s exact test to evaluate their significance. The p-value of 1.84 × 10⁻⁴ indicated that these genes were not randomly selected (Figure 6). Although the model did not retain other reported frequently mutated genes [e.g., PTEN, AKT1, and ATM], it still remains reasonable, given that the mutation frequency of these genes greatly varies among multiple studies (Kudelova et al., 2022). Considering allowable uncertainty and variability in simulation, these findings supported the feasibility and plausibility of the modeling framework coupled with the pruning approach to identify likely critical genes.

FIGURE 6

FIGURE 6. Comparison of genes identified by our model and frequently mutated genes in the literature.

We further checked the pathways identified by our XMR model. The retained pathways belonging to five categories of biological processes are graphically presented in Figure 3 and explained as follows. The genes corresponding to each term along individual pathways are provided in the Supplementary Material. Generally, our model found G2/M checkpoint, DNA repair-related, PI3K/mTOR signaling, RAS/RAF/MAPK/ERK signaling pathways, etc., which are enriched in different subtypes of TNBC [e.g., BL1, BL2, M, and LAR subtypes] (Lehmann et al., 2011; 2021).

1. Figure 3A indicates the pathway retained in the process of the cell cycle. The G2/M DNA damage checkpoint is likely sensitive to drug exposure based on our model. The loss of cell cycle checkpoints has been well-viewed as the hallmark of cancer (Löbrich and Jeggo, 2007). The remaining genes, including TP53, RB1, BRCA1/2, STAG2, and TP53BP1, are the common genes engaged in the cell cycle (Coussy et al., 2019; Kudelova et al., 2022). Another two genes (i.e., RAD21 and MDC1) are likely associated with DNA damage, possibly implying a G2/M checkpoint loss as well (Lehmann et al., 2011). Despite limited research, gene functional analyses suggested the potential role of meiosis, given that the differentially expressed genes of TNBC were markedly enriched in the oocyte meiosis pathway (Cao et al., 2021).

2. A total of four pathways belonging to DNA repair were retained by our model (Figure 3B). DNA repair defects are thought to be more common than homologous recombination defects for breast cancer (Lee et al., 2019). Common genes relevant to this process (e.g., TP53, RAD50, and POLE) (Coussy et al., 2019; Lehmann et al., 2021; Kudelova et al., 2022) were also detected in this study. Our model identified four breast cancer susceptibility genes in the Fanconi anemia pathway: FANCC, FANCD2, and FANCM (Fang et al., 2020). XRCC1, a potentially critical gene associated with TNBC through both base excision repair and double-strand break repair pathways, was also detected by the model (Lee et al., 2019). Currently, limited data on the impacts of DNA damage bypass defects on TNBC development can be found.

3. Growth factor receptor- and metabolism-mediated diseases are shown in Figure 3C. Sufficient evidence has indicated that aberrant MAPK signaling is associated with TNBC occurrence (Jiang et al., 2020; Lehmann et al., 2021). Activating mutations in MAP2K and RAF1 (i.e., MAP2K1, BRAF, KRAS, and NF1) would dysregulate cellular proliferation, differentiation, and survival. These genes are common genes involved in the MAPK signaling pathway (Coussy et al., 2019). The metabolism-mediated process mainly captured two genes (i.e., NOTCH2/3) in NOTCH signaling. The dysfunction of NOTCH signaling could contribute to the development of many cancers (e.g., TNBC) (Bray, 2006; Lehmann et al., 2021).

4. The commonly accepted PI3K/mTOR pathway that affects TNBC, especially BL2 and LAR subtypes (Lehmann et al., 2021), is presented in Figure 3D. The common genes we identified in this pathway include PIK3CA, PIK3R1, MTOR, STK11, and TSC1 (Coussy et al., 2019). Although the more straightforward pathway (i.e., PI3K/AKT signaling) was pruned, PI3K signaling activated by erythropoietin could sufficiently induce the proliferation of breast cancer cell lines (Tóthová et al., 2021). Additionally, the model also detected several mechanisms involved in the cascade by the phosphorylation of MAPK (i.e., RAS/ERK signaling) and VEGF signaling (Figure 3D). The aberrant activities of these pathways are potential factors for TNBC (Butti et al., 2018; Lehmann et al., 2021).

5. The metabolic dysregulation of TNBC may include remarkably altered amino acids, lipids, carbohydrates, nucleotides, and energy levels (Gong et al., 2021), which is also an emerging focus for cancer treatment (Sun et al., 2020). Similar molecular features of TNBC related to metabolism were observed by our model (Figure 3E). Targeting identified genes [e.g., FASN and SLC7A5] may provide a potential strategy to treat TNBC (Sun et al., 2020; Nachef et al., 2021), but the metabolism of TNBC still calls for further investigations due to limited studies.

3.3 Interpretations of drugs

In this section, we verified the top 10 drugs predicted to be effective by our XMR model. The top 10 drugs that would target TNBC were identified in consideration of both the predicted drug response and relatively adequate information on chemical–drug interactions provided by the PubChem database (Kim et al., 2022). The potentially effective drugs and corresponding information are summarized in Table 1. Specifically, the top 10 drugs derived from the synthesis of five major pathways (i.e., cell cycle, DNA repair, signaling transduction, metabolism, and diseases) are listed under “Integrated results,” followed by the drugs only identified by the individual pathways listed under “Pathway-specific results.” It is worth noting that the top drugs were screened and selected based on the (cell line and drug) pairs that were not presented in the entire dataset, thereby reflecting the prediction capability of our model. In addition, platinum salts, a classical group of alkylating agents applied in neoadjuvant chemotherapy for TNBC (Nedeljković and Damjanović, 2019), were not included in the drug repository we utilized in this study, since we aimed to gain a better understanding of the efficacy of relatively novel drugs.

TABLE 1

TABLE 1. Integrated and pathway-specific top 10 drugs identified by XMR and their corresponding descriptions.

Generally, the five pathways identified similar drugs. Four drugs have been studied in recent clinical trials to explore their potentials to treat TNBC or breast cancer (BC) with leptomeningeal metastases (i.e., dinaciclib, panobinostat, BI 2536, and cytarabine). Although the majority of clinical trials utilize combination therapies, these drugs may play a role in a synergistic way. The other four drugs are still under in vitro or in vivo investigations of TNBC treatment (i.e., AZD7762, ouabain, homoharringtonine, and GSK461364) (Giordano et al., 2019; Yakhni et al., 2019; Du et al., 2021; Zhu et al., 2021; Plett et al., 2022). Interestingly, leptomycin B (LMB) was considered unsafe in previous clinical trials due to its adequate adverse effect and thus not approved for use (Wang and Liu, 2019), while it is the most effective drug identified by our model. As an alternative to LMB, selinexor exhibits manageable side effects and processes a similar mechanism. It shows the potential to treat TNBC based on several preclinical studies (Cheng et al., 2014; Arango et al., 2017) and clinical trials (e.g., NCT05035745, NCT02402764, and NCT02419495). If selinexor was included in our drug repository, it would probably be identified instead of LMB. For pathway-specific drugs, BC-related clinical trials using vincristine and bleomycin can be found, but bleomycin is usually applied to evaluate the effects of electrochemotherapy (Radica et al., 2020).

From the point of mechanism-based view, the majority of the top agents target the cell cycle process, DNA replication, and DNA repair (e.g., LMB, dinaciclib, AZD7762, panobinostat, BI 2536, homoharringtonine, TW-37, cytarabine, and GSK461364), which are in line with part of the characteristics of BL1 and BL2 subtypes (Lehmann et al., 2011; Lehmann et al., 2021). Although there are limited studies on the effects of TW-37 on TNBC (Table 1), it does not necessarily mean that TW-37 is unpromising. Since myeloid cell leukemia-1 (Mcl-1) is a critical factor for the survival and motility of TNBC cells (Goodwin et al., 2015), TW-37 likely plays a key role in TNBC treatment. Two drugs are characterized by ion channel transport (i.e., ouabain and thapsigargin), which may be related to critical signaling pathways for TNBC. For example, it was shown that overexpression of the NOTCH signaling pathway (e.g., NOTCH 1) would induce proliferation and tumorigenesis of TNBC (Nedeljković and Damjanović, 2019), especially for BL1 and M subtypes. As a sarco-endoplasmic reticulum Ca²⁺-ATPase (SERCA) modulator, thapsigargin may inhibit oncogenic NOTCH 1 signaling (Pagliaro et al., 2021), thereby possibly suppressing tumor growth.

4 Discussion

Predicting drug response in cancer patients can be difficult due to the genetic diversity among them. The end-to-end training of such a model requires balancing the representation of two types of data fed into the model while also serving its prediction purpose, i.e., accurately projecting the relationship between the two modalities. There has been limited exploration in this area. For example, DrugCell used the VNN instead of the usual neural network to fully incorporate biological processes at molecular and cellular levels (Kuenzi et al., 2020). Another effort leveraged ParsVNN to select important terms and edges of those biological architectures to improve the performance and explanation (Huang X. et al., 2021). However, ParsVNN still used the simple MLP to extract features from the Morgan fingerprint of drugs similar to DrugCell (Kuenzi et al., 2020). The information on drugs’ molecular structures is not fully used in those models. Meanwhile, there have been many studies in the field of vision-and-language processing (VLP) tasks. Those insights and results were gained from developing multimodal models for VLP, which can be effectively applied to drug response prediction. In this paper, we have developed XMR, which follows a similar structure to ParsVNN for the gene information (Huang X. et al., 2021) but uses the GNN to extract the information between neighbor atoms from the molecular structure of a certain chemical (Tsubaki et al., 2019).

We chose drug response data in TNBC cell lines from GDSC (Yang et al., 2012) and CTRP (Seashore-Ludlow et al., 2015). Selecting this disease as the case in this study is primarily because TNBC treatment (e.g., classical regimens) still remains challenging compared to other types of breast cancer, given the lack of specific hormone receptors, common driver mutations (Rajput et al., 2016; Zhu et al., 2022), and high heterogeneity and resistance (Nedeljković and Damjanović, 2019). This brings us to seek novel personalized approaches to treating TNBC. In this study, we compared our model with the existing methods under five biological pathways obtained from the Reactome Pathway Database (Fabregat et al., 2018). It indicates that XMR outperforms both DrugCell and ParsVNN in terms of test accuracy (Figure 4). Moreover, the results derived from the XMR model can be explained and verified at levels of genes, pathways, and drugs (Figure 3).

Overall, 13 commonly or frequently mutated genes related to TNBC were retained by our model. We also identified commonly accepted pathways (e.g., cell cycle, DNA repair, PI3K/mTOR, and MAPK signaling) and promising pathways associated with metabolic reprogramming. Additionally, several novel drugs tested under clinical trials/cell experiments/animal studies were selected (Table 1). For the purpose of selecting new drugs, we ensured that the test dataset contained as many novel (cell line and drug) pairs as possible. In other words, the (cell line and drug) pairs applied for drug screening have ruled out all combinations shown in both training and validation datasets from previous experiments. This way, the commonly used agents for TNBC, such as DNA-damaging agents (e.g., doxorubicin and cyclophosphamide) and mitotic inhibitors (e.g., docetaxel) (Powell et al., 2020), and recently discovered agents, such as poly(ADP-ribose) polymerase (PARP) inhibitors (olaparib, veliparib, and rucaparib), were ultimately not considered in this study. For example, in the test dataset, only four combinations among a total of 1,864 (cell line and drug) pairs were relevant to docetaxel, not to mention doxorubicin which had no pairs to be tested.

Nevertheless, there are some underlying limitations that would be addressed by future efforts. First, our model contains three components: a genotypic embedder, a chemical embedder, and a modality interaction block. In this study, we focused on balancing the complexity of the first two embedders while maintaining the design of the modality interaction block simple. However, the modality interaction block has been recognized as an essential element in VLP tasks, as demonstrated in studies such as ViLBERT (Lu et al., 2019), UNITER (Chen et al., 2020), and ViLT (Kim et al., 2021). It allows us to improve the interaction in the future by employing a multi-headed self-attention layer to extract more comprehensive features in the interaction between the two modalities. Second, GDSC provides drug response data with multi-drugs (Huang S. et al., 2021), while the current model only considers the effects of a single drug. The model could be further refined by synthesizing disparate types of drugs (e.g., classical regimens and immunotherapies) and by delving into their synergistic effects to better facilitate TNBC treatment.

Although our model showed the ability to provide biologically reasonable interpretations, most drugs exhibit mechanisms that are associated with cell cycle and DNA repair. Apart from the property of our test dataset itself (e.g., perhaps uneven distributions of cell-line data), the pathways utilized in this study may be another contributor. Currently, our model is characterized by five pathways, including cell cycle, DNA repair, diseases, signaling transduction, and metabolism, listed in the Reactome Pathway Database (Fabregat et al., 2018). However, other pathways, such as the immune system, developmental biology, and hemostasis, have also been reported as having potential linkages with TNBC development, especially the specific subtypes (Lehmann et al., 2011; Lehmann et al., 2021). These pathways may inform the VNN structure and lead to more comprehensive results of drug discovery when incorporated into the model. Additionally, distinct databases of biological processes (e.g., Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Reactome) may differ in their ultimate findings, due to different annotations and genes they cover. Although the comparison of the model performance based on these databases is beyond the scope of our study, it would be an invaluable factor in refining the interpretability of our model.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author. The source code of XMR and the datasets are available at https://github.com/zwa2/XMR.

Author contributions

ZW, YZo, YZa, and YM contributed equally to the study. YW supervised this work. YW, ZW, YZo, YZa, and YM conceived and designed the study. ZW conducted the experiments. YZo provided the explanations. ZW, YZo, YZa, and YM wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research was supported by the National Institutes of Health grant R35GM147241-01 to YW and Precision Health Initiative at Indiana University to YW.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbinf.2023.1164482/full#supplementary-material

References

Arango, N. P., Yuca, E., Zhao, M., Evans, K. W., Scott, S., Kim, C., et al. (2017). Selinexor (kpt-330) demonstrates anti-tumor efficacy in preclinical models of triple-negative breast cancer. Breast cancer Res. 19, 93. doi:10.1186/s13058-017-0878-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Barretina, J., Caponigro, G., Stransky, N., Venkatesan, K., Margolin, A. A., Kim, S., et al. (2012). The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483, 603–607. doi:10.1038/nature11003

PubMed Abstract | CrossRef Full Text | Google Scholar

Basu, A., Bodycombe, N. E., Cheah, J. H., Price, E. V., Liu, K., Schaefer, G. I., et al. (2013). An interactive resource to identify cancer genetic and lineage dependencies targeted by small molecules. Cell 154, 1151–1161. doi:10.1016/j.cell.2013.08.003

PubMed Abstract | CrossRef Full Text | Google Scholar

Bedard, P. L., Hansen, A. R., Ratain, M. J., and Siu, L. L. (2013). Tumour heterogeneity in the clinic. Nature 501, 355–364. doi:10.1038/nature12627

PubMed Abstract | CrossRef Full Text | Google Scholar

Bolte, J., Sabach, S., and Teboulle, M. (2014). Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Math. Program. 146, 459–494. doi:10.1007/s10107-013-0701-9

CrossRef Full Text | Google Scholar

Bray, S. J. (2006). Notch signalling: A simple pathway becomes complex. Nat. Rev. Mol. Cell Biol. 7, 678–689. doi:10.1038/nrm2009

PubMed Abstract | CrossRef Full Text | Google Scholar

Butti, R., Das, S., Gunasekaran, V. P., Yadav, A. S., Kumar, D., and Kundu, G. C. (2018). Receptor tyrosine kinases (rtks) in breast cancer: Signaling, therapeutic implications and challenges. Mol. cancer 17, 34–18. doi:10.1186/s12943-018-0797-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Cao, W., Jiang, Y., Ji, X., Guan, X., Lin, Q., and Ma, L. (2021). Identification of novel prognostic genes of triple-negative breast cancer using meta-analysis and weighted gene co-expressed network analysis. Ann. Transl. Med. 9, 205. doi:10.21037/atm-20-5989

PubMed Abstract | CrossRef Full Text | Google Scholar

Chavez, K. J., Garimella, S. V., and Lipkowitz, S. (2010). Triple negative breast cancer cell lines: One tool in the search for better treatment of triple negative breast cancer. Breast Dis. 32, 35–48. doi:10.3233/bd-2010-0307

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, Y., Li, L., Yu, L., Kholy, A. E., Ahmed, F., Gan, Z., et al. (2020). “Uniter: Universal image-text representation learning,” in Computer Vision - ECCV 2020 - 16th European Conference, August 23-28, 2020, Glasgow, UK, 104–120. Proceedings, Part XXX. doi:10.1007/978-3-030-58577-8_7

CrossRef Full Text | Google Scholar

Cheng, Y., Holloway, M. P., Nguyen, K., McCauley, D., Landesman, Y., Kauffman, M. G., et al. (2014). Xpo1 (crm1) inhibition represses stat3 activation to drive a survivin-dependent oncogenic switch in triple-negative breast cancer. Mol. cancer Ther. 13, 675–686. doi:10.1158/1535-7163.MCT-13-0416

PubMed Abstract | CrossRef Full Text | Google Scholar

Costa, F., and De Grave, K. (2010). “Fast neighborhood subgraph pairwise distance kernel,” in Proceedings of the 26th international conference on machine learning (Madison, WI, USA: Omnipress), 255–262.

Google Scholar

Coussy, F., de Koning, L., Lavigne, M., Bernard, V., Ouine, B., Boulai, A., et al. (2019). A large collection of integrated genomically characterized patient-derived xenografts highlighting the heterogeneity of triple-negative breast cancer. Int. J. cancer 145, 1902–1912. doi:10.1002/ijc.32266

PubMed Abstract | CrossRef Full Text | Google Scholar

Dagogo-Jack, I., and Shaw, A. T. (2018). Tumour heterogeneity and resistance to cancer therapies. Nat. Rev. Clin. Oncol. 15, 81–94. doi:10.1038/nrclinonc.2017.166

PubMed Abstract | CrossRef Full Text | Google Scholar

Dai, X., Cheng, H., Bai, Z., and Li, J. (2017). Breast cancer cell line classification and its relevance with breast tumor subtyping. J. Cancer 8, 3131–3141. doi:10.7150/jca.18457

PubMed Abstract | CrossRef Full Text | Google Scholar

DepMap (2022). DepMap 22Q4 public. doi:10.6084/m9.figshare.21637199.v2

CrossRef Full Text | Google Scholar

Diedenhofen, B., and Musch, J. (2015). cocor: a comprehensive solution for the statistical comparison of correlations. PLoS One 10, e0121945. doi:10.1371/journal.pone.0121945

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, J., Jiang, L., Chen, F., Hu, H., and Zhou, M. (2021). Cardiac glycoside ouabain exerts anticancer activity via downregulation of stat3. Front. Oncol. 11, 684316. doi:10.3389/fonc.2021.684316

PubMed Abstract | CrossRef Full Text | Google Scholar

Du, M., Liu, N., and Hu, X. (2020). Techniques for interpretable machine learning. Commun. ACM 63, 68–77. doi:10.1145/3359786

CrossRef Full Text | Google Scholar

Elmarakeby, H. A., Hwang, J., Arafeh, R., Crowdis, J., Gang, S., Liu, D., et al. (2021). Biologically informed deep neural network for prostate cancer discovery. Nature 598, 348–352. doi:10.1038/s41586-021-03922-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Fabregat, A., Jupe, S., Matthews, L., Sidiropoulos, K., Gillespie, M., Garapati, P., et al. (2018). The reactome pathway knowledgebase. Nucleic acids Res. 46, D649–D655. doi:10.1093/nar/gkx1132

PubMed Abstract | CrossRef Full Text | Google Scholar

Fang, C.-B., Wu, H.-T., Zhang, M.-L., Liu, J., and Zhang, G.-J. (2020). Fanconi anemia pathway: Mechanisms of breast cancer predisposition development and potential therapeutic targets. Front. Cell Dev. Biol. 8, 160. doi:10.3389/fcell.2020.00160

PubMed Abstract | CrossRef Full Text | Google Scholar

Fittall, M. W., and Van Loo, P. (2019). Translating insights into tumor evolution to clinical practice: Promises and challenges. Genome Med. 11, 20–14. doi:10.1186/s13073-019-0632-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Giordano, A., Liu, Y., Armeson, K., Park, Y., Ridinger, M., Reuben, J., et al. (2019). Polo-like kinase 1 (plk1) inhibition synergizes with taxanes in triple negative breast cancer. PloS one 14, e0224420. doi:10.1371/journal.pone.0224420

PubMed Abstract | CrossRef Full Text | Google Scholar

Gong, Y., Ji, P., Yang, Y.-S., Xie, S., Yu, T.-J., Xiao, Y., et al. (2021). Metabolic-pathway-based subtyping of triple-negative breast cancer reveals potential therapeutic targets. Cell metab. 33, 51–64.e9. doi:10.1016/j.cmet.2020.10.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodwin, C. M., Rossanese, O. W., Olejniczak, E. T., and Fesik, S. W. (2015). Myeloid cell leukemia-1 is an important apoptotic survival factor in triple-negative breast cancer. Cell death Differ. 22, 2098–2106. doi:10.1038/cdd.2015.73

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, S., Hu, P., and Lakowski, T. M. (2021a). Predicting breast cancer drug response using a multiple-layer cell line drug response network model. BMC cancer 21, 648. doi:10.1186/s12885-021-08359-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, X., Huang, K., Johnson, T., Radovich, M., Zhang, J., Ma, J., et al. (2021b). ParsVNN: Parsimony visible neural networks for uncovering cancer-specific and drug-sensitive genes and pathways. NAR Genomics Bioinforma. 3, lqab097. doi:10.1093/nargab/lqab097

CrossRef Full Text | Google Scholar

Iorio, F., Knijnenburg, T. A., Vis, D. J., Bignell, G. R., Menden, M. P., Schubert, M., et al. (2016). A landscape of pharmacogenomic interactions in cancer. Cell 166, 740–754. doi:10.1016/j.cell.2016.06.017

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiang, W., Wang, X., Zhang, C., Xue, L., and Yang, L. (2020). Expression and clinical significance of mapk and egfr in triple-negative breast cancer. Oncol. Lett. 19, 1842–1848. doi:10.3892/ol.2020.11274

PubMed Abstract | CrossRef Full Text | Google Scholar

Jiao, W., Atwal, G., Polak, P., Karlic, R., Cuppen, E., Danyi, A., et al. (2020). A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns. Nat. Commun. 11, 728. doi:10.1038/s41467-019-13825-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, S., Chen, J., Cheng, T., Gindulyte, A., He, J., He, S., et al. (2022). PubChem 2023 update. Nucleic Acids Res. 51, D1373–D1380. doi:10.1093/nar/gkac956

CrossRef Full Text | Google Scholar

Kim, W., Son, B., and Kim, I. (2021). “Vilt: Vision-and-language transformer without convolution or region supervision,” in Proceedings of the 38th International Conference on Machine Learning, ICML, 18-24 July 2021, 5583–5594. Virtual Event.

Google Scholar

Kudelova, E., Smolar, M., Holubekova, V., Hornakova, A., Dvorska, D., Lucansky, V., et al. (2022). Genetic heterogeneity, tumor microenvironment and immunotherapy in triple-negative breast cancer. Int. J. Mol. Sci. 23, 14937. doi:10.3390/ijms232314937

PubMed Abstract | CrossRef Full Text | Google Scholar

Kuenzi, B. M., Park, J., Fong, S. H., Sanchez, K. S., Lee, J., Kreisberg, J. F., et al. (2020). Predicting drug response and synergy using a deep learning model of human cancer cells. Cancer Cell 38, 672–684.e6. doi:10.1016/j.ccell.2020.09.014

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, K. J., Piett, C. G., Andrews, J. F., Mann, E., Nagel, Z. D., and Gassman, N. R. (2019). Defective base excision repair in the response to dna damaging agents in triple negative breast cancer. PLoS One 14, e0223725. doi:10.1371/journal.pone.0223725

PubMed Abstract | CrossRef Full Text | Google Scholar

Lehmann, B. D., Bauer, J. A., Chen, X., Sanders, M. E., Chakravarthy, A. B., Shyr, Y., et al. (2011). Identification of human triple-negative breast cancer subtypes and preclinical models for selection of targeted therapies. J. Clin. investigation 121, 2750–2767. doi:10.1172/JCI45014

PubMed Abstract | CrossRef Full Text | Google Scholar

Lehmann, B. D., Colaprico, A., Silva, T. C., Chen, J., An, H., Ban, Y., et al. (2021). Multi-omics analysis identifies therapeutic vulnerabilities in triple-negative breast cancer subtypes. Nat. Commun. 12, 6276. doi:10.1038/s41467-021-26502-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Selvaraju, R. R., Gotmare, A., Joty, S. R., Xiong, C., and Hoi, S. C. (2021). “Align before fuse: Vision and language representation learning with momentum distillation,” in Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, December 6-14, 2021 (NeurIPS), 9694–9705. virtual.

Google Scholar

Li, Y.-Z., Chen, B., Lin, X.-Y., Zhang, G.-C., Lai, J.-G., Li, C., et al. (2022). Clinicopathologic and genomic features in triple-negative breast cancer between special and no-special morphologic pattern. Front. Oncol. 12, 830124. doi:10.3389/fonc.2022.830124

PubMed Abstract | CrossRef Full Text | Google Scholar

Lim, Z.-F., and Ma, P. C. (2019). Emerging insights of tumor heterogeneity and drug resistance mechanisms in lung cancer targeted therapy. J. Hematol. Oncol. 12, 134. doi:10.1186/s13045-019-0818-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Löbrich, M., and Jeggo, P. A. (2007). The impact of a negligent g2/m checkpoint on genomic instability and cancer induction. Nat. Rev. Cancer 7, 861–869. doi:10.1038/nrc2248

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, J., Batra, D., Parikh, D., and Lee, S. (2019). “Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks,” in Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, Vancouver, BC, Canada, December 8-14, 2019 (Vancouver, USA: NeurIPS), 13–23.

Google Scholar

Luo, P., Ding, Y., Lei, X., and Wu, F.-X. (2019). deepdriver: predicting cancer driver genes based on somatic mutations using deep convolutional neural networks. Front. Genet. 10, 13. doi:10.3389/fgene.2019.00013

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, J., Yu, M. K., Fong, S., Ono, K., Sage, E., Demchak, B., et al. (2018). Using deep learning to model the hierarchical structure and function of a cell. Nat. methods 15, 290–298. doi:10.1038/nmeth.4627

PubMed Abstract | CrossRef Full Text | Google Scholar

Nachef, M., Ali, A. K., Almutairi, S. M., and Lee, S.-H. (2021). Targeting slc1a5 and slc3a2/slc7a5 as a potential strategy to strengthen anti-tumor immunity in the tumor microenvironment. Front. Immunol. 12, 624324. doi:10.3389/fimmu.2021.624324

PubMed Abstract | CrossRef Full Text | Google Scholar

Nedeljković, M., and Damjanović, A. (2019). Mechanisms of chemotherapy resistance in triple-negative breast cancer-how we can rise to the challenge. Cells 8, 957. doi:10.3390/cells8090957

PubMed Abstract | CrossRef Full Text | Google Scholar

Pagliaro, L., Marchesini, M., and Roti, G. (2021). Targeting oncogenic notch signaling with serca inhibitors. J. Hematol. Oncol. 14, 8. doi:10.1186/s13045-020-01015-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Philipovskiy, A., Dwivedi, A. K., Gamez, R., McCallum, R., Mukherjee, D., Nahleh, Z., et al. (2020). Association between tumor mutation profile and clinical outcomes among hispanic latina women with triple-negative breast cancer. PLoS One 15, e0238262. doi:10.1371/journal.pone.0238262

PubMed Abstract | CrossRef Full Text | Google Scholar

Plett, R., Mellor, P., Kendall, S., Hammond, S. A., Boulet, A., Plaza, K., et al. (2022). Homoharringtonine demonstrates a cytotoxic effect against triple-negative breast cancer cell lines and acts synergistically with paclitaxel. Sci. Rep. 12, 15663. doi:10.1038/s41598-022-19621-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Powell, R. T., Redwood, A., Liu, X., Guo, L., Cai, S., Zhou, X., et al. (2020). Pharmacologic profiling of patient-derived xenograft models of primary treatment-naïve triple-negative breast cancer. Sci. Rep. 10, 17899. doi:10.1038/s41598-020-74882-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., et al. (2021). “Learning transferable visual models from natural language supervision,” in Proceedings of the 38th International Conference on Machine Learning, ICML, 18-24 July 2021, 8748–8763. Virtual Event.

Google Scholar

Radica, M. K., Fabbri, N., Santandrea, G., Bonazza, S., Stefanelli, A., and Carcoforo, P. (2020). Use of electrochemotherapy in a voluminous chest wall recurrence of triple-negative breast cancer: Case report. AME case Rep. 4, 30. doi:10.21037/acr-20-54

PubMed Abstract | CrossRef Full Text | Google Scholar

Rajput, S., Khera, N., Guo, Z., Hoog, J., Li, S., and Ma, C. X. (2016). Inhibition of cyclin dependent kinase 9 by dinaciclib suppresses cyclin b1 expression and tumor growth in triple negative breast cancer. Oncotarget 7, 56864–56875. doi:10.18632/oncotarget.10870

PubMed Abstract | CrossRef Full Text | Google Scholar

Ramón y Cajal, S., Sesé, M., Capdevila, C., Aasen, T., De Mattos-Arruda, L., Diaz-Cano, S. J., et al. (2020). Clinical implications of intratumor heterogeneity: Challenges and opportunities. J. Mol. Med. 98, 161–177. doi:10.1007/s00109-020-01874-2

CrossRef Full Text | Google Scholar

Seashore-Ludlow, B., Rees, M. G., Cheah, J. H., Cokol, M., Price, E. V., Coletti, M. E., et al. (2015). Harnessing connectivity in a large-scale small-molecule sensitivity dataset. Cancer Discov. 5, 1210–1223. doi:10.1158/2159-8290.cd-15-0235

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, X., Wang, M., Wang, M., Yu, X., Guo, J., Sun, T., et al. (2020). Metabolic reprogramming in triple-negative breast cancer. Front. Oncol. 10, 428. doi:10.3389/fonc.2020.00428

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, Y., Zhu, S., Ma, K., Liu, W., Yue, Y., Hu, G., et al. (2019). Identification of 12 cancer types through genome deep learning. Sci. Rep. 9, 17256. doi:10.1038/s41598-019-53989-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Tóthová, Z., Šemeláková, M., Solárová, Z., Tomc, J., Debeljak, N., and Solár, P. (2021). The role of pi3k/akt and mapk signaling pathways in erythropoietin signalization. Int. J. Mol. Sci. 22, 7682. doi:10.3390/ijms22147682

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsubaki, M., Tomii, K., and Sese, J. (2019). Compound–protein interaction prediction with end-to-end learning of neural networks for graphs and sequences. Bioinformatics 35, 309–318. doi:10.1093/bioinformatics/bty535

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, A. Y., and Liu, H. (2019). The past, present, and future of crm1/xpo1 inhibitors. Stem Cell Investig. 6, 6. doi:10.21037/sci.2019.02.03

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, D., Liu, S., Warrell, J., Won, H., Shi, X., Navarro, F. C., et al. (2018). Comprehensive functional genomic resource and integrative model for the human brain. Science 362, eaat8464. doi:10.1126/science.aat8464

PubMed Abstract | CrossRef Full Text | Google Scholar

Yakhni, M., Briat, A., Guerrab, A. E., Furtado, L., Kwiatkowski, F., Miot-Noirault, E., et al. (2019). Homoharringtonine, an approved anti-leukemia drug, suppresses triple negative breast cancer growth through a rapid reduction of anti-apoptotic protein abundance. Am. J. cancer Res. 9, 1043–1060.

PubMed Abstract | Google Scholar

Yang, W., Soares, J., Greninger, P., Edelman, E. J., Lightfoot, H., Forbes, S., et al. (2012). Genomics of drug sensitivity in cancer (gdsc): A resource for therapeutic biomarker discovery in cancer cells. Nucleic acids Res. 41, D955–D961. doi:10.1093/nar/gks1111

PubMed Abstract | CrossRef Full Text | Google Scholar

Yuan, Y., Shi, Y., Li, C., Kim, J., Cai, W., Han, Z., et al. (2016). Deepgene: An advanced cancer type classifier based on deep learning and somatic point mutations. BMC Bioinforma. 17, 476–256. doi:10.1186/s12859-016-1334-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, H., Rao, Z., Yuan, S., You, J., Hong, C., He, Q., et al. (2021). One therapeutic approach for triple-negative breast cancer: Checkpoint kinase 1 inhibitor azd7762 combination with neoadjuvant carboplatin. Eur. J. Pharmacol. 908, 174366. doi:10.1016/j.ejphar.2021.174366

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhu, Y., Hu, Y., Tang, C., Guan, X., and Zhang, W. (2022). Platinum-based systematic therapy in triple-negative breast cancer. Rev. cancer 1877, 188678. doi:10.1016/j.bbcan.2022.188678

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: drug response prediction, machine learning, interpretable deep learning, multimodal deep learning, triple-negative breast cancer

Citation: Wang Z, Zhou Y, Zhang Y, Mo YK and Wang Y (2023) XMR: an explainable multimodal neural network for drug response prediction. Front. Bioinform. 3:1164482. doi: 10.3389/fbinf.2023.1164482

Received: 12 February 2023; Accepted: 14 July 2023;
Published: 02 August 2023.

Edited by:

Kevin Eliceiri, University of Wisconsin-Madison, United States

Reviewed by:

Yang Zhang, Carnegie Mellon University, United States
Munazzah Tasleem, University of Electronic Science and Technology, China

Copyright © 2023 Wang, Zhou, Zhang, Mo and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yijie Wang, eWlqd2FuZ0BpdS5lZHU=

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

XMR: an explainable multimodal neural network for drug response prediction

1 Introduction

2 Materials and methods

2.1 Taxonomy of multimodal models

2.2 Model architecture

2.3 Explainability in XMR

2.4 Dataset and splitting

2.5 Construction of TNBC-specific XMR models

3 Results

3.1 Heavyweight chemical embedding is critical to model performance

3.2 Interpretations of genes and pathways

3.3 Interpretations of drugs

4 Discussion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good