- 1Key Laboratory of Intelligent Computing and Signal Processing, School of Artificial Intelligence, Anhui University, Hefei, China
- 2Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, Anhui, China
- 3State Key Laboratory of Pathogenesis, Prevention, and Treatment of Central Asian High Incidence Diseases, Clinical Medical Research Institute, First Affiliated Hospital of Xinjiang Medical University, Urumqi, China
- 4College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
Echinococcosis is a zoonotic parasitic disease caused by the larvae of echinococcus tapeworms infesting the human body. Drug combination therapy is highly valued for the treatment of echinococcosis because of its potential to overcome resistance and enhance the response to existing drugs. Traditional methods of identifying drug combinations via biological experimentation is costly and time-consuming. Besides, the scarcity of existing drug combinations for echinococcosis hinders the development of computational methods. In this study, we propose a transfer learning-based model, namely TransferBAN-Syn, to identify synergistic drug combinations against echinococcosis based on abundant information of drug combinations against parasitic diseases. To the best of our knowledge, this is the first work that leverages transfer learning to improve prediction accuracy with limited drug combination data in echinococcosis treatment. Specifically, TransferBAN-Syn contains a drug interaction feature representation module, a disease feature representation module, and a prediction module, where the bilinear attention network is employed in the drug interaction feature representation module to deeply extract the fusion feature of drug combinations. Besides, we construct a special dataset with multi-source information and drug combinations for parasitic diseases, including 21 parasitic diseases and echinococcosis. TransferBAN-Syn is designed and initially trained on the abundant data from the 21 parasitic diseases, which serves as the source domain. The parameters in the feature representation modules of drug interactions and diseases are preserved from this source domain, and those in the prediction module are then fine-tuned to specifically identify the synergistic drug combinations for echinococcosis in the target domain. Comparison experiments have shown that TransferBAN-Syn not only improves the accuracy of predicting echinococcosis drug combinations but also enhances generalizability. Furthermore, TransferBAN-Syn identifies potential drug combinations that hold promise in the treatment of echinococcosis. TransferBAN-Syn not only offers new synergistic drug combinations for echinococcosis but also provides a novel approach for predicting potential drug pairs for diseases with limited combination data.
1 Introduction
Echinococcosis is a zoonotic parasitic disease caused by the larval stages of tapeworms of the genus echinococcus, primarily affecting organs such as the liver and lungs (Alvi et al., 2023; Meng et al., 2023). This disease manifests in two forms: Cystic echinococcosis and Alveolar echinococcosis, both having significant clinical consequences, potentially resulting in high mortality (Autier et al., 2023; Casulli et al., 2023). The disease is predominantly found in the Mediterranean basin, South America, North Africa, Central Asia, and Eastern Europe, with the western pastoral areas of China being high-prevalence zones (Wen et al., 2019). Alveolar echinococcosis, also known as ‘worm cancer’, is especially dangerous, with untreated cases facing a 10-year mortality rate over
The treatment strategies for echinococcosis include surgical and pharmacological interventions. In the situation that patients cannot undergo surgery, pharmacological interventions are more suitable options (Hogea et al., 2024). Currently available anti-echinococcosis drugs are mainly anti-parasitic drugs and cancer-fighting drugs (Wang et al., 2022). In clinical practice, benzimidazole derivatives, such as albendazole and mebendazole, are widely used to treat echinococcosis (Wen et al., 2019). However, clinical studies have shown that long-term administration of albendazole and mebendazole might cause adverse reactions such as the skin and mucous membranes, nervous system, and cardiovascular system (Qing et al., 2023). Therefore, there is an urgent need to explore effective and safe therapeutic strategies against echinococcosis.
The application of drug combinations in treating various complex diseases, such as cancers and hypertension, is becoming increasingly widespread (Parati et al., 2021; Jaaks et al., 2022). Compared with single-drug treatment, combinations can synergize within biological pathways, enhancing efficacy and hastening recovery, while reducing doses of individual drugs, mitigating potential adverse effects and resistance (Csermely et al., 2013). These improvements enhance the quality of life for patients and reduce discomfort during treatment. Furthermore, combinations can lower the risk of disease resistance, highlighting the importance of identifying effective synergistic pairs in treatment strategies.
Recently, the main strategies to develop anti-echinococcosis drug combinations are based on in vitro and in vivo experiments.
Loos et al. investigate the in vitro anti-echinococcal activity of Octreotide combined with Metformin, demonstrating significant reduction in parasite viability through induced autophagy and upregulation of key autophagic genes, proposing a potential new therapeutic approach for treating cystic echinococcosis (Loos et al., 2020). Mohammadi et al. demonstrate that the combined treatment of Allium sativum methanolic extract with a reduced dose of Albendazole enhances anti-hydatidosis efficacy, achieving similar parasitological outcomes as a higher dose of Albendazole alone, but with reduced hepatotoxic effects (Haji Mohammadi et al., 2019). However, this process is time-consuming and costly. Besides, patients may also be subjected to unnecessary treatment risks (Rani et al., 2022). Moreover, it is difficult to explore all the possible drug combinations solely through biological experiments.
The inefficiency and limitations of this widespread trial-and-error method underline the critical need for more effective experimental evaluation techniques.
In the past decade, machine learning-based methods have played a significant role in the field of drug combination prediction, greatly expanding the capability to explore effective drug combinations (Wu et al., 2022; Liu et al., 2023). For instance, Janizek et al. introduced TreeCombo, which is based on the extreme gradient boosting trees (XGBoost) algorithm to predict the synergy scores of drug pairs (Janizek et al., 2018). Although traditional machine learning-based methods have made progress in drug combination prediction, they still have limitations such as the need for complex manual feature engineering and expertise, as well as insufficient computational power to support large-scale rapid predictions. With the rapid development of deep learning technology and the availability of extensive drug combination data, using deep learning for drug combination prediction has become a new trend. For example, the DeepSynergy model predicts the synergy between drugs by combining their chemical properties and gene expression data from cell lines (Preuer et al., 2018). GAECDS integrates graph autoencoders and convolutional neural networks to predict the synergistic effects of drug combinations (Li et al., 2023). These deep learning approaches not only overcome some limitations of traditional machine learning-based methods but also open new possibilities for exploring and predicting effective drug combinations in cancer (Güvenç Paltun et al., 2021). However, current deep learning approaches still have limitations in predicting synergistic drug combinations for echinococcosis. Firstly, deep learning algorithms for drug combination prediction have primarily focused on cancer due to the extensive genomic data (e.g. gene expression data) available from cancer cell lines, effectively capturing cancer features (Sarmah et al., 2023). Nevertheless, data similar to cancer genomics data is not yet available for parasitic diseases like echinococcosis, which limits the development of computational prediction methods. Secondly, the effective training of deep learning models is contingent upon extensive, high-quality datasets, which should include a vast number of evaluations of drug synergistic combinations (O’Neil et al., 2016). Yet, data on effective synergistic combinations for echinococcosis is exceedingly rare, considerably constraining the training and predictive precision of deep learning approaches. Furthermore, current techniques generally fall short in the integration of features. The current approach is usually to simply concatenate the features of individual drugs without fully considering the possible complex interactions between them. This simple method of combining features ignores the comprehensive effects of drug interactions. Therefore, it cannot reveal the potential interactions between different drug properties. Due to the aforementioned limitations, there are currently no effective algorithms for identifying potential echinococcosis drug combinations.
To overcome these limitations and improve the accuracy of identifying potential synergistic drug combinations for echinococcosis, this study has developed a transfer learning-based framework named the TransferBAN-Syn model. The model, integrating drug combination data from other parasitic diseases, effectively predicts treatment combinations for this disease despite limited existing information. This paper makes three significant contributions:
2 Materials and methods
2.1 Dataset description
2.1.1 Comprehensive parasitic disease drug combination dataset
The comprehensive parasitic disease drug combination dataset collects drug and drug combination information for treating echinococcosis and other parasitic diseases. The dataset is sourced from the China National Knowledge Infrastructure (CNKI) and PubMed databases. For the treatment of echinococcosis, the related drugs and drug combinations include 55 single drugs and 50 drug combinations that have been confirmed to have synergistic effects. Considering the limited information on anti-echinococcosis drug combinations, the dataset also includes 21 other parasitic diseases similar to echinococcosis (see Supplementary Table S1), including 263 single drugs and 283 drug combinations. These 21 parasitic diseases are selected based on their significant biological similarities to echinococcosis. Specifically, these diseases share critical descriptors with echinococcosis, which suggests potential genetic or pathway similarities that may influence their response to drug treatments. The selection process utilizes the Malacards database, where a GeneAnalytics tool analyzes gene-sharing characteristics between echinococcosis and other diseases. Parasitic diseases with a similarity score greater than eight are chosen, as they are likely to exhibit similar responses to drug combinations, making them valuable in supporting the prediction of effective drug combinations for echinococcosis. The efficacy of each drug and its combination is supported by literature. Additionally, drug combinations are identified and organized that have been clearly shown to have no synergistic effects and are unsuitable for use together as negative samples, to enhance the accuracy of the research. This dataset provides a comprehensive and detailed data foundation for the synergistic drug combinations for echinococcosis and related parasitic diseases as shown in Table 1, supporting subsequent research on the identification of potential drug combinations.
2.1.2 Multi-source information of parasitic disease
Information from the MalaCards database http://www.malacard.org is utilized to effectively characterize echinococcosis and other parasitic diseases (O’Neil et al., 2016). Malacards is a comprehensive database that provides detailed information on various diseases, integrating data on disease characteristics, associated pathways, clinical features, and related medical conditions. It offers a valuable resource for understanding disease phenotypes and their underlying genetic and clinical aspects. The MalaCards composite relevance score served as the basis for similarity scores between parasitic diseases, uncovering further relevant connections by identifying significant gene overlaps between two diseases, leading to the generation of a composite relevance score (Rappaport et al., 2013). For details on the specific calculation of the MalaCards composite relevance score, see “MalaCards - The Human Disease Database” (https://www.malacards.org/pages/info#disorders).
2.2 TransferBAN-Syn model
2.2.1 Overview of transfer learning strategy and TransferBAN-Syn model
The aim of transfer learning is to leverage the source domain and source task to learn the target domain and improve the performance of the target task. The training phase of the transfer learning-based framework usually includes two stages. Initially, a source model is obtained by training the network using a sufficient amount of source training data. This is also known as the pre-trained source model. Then, the pre-trained source model is used as the initial weights and retrained with a small amount of target training data to obtain the target model. The most common transfer learning technique is fine-tuning, which is essentially parameter-based transfer learning. Based on the assumption that the learned parameter values (i.e., weights) from the source domain contain useful knowledge, better performance is achieved by transferring these parameter values to the target model. The parameter values obtained from the source model become the initial values for the target model’s parameters. Thus, the weights of the target model start from the converged values of the pre-trained source model rather than random values. The target model is also retrained with a small amount of target training data and converges faster with fewer training epochs.
The transfer learning-based model, TransferBAN-Syn, employs the predictive knowledge of drug synergistic effects from
Figure 1. TransferBAN-Syn Transfer Learning strategy. TransferBAN-Syn consists of source domain and target domain models. The source domain model is pre-trained with data-rich parasitic diseases to comprehend the underlying mechanisms between drug combinations and diseases. The target domain model for echinococcosis shares parameters with the source domain model and fine-tunes the prediction module parameters to achieve optimal predictive performance.
The TransferBAN-Syn model consists of three key modules as shown in Figure 2: a drug interaction feature representation module, a disease feature representation module, and a prediction module. Specifically, TransferBAN-Syn is initially trained on the data from 21 parasitic diseases to predict their synergistic drug combinations, serving as a source model with its parameters preserved. Then, the target model is developed by retaining the parameters in the drug and disease feature representation modules from the pre-trained source model, and fine-tuning the prediction module to identify potential drug combinations for echinococcosis.
Figure 2. Three modules in TransferBAN-Syn. Three key modules in TransferBAN-Syn are the drug interaction feature representation module, the disease feature representation module, and the prediction module. The drug interaction feature representation module uses GCN to extract atomic-level features from drug molecular graphs and employs a bilinear attention network to capture interactions between drugs, thereby forming a characteristic representation of the drug combination. The disease feature representation module integrates disease pathway and disease similarity information to form a disease feature representation. The prediction module integrates the drug interaction feature representation and disease feature representation to predict the potential of drug combinations in synergistically treating specific diseases.
In particular, the drug interaction feature representation module uses Graph Convolutional Networks (GCN) to extract atomic-level features from individual drug molecules. It then employs a bilinear attention network to combine these single drug features, capturing the interactions among drugs and forming a representation of drug combination features. The disease feature representation module combines pathway information and disease similarity for parasitic diseases, encoding these features using a Multilayer Perceptron (MLP) to obtain the disease representation. Finally, the drug combination features and disease features are merged and propagated through a fully connected layer to predict drug synergy combinations. This module serves as the upper part of the TransferBAN-Syn model, predicting potential drug combinations. The specifics of the TransferBAN-Syn model will be detailed below.
2.2.2 Drug interaction feature representation module
For precise and comprehensive representation of information among drug combinations, TransferBAN-Syn employs molecular graphs to depict each drug within a combination. It utilizes GCN to extract atomic-level features and a bilinear attention network to combine single-drug features, capturing interactions between drugs of the drug combination.
This study uses the open-source cheminformatics software RDKit to convert SMILES to molecular graph
Following the construction of molecular graph
TransferBAN-Syn employs a
where
TransferBAN-Syn utilizes Bilinear Attention Networks (BAN) to capture the feature representations of pairwise local interactions between drug pairs (as shown in Figure 3). Simultaneously achieving exceptional performance, the time complexity in low-rank bilinear pooling is optimized through matrix chain multiplication and leveraging the attributes of low-rank factorization. Therefore, BAN can capture complex interactions between drug molecules and extract their comprehensive features, which is crucial for understanding and predicting the synergistic effects of drug combinations.
Figure 3. Drug molecular bilinear attention network for extracting drug combination features. The bilinear attention network consists of a bilinear attention step and a bilinear pooling step to generate a joint representation.
In this research, we apply the BAN module to capture pairwise local interactions between drug combination. The BAN module primarily comprises two components: the construction of the drug bilinear interaction map and the bilinear pooling layer of the drug interaction map, with the former aimed at capturing pairwise attention weights and the latter at extracting the holistic feature representation of the drug combination. If the feature representations
where
After obtaining the drug bilinear interaction map
where the subscript
where the function
Additionally, by computing multiple drug bilinear interaction map, we can extend the single pairwise interactions into a multi-head format. The final joint representation vector is the sum of all heads. Since the weight matrices
2.2.3 Disease feature representation module
In this study, disease feature representation is achieved by integrating disease pathway and inter-disease similarity information, aiming to improve the accuracy of drug combination predictions for parasitic diseases. The progression of parasitic diseases is often influenced by multiple biological pathways. This pathway information not only reveals the underlying mechanisms of the disease but also facilitates research implementation and result reproducibility due to its relative ease of access and low-dimensionality. Additionally, considering the similarity between diseases helps identify biological markers and pathways shared by different diseases, thereby enhancing the model’s generalization ability. Thus, selecting parasitic disease pathway information and disease similarity as disease features holds significant theoretical and practical value in enhancing the predictive performance and robustness of drug combinations against echinococcosis.
For this study, pathway information related to parasitic diseases and their similarity scores are collected from the Malacards human disease database (http://www.malacards.org/) (Rappaport et al., 2013). For the construction of the disease pathway feature matrix
For in-depth extraction of disease features, TransferBAN-Syn utilizes a Multi-Layer Perceptron (MLP) model comprising two hidden layers, with neurons in these layers transforming and extracting features via nonlinear activation functions. The final feature embedding representation
where
2.2.4 Prediction module
The prediction module concatenates drug interaction feature and disease feature to ascertain the potential of a drug combination to synergistically treat a specific disease.
The interaction features
where
Finally, all learnable parameters are jointly optimized via backpropagation. To boost the model’s generalization capability, a minimized cross-entropy loss function with
where
3 Results
3.1 Experimental parameter settings
The TransferBAN-Syn algorithm is implemented in a Python 3.8 and PyTorch 1.7.1 environment. In the algorithm configuration, the batch size is set to 64. The maximum number of atoms in drug molecules is set to 150. The embedding dimension
The architecture of TransferBAN-Syn is essentially determined by a set of hyperparameters, including the GCN layers, learning rate, activation function, the hierarchical structure of training rounds, and so on. Considering the computational cost of exhaustively enumerating hyperparameters, we adopted a grid search strategy to adjust these parameters. Details of the parameter adjustments are provided in Supplementary Table S2. The selection of hyperparameters is refined through five-fold cross-validation on a benchmark dataset. The experimental results, detailed in the supplementary materials, demonstrate that the optimal configuration for the GCN involves three layers with dimensions [128, 256, 128], which effectively extract drug features as shown in Supplementary Figure S1. In the MLP used for disease feature extraction, the best performance is achieved with two hidden layers sized [128, 256]. The multi-head bilinear attention mechanism performs optimally with two heads as shown in Supplementary Figure S2. Additionally, the fully connected prediction layer includes 512 hidden units. The ReLU activation function is selected to enhance model performance, and the learning rate for the optimizer is set at 5e-5 as shown in Supplementary Figure S3. Subsequent experiments are conducted using these optimized model parameters.
3.2 Baseline methods
To evaluate the predictive performance of the TransferBAN-Syn model, we compared it with five state-of-the-art predicting drug synergy combinations, including classic machine learning and deep learning-based methods such as TreeCombo (Janizek et al., 2018), DeepSynergy (Preuer et al., 2018), TranSynergy (Liu and Xie, 2021), GAECDS (Li et al., 2023), and Attensyn (Wang et al., 2023).
3.3 Performance evaluation
This study assesses the performance of our TransferBAN-Syn model by comparing it with the aforementioned five state-of-the-art drug synergy combination prediction methods. To fairly compare the algorithms, we apply the same transfer learning strategy to all five algorithms and ours. Specifically, the drug combination prediction tasks for 21 other parasitic diseases are selected as the source tasks, with the drug combination prediction task for echinococcosis serving as the target task. The predictive capacities of the models are assessed using five iterations of five-fold cross-validation, wherein the training samples are randomly divided into five roughly equal subsets, with one subset reserved as the test set for each iteration and the remainder serving as the training set. During our experiments, we encountered an imbalance between the positive and negative samples in the target domain, where the ratio of positive to negative samples was approximately 1:2. This imbalance posed a challenge to the model’s predictive accuracy and generalization ability. To address this issue, we implemented an under-sampling technique in the target domain. Specifically, for each iteration, we randomly selected a subset of negative samples that matched the number of positive samples, effectively balancing the dataset. The average predictive accuracy from the five iterations of five-fold cross-validation serves as the ultimate metric for performance evaluation. The performance evaluation metrics include Area Under the Curve (AUC), the Area Under the Precision-Recall curve (AUPR), Recall (Rec), Precision (Prec), F1 score (F1), and accuracy (ACC) to comprehensively reflect the model’s performance in various aspects. The experimental results are shown in Table 2.
Table 2. Results (Mean ± STD) of TransferBAN-Syn and other five state-of-the-art drug synergy combination prediction methods in terms of six classification metrics.
The results indicate that TransferBAN-Syn performs outstandingly across all metrics, demonstrating excellent stability and robustness under different data splits. Specifically, TransferBAN-Syn has a precision of 0.9115, the highest among all models, indicating the lowest false positive rate in predicting positive samples. Its accuracy is 0.9220, also the highest, showing the best overall classification accuracy across all samples. Although TransferBAN-Syn’s recall is 0.9142, second only to GAECDS’s 0.9284, it still performs excellently, indicating the model’s effectiveness in identifying most positive samples. GAECDS combines graph autoencoder (GAE) and CNN, capturing complex relationships between drugs through the GAE module and performing collaborative score prediction through the CNN module. This combined structure allows for a more comprehensive handling and integration of various features, thereby enhancing the model’s ability to predict positive samples. However, the GAECDS model may overfit to positive samples during training, resulting in an inability to effectively distinguish negative samples, thereby reducing prediction accuracy and the performance of AUC and AUPR values. The results indicate that TransferBAN-Syn’s F1 score is 0.9304, the best among all algorithms, showing that the model excels in balancing precision and recall, making it the model with the best overall performance.
Overall, the TransferBAN-Syn model not only exhibits high prediction accuracy but also demonstrates excellent stability and robustness.
3.4 Ablation study
To investigate the impact of the transfer learning strategy, bilinear attention module, and disease feature representation on model performance, we conduct a series of ablation experiments testing various variants of the TransferBAN-Syn algorithm. (1) The original TransferBAN-Syn model; (2) A model that removes the bilinear attention module in favor of a self-attention module (w.o attention1); (3) A model that removes the attention module and directly predicts (w.o attention2); (4) A model without disease pathway information (w.o pathway); (5) A model excluding disease similarity information (w.o similarity); and (6) A model that does not utilize transfer learning from other parasitic disease information (w.o Transfer). The experimental results are shown in Figure 4.
Figure 4. Ablation study results for TransferBAN-Syn. The lines represent the positive error bars of the standard deviation.
The experimental results show that the original TransferBAN-Syn model performs the best. As shown in Figure 4, feature fusion with attention mechanisms, as opposed to direct concatenation of drug features, better captures the higher-order features of drug combinations. Compared to traditional self-attention networks, the bilinear attention module can more effectively capture the interactions between drug combinations, particularly the pairwise interaction information between their substructures, which may be the main reason for the superior performance of the original model. Moreover, the pathway information of diseases plays a significant role in providing accurate disease feature representations.
It is noteworthy that without using information from other parasitic diseases for transfer learning, the model struggles to be effectively trained with only echinococcosis data, indicating that leveraging information from other parasitic diseases for auxiliary training is effective and necessary in data-scarce scenarios of predicting drug combinations against echinococcosis. These findings emphasize the importance of each module in the model for enhancing predictive performance and confirm the effectiveness of integrating various sources of information and model structures in predicting drug combinations against echinococcosis. More importantly, through Table 3, we can see that the introduction of the transfer learning strategy in the results has the most significant improvement in experimental outcomes. In the TransferBAN-syn model, the introduction of transfer learning techniques significantly enhances the model’s predictive capability and stability. Specifically, we implemented a variant—referred to as w. o transfer1—that trains the model simultaneously on both the Echinococcosis data and the 21 parasitic diseases data, with the fine-tuning step removed. We also evaluated a model trained solely on Echinococcosis data without employing the transfer learning framework, referred to as w. o transfer2. From the experimental results presented in Figure 5, we observe that our proposed TransferBAN-Syn model outperforms both w. o transfer1 and w. o transfer2 in terms of predictive accuracy and stability. Specifically, while the w. o transfer1 shows some improvement over training solely on Echinococcosis data (w.o transfer2), it does not achieve the same level of performance as our transfer learning model. This indicates that simply combining data from Echinococcosis and other parasitic diseases without proper knowledge transfer is insufficient. This ablation study reveals the key finding that integrating information related to 21 types of parasitic diseases can significantly improve the performance of the echinococcosis prediction model, while the transfer learning strategy effectively utilizes data from 21 parasitic diseases similar to echinococcosis, further enhancing the precision of echinococcosis predictions. Therefore, the application of the transfer learning strategy can not only effectively mine deep information related to echinococcosis but also play an important role in improving the accuracy of drug combination predictions.
Figure 5. Efficacy analysis of Transfer Learning in TransferBAN-Syn The box plots show the median as the center lines, and the mean as the triangles.
3.5 Case study
Based on the performance evaluation experimental results of this study, the proposed the TransferBAN-syn model demonstrated significant superior performance in predicting echinococcosis drug combinations. To verify the model’s generalization ability, the work employs an independent validation set to assess the model’s predictive accuracy. Specifically, from the collected echinococcosis drug combination dataset, we select five drug combinations known to have synergistic effects and five drug combinations without synergistic effects, and set them as an independent test set to test the model’s discrimination ability. The remaining drug combination data are used to fine-tune the pre-trained general model, we retain the best-performing model parameters and tested them on the independent validation set. The test results, shown in Table 3, reveal that the trained model can not only accurately identify drug combinations with synergistic effects (positive samples) but also effectively distinguish drug combinations without synergistic effects (negative samples). These results strongly suggest that our model possesses high accuracy and discriminative ability in predicting the synergistic effects of drug combinations, providing powerful tools and methodological guidance for future research in this field.
In order to thoroughly investigate the application of the TransferBAN-syn model in predicting the efficacy of new drug combinations, this study devise an independent validation set approach to precisely evaluate the model’s capability to predict unknown drug combinations. The study reveal through external validation the effectiveness of the model in identifying drug combinations that potentially have therapeutic potential for alveolar echinococcosis (AE) and cystic echinococcosis (CE). All potential drug combinations are systematically arranged and screened from the database, excluding those known to have synergistic effects on AE or CE, thereby yielding a set of potential therapeutic combinations. The synergy probabilities of these combinations are calculated and ranked through the model, identifying ten most promising synergistic drug combinations for AE and CE, respectively, with the detailed lists shown in Tables 4, 5. We have conducted a literature-based validation of the predicted drug combinations. For example, the combination of Primaquine and Pyronaridine Tetraphosphate is supported by existing research, which shows that these drugs exhibit synergistic effects in the treatment of malaria. Primaquine generates reactive oxygen species that disrupt the parasite’s mitochondrial function, while Pyronaridine Tetraphosphate inhibits heme detoxification in the parasite (Stone et al., 2022). The complementary mechanisms of these drugs suggest that they could also have potential as a synergistic combination for treating echinococcosis. These findings not only provide important guidance for the design of future drug combination treatment schemes but also open new possibilities for the development and validation of new drugs, as well as offer new research directions and theoretical bases for subsequent biological experimental designs and echinococcosis treatment studies.
Table 4. Potential synergistic drug combinations for alveolar echinococcosis evaluated by TransferBAN-Syn.
Table 5. Potential synergistic drug combinations for cystic echinococcosis evaluated by TransferBAN-syn.
4 Discussion and conclusion
Echinococcosis, as a chronic and complex parasitic disease, poses a serious threat to human health. Pharmacotherapy is indispensable in the treatment of echinococcosis, with combination drug therapy demonstrating higher treatment efficacy and lower risk of drug resistance. However, given the unique complexity of echinococcosis and the scarcity of treatment drug combination data, discovering effective drug combinations becomes particularly challenging. In response to this challenge, the TransferBAN-Syn model propose in this paper adopts a transfer learning strategy, supplementing echinococcosis data with drug combination data from other parasitic diseases. This strategy not only enhances the model’s accuracy in predicting the synergistic effects of echinococcosis drug combinations but also improves the model’s generalizability. Furthermore, by constructing a drug combination dataset for 21 parasitic diseases, this paper further enriches the research foundation, providing valuable resources for subsequent drug discovery and evaluation. Additionally, the TransferBAN-Syn model effectively captures the complex interactions between drug molecules through deep graph neural networks and attention-based aggregation modules, thereby achieving accurate prediction of drug combination synergistic effects. Compared to five state-of-the-art traditional machine learning methods and deep learning models, the TransferBAN-Syn model has shown significant performance advantages, proving its potential application in the study of drug combinations against echinococcosis.
Future studies will aim to further refine the algorithm’s framework, reduce computational complexity, and explore more effective transfer learning strategies to better address differences among various parasitic diseases. Moreover, expanding the drug combination dataset to include a broader range of parasitic diseases and drug combinations will enhance the model’s generalizability and applicability. With these improvements, we hope to more accurately and efficiently predict and evaluate the synergistic effects of drugs against echinococcosis in the future, providing a more reliable scientific basis for the treatment of echinococcosis.
Data availability statement
The original contributions presented in the study are publicly available. This data can be found here: https://github.com/ahu-bioinf-lab/TransferBAN-Syn.
Author contributions
HL: Methodology, Writing–original draft, Writing–review and editing. YC: Investigation, Software, Visualization, Writing–original draft. LJ: Data curation, Writing–review and editing. LL: Conceptualization, Writing–review and editing. GL: Supervision, Validation, Writing–review and editing. YL: Supervision, Writing–review and editing. CZ: Supervision, Writing–review and editing. YS: Conceptualization, Funding acquisition, Supervision, Writing–original draft, Writing–review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was funded by the National Key Research and Development Program of China (2021YFE0102100); The University Synergy Innovation Program of Anhui Province (GXXT-2022-035); National Natural Science Foundation of China (62172002, 62202004, 62322301); Anhui Provincial Natural Science Foundation (2108085QF267, 2008085QF294); The University Outstanding Youth Research Project of Anhui Province (2022AH020010); The University Synergy Innovation Program of Anhui Province (No. GXXT-2021-039); The Project of Key Laboratory of Intelligent Computing and Signal Processing (Anhui University), Ministry of Education (2020A005). Tianshan Young Talent Scientific and Technological Innovation Team: Innovative Team for Research on Prevention and Treatment of High-incidence Diseases in Central Asia, Grant No. 2023TSYCTD0020 State Key Laboratory of Pathogenesis, Prevention and Treatment of High Incidence Diseases in Central Asia – Anhui University Work station Joint Fund Project, Grant No. SKL-HIDCA-2024-AH3.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1465368/full#supplementary-material
References
Alvi, M. A., Ali, R. M. A., Khan, S., Saqib, M., Qamar, W., Li, L., et al. (2023). Past and present of diagnosis of echinococcosis: a review (1999-2021). Acta Trop. 243, 106925. doi:10.1016/j.actatropica.2023.106925
Autier, B., Gottstein, B., Millon, L., Ramharter, M., Gruener, B., Bresson-Hadni, S., et al. (2023). Alveolar echinococcosis in immunocompromised hosts. Clin. Microbiol. Infect. 29, 593–599. doi:10.1016/j.cmi.2022.12.010
Casulli, A., Abela-Ridder, B., Petrone, D., Fabiani, M., Bobić, B., Carmena, D., et al. (2023). Unveiling the incidences and trends of the neglected zoonosis cystic echinococcosis in europe: a systematic review from the meme project. Lancet Infect. Dis. 23, e95–e107. doi:10.1016/S1473-3099(22)00638-7
Csermely, P., Korcsmáros, T., Kiss, H. J., London, G., and Nussinov, R. (2013). Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol. and Ther. 138, 333–408. doi:10.1016/j.pharmthera.2013.01.016
Güvenç Paltun, B., Kaski, S., and Mamitsuka, H. (2021). Machine learning approaches for drug combination therapies. Briefings Bioinforma. 22, bbab293. doi:10.1093/bib/bbab293
Haji Mohammadi, K., Heidarpour, M., and Borji, H. (2019). Allium sativum methanolic extract (garlic) improves therapeutic efficacy of albendazole against hydatid cyst: in vivo study. J. Investigative Surg. 32, 723–730. doi:10.1080/08941939.2018.1459967
Hogea, M. O., Ciomaga, B. F., Muntean, M. M., Muntean, A. A., Popa, M. I., and Popa, G. L. (2024). Cystic echinococcosis in the early 2020s: a review. Trop. Med. Infect. Dis. 9, 36. doi:10.3390/tropicalmed9020036
Jaaks, P., Coker, E. A., Vis, D. J., Edwards, O., Carpenter, E. F., Leto, S. M., et al. (2022). Effective drug combinations in breast, colon and pancreatic cancer cells. Nature 603, 166–173. doi:10.1038/s41586-022-04437-2
Janizek, J. D., Celik, S., and Lee, S.-I. (2018). Explainable machine learning prediction of synergistic drug combinations for precision cancer medicine. BioRxiv, 331769.
Li, H., Zou, L., Kowah, J. A., He, D., Wang, L., Yuan, M., et al. (2023). Predicting drug synergy and discovering new drug combinations based on a graph autoencoder and convolutional neural network. Interdiscip. Sci. Comput. Life Sci. 15, 316–330. doi:10.1007/s12539-023-00558-y
Li, M., Zhou, J., Hu, J., Fan, W., Zhang, Y., Gu, Y., et al. (2021). Dgl-lifesci: an open-source toolkit for deep learning on graphs in life science. ACS omega 6, 27233–27238. doi:10.1021/acsomega.1c04017
Liu, H., Fan, Z., Lin, J., Yang, Y., Ran, T., and Chen, H. (2023). The recent progress of deep-learning-based in silico prediction of drug combination. Drug Discov. Today 28, 103625. doi:10.1016/j.drudis.2023.103625
Liu, Q., and Xie, L. (2021). Transynergy: mechanism-driven interpretable deep neural network for the synergistic prediction and pathway deconvolution of drug combinations. PLoS Comput. Biol. 17, e1008653. doi:10.1371/journal.pcbi.1008653
Loos, J. A., Negro, P., and Cumino, A. C. (2020). In vitro anti-echinococcal activity of octreotide: additive effect of metformin linked to autophagy. Acta trop. 203, 105312. doi:10.1016/j.actatropica.2019.105312
Meng, Y., Ren, Q., Xiao, J., Sun, H., Huang, Y., Liu, Y., et al. (2023). Progress of research on the diagnosis and treatment of bone cystic echinococcosis. Front. Microbiol. 14, 1273870. doi:10.3389/fmicb.2023.1273870
O’Neil, J., Benita, Y., Feldman, I., Chenard, M., Roberts, B., Liu, Y., et al. (2016). An unbiased oncology compound screen to identify novel combination strategies. Mol. cancer Ther. 15, 1155–1162. doi:10.1158/1535-7163.MCT-15-0843
Parati, G., Kjeldsen, S., Coca, A., Cushman, W. C., and Wang, J. (2021). Adherence to single-pill versus free-equivalent combination therapy in hypertension: a systematic review and meta-analysis. Hypertension 77, 692–705. doi:10.1161/HYPERTENSIONAHA.120.15781
Preuer, K., Lewis, R. P., Hochreiter, S., Bender, A., Bulusu, K. C., and Klambauer, G. (2018). Deepsynergy: predicting anti-cancer drug synergy with deep learning. Bioinformatics 34, 1538–1546. doi:10.1093/bioinformatics/btx806
Qing, C., Chuanchuan, L., Caixia, L., Boen, Z., and Haining, F. (2023). Traditional Chinese medicine for treatment of echinococcosis: a review. Chin. J. Schistosomiasis Control 35, 398. doi:10.16250/j.32.1374.202266
Rani, P., Dutta, K., and Kumar, V. (2022). Artificial intelligence techniques for prediction of drug synergy in malignant diseases: past, present, and future. Comput. Biol. Med. 144, 105334. doi:10.1016/j.compbiomed.2022.105334
Rappaport, N., Nativ, N., Stelzer, G., Twik, M., Guan-Golan, Y., Iny Stein, T., et al. (2013). Malacards: an integrated compendium for diseases and their annotation. Database 2013, 2013. doi:10.1093/database/bat018
Sarmah, D., Meredith, W. O., Weber, I. K., Price, M. R., and Birtwistle, M. R. (2023). Predicting anti-cancer drug combination responses with a temporal cell state network model. PLoS Comput. Biol. 19, e1011082. doi:10.1371/journal.pcbi.1011082
Stone, W., Mahamar, A., Sanogo, K., Sinaba, Y., Niambele, S. M., Sacko, A., et al. (2022). Pyronaridine–artesunate or dihydroartemisinin–piperaquine combined with single low-dose primaquine to prevent plasmodium falciparum malaria transmission in ouélessébougou, Mali: a four-arm, single-blind, phase 2/3, randomised trial. Lancet Microbe 3, e41–e51. doi:10.1016/S2666-5247(21)00192-0
Wang, S., Ma, Y., Wang, W., Dai, Y., Sun, H., Li, J., et al. (2022). Status and prospect of novel treatment options toward alveolar and cystic echinococcosis. Acta Trop. 226, 106252. doi:10.1016/j.actatropica.2021.106252
Wang, T., Wang, R., and Wei, L. (2023). Attensyn: an attention-based deep graph neural network for anticancer synergistic drug combination prediction. J. Chem. Inf. Model. 64, 2854–2862. doi:10.1021/acs.jcim.3c00709
Wen, H., Vuitton, L., Tuxun, T., Li, J., Vuitton, D. A., Zhang, W., et al. (2019). Echinococcosis: advances in the 21st century. Clin. Microbiol. Rev. 32, e00075. doi:10.1128/CMR.00075-18
Keywords: echinococcosis, drug combination, transfer learning, synergistic drug combinations, parasitic diseases
Citation: Li H, Chu Y, Jiang L, Li L, Lv G, Liu Y, Zheng C and Su Y (2025) TransferBAN-Syn: a transfer learning-based algorithm for predicting synergistic drug combinations against echinococcosis. Front. Genet. 15:1465368. doi: 10.3389/fgene.2024.1465368
Received: 16 July 2024; Accepted: 06 December 2024;
Published: 06 January 2025.
Edited by:
Quan Zou, University of Electronic Science and Technology of China, ChinaReviewed by:
Xiaoqiang Sun, Sun Yat-sen University, ChinaQiu Xiao, Hunan Normal University, China
Fangping Wan, University of Pennsylvania, United States
Zhenyu Yue, Anhui Agricultural University, China
Copyright © 2025 Li, Chu, Jiang, Li, Lv, Liu, Zheng and Su. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yansen Su, c3V5YW5zZW5AYWh1LmVkdS5jbg==