Skip to main content

ORIGINAL RESEARCH article

Front. Microbiol., 22 January 2025
Sec. Systems Microbiology

BANNMDA: a computational model for predicting potential microbe–drug associations based on bilinear attention networks and nuclear norm minimization

Mingmin LiangMingmin Liang1Xianzhi LiuXianzhi Liu2Juncai Li
Juncai Li2*Qijia Chen
Qijia Chen2*Bin ZengBin Zeng2Zhong WangZhong Wang3Jing LiJing Li2Lei WangLei Wang4
  • 1School of Intelligent Equipment, Hunan Vocational College of Electronic and Technology, Changsha, China
  • 2School of Information Engineering, Hunan Vocational College of Electronic and Technology, Changsha, China
  • 3School of Humanities and Education, Hunan Vocational College of Electronic and Technology, Changsha, China
  • 4Big Data Innovation and Entrepreneurship Education Center of Hunan Province, Changsha University, Changsha, China

Introduction: Predicting potential associations between microbes and drugs is crucial for advancing pharmaceutical research and development. In this manuscript, we introduced an innovative computational model named BANNMDA by integrating Bilinear Attention Networks(BAN) with the Nuclear Norm Minimization (NNM) to uncover hidden connections between microbes and drugs.

Methods: In BANNMDA, we initially constructed a heterogeneous microbe-drug network by combining multiple drug and microbe similarity metrics with known microbe-drug relationships. Subsequently, we applied both BAN and NNM to compute predicted scores of potential microbe-drug associations. Finally, we implemented 5-fold cross-validation frameworks to evaluate the prediction performance of BANNMDA.

Results and discussion: The experimental results indicated that BANNMDA outperformed state-of-the-art competitive methods. We conducted case studies on well-known drugs such as the Amoxicillin and Ceftazidime, as well as on pathogens such as Bacillus cereus and Influenza A virus, to further evaluate the efficacy of BANNMDA, and experimental outcomes showed that there were 9 out of the top 10 predicted drugs, along with 8 and 9 out of the top 10 predicted microbes having been corroborated by relevant literatures. These findings underscored the capability of BANNMDA to achieve commendable predictive accuracy.

Introduction

Microorganisms are tiny, structurally simple, and widely distributed organisms, including bacteria, viruses, and fungi. They are closely related to human health, offering both benefits and potential risks (Human Microbiome Project Consortium, 2012; Cheng et al., 2020). Various organs of the human body are inhabited by them and are even covered by them (Gill et al., 2006). These microorganisms play a role not only in promoting the absorption of food and maintaining intestinal health but also in effectively regulating the host’s mucosal and systemic immune systems by adjusting the balance of the gut microbiota (Ventura et al., 2009; Sommer and Bäckhed, 2013). In the intestinal environment, these microorganisms are interdependent and mutually beneficial. When the balance of the gut microbiota is disrupted, it can lead to a variety of diseases, including obesity (Ley et al., 2006), inflammatory bowel disease (Durack and Lynch, 2019), and cancer (Schwabe and Jobin, 2013). In addition, a multitude of studies have confirmed that there is a significant interaction between microorganisms and drugs during the drug treatment process (Human Microbiome Project Consortium, 2012; McCoubrey et al., 2022; Zhang et al., 2023). Therefore, a deep understanding of the relationship between microorganisms and drugs is crucial for the effective treatment of diseases.

Through in-depth biological research, humanity has uncovered key connections between drugs and microbes. However, biological experiments often require a significant investment of human resources, materials, and time, which may limit further in-depth research. To overcome the limitations of biological studies, the application of computational methods has been increasing in recent years, driven by the rapid development of related research tools. These computational methods are dedicated to predicting the interactions between drugs and microbes (Wang et al., 2022). Concurrently, databases of microbe–drug associations that have been experimentally validated, such as MDAD (Sun et al., 2018) (Doi: figshare.com/articles/dataset/MDAD__/24798456) and aBiofilm (Rajput et al., 2018) (Doi: figshare.com/articles/dataset/aBiofilm_dataset/28045016), have also been established, providing valuable data resources for research. For instance, Zhu et al. (2022) have introduced NNAN, a method that utilizes a nearest-neighbor information aggregator and a feature attention module to identify correlations between microbes and drugs. Deng et al. (2022) have proposed a new method, Graph2MDA, which utilizes a Variational Graph Auto-Encoder (VGAE) to predict associations between microbes and drugs. In an effort to infer novel relationships between microbes and drugs, Yang et al. (2022) have proposed a multi-kernel fusion model based on Graph Convolutional Networks (GCN), known as MKGNN. Tian et al. (2023) have crafted a contrastive learning model for predicting connections between microbes and drugs, called SCSMDA. Tan et al. (2022) have developed a computational technique based on graph attention networks and sparse autoencoders for predicting potential microbe–drug correlations, named GSAMDA. Ma et al. (2023) have developed a predictive model for microbe–drug interactions that integrate the capabilities of Graph Attention Networks (GAT) with the image-processing prowess of Convolutional Neural Networks (CNN).

Inspired by Liu et al. (2023) and Bai et al. (2023), we designed a novel prediction model called BANNMDA based on the bilinear attention network and kernel norm minimization to accurately infer potential associations between microorganisms and drugs. As illustrated in Figure 1, the principal contributions of BANNMDA include:

• A novel heterogeneous microbe–drug network H was established by amalgamating the microbe similarity network, the drug similarity network, and known associations between microbes and drugs.

• To forecast potential microbe–drug association scores more accurately, we would first use a BAN-based autoencoder alongside the nuclear norm minimization technique on N to calculate two predicted scores for potential microbe–drug associations, respectively. Then, we would further combine these two predicted scores through a weighted average to derive the conclusive outcomes.

Figure 1
www.frontiersin.org

Figure 1. Overall structure diagram of BANNMDA. (A) The heterogeneous microbe–drug network was established by amalgamating the microbe similarity network, the drug similarity network, and known associations between microbes and drugs. (B) Predicting potential microbe–drug associations by BAN. (C) Predicting potential microbe–drug associations by NNM. (D) Predicting the final scores of potential microbe–drug associations.

Materials and methods

Data sources

To assess the predictive performance of the BANNMDA model, we selected the MDAD dataset. The MDAD dataset, compiled by Sun et al. (2018), is an extensive compilation of microbe–drug associations. It was sourced from various drug databases such as TTD and DrugBank, along with extensive literature, resulting in a database of 1,373 drugs and 173 microbes connected by 2,470 associations after removing redundant entries. Table 1 provides specific statistical data for the MDAD dataset.

Table 1
www.frontiersin.org

Table 1. Specific statistical data for MDAD datasets.

Methods

Microbe–drug adjacency matrix

An adjacency matrix, designated as A R n y × n w , is initially constructed. This matrix captures the interactions between a set of drugs (denoted by n y ) and microbes (denoted by n w ). The matrix is populated such that each entry is marked 1 if a relationship is established between a specific drug y i and a particular microbe w j , and 0 otherwise. As the Equation 1 shown.

A i j     (1) = { 1 , if y i associats with w j 0 , otherwise

Microbe/drug Gaussian kernel similarity

The Gaussian kernel similarity is calculated by using the Gaussian kernel function, which is a widely used kernel function for measuring the similarity between elements. In the field of microbe–drug association prediction, the Gaussian kernel similarity is one of the most popular methods for measuring similarity between microbes and drugs, which is based on the assumption that two similar microbes will exhibit similar interactive and non-interactive relationships with the same drug.

The Gaussian kernel similarity Y G I P y i y j R n y × n y between drugs y i and y j , can be calculated by using the Equation 2:

Y G I P = exp γ y A y i A y j 2     (2)

Certainly, let us clarify the role of A y i A y j in the context of Gaussian kernel similarity, particularly as it pertains to the Euclidean distance between two drugs. The parameter γ y plays a crucial role in determining the influence of the distance between feature points. Equation 3 shows how it works:

γ y = 1 / 1 n y i = 1 n y A y i 2     (3)

The Gaussian kernel similarity W G I P w i w j R n w × n w can be similarly applied to measure the similarity between microbes. Equations 4, Equation 5 show how to get the Gaussian kernel similarity:

W G I P = exp γ w A w i A w j 2     (4)
γ w = 1 / 1 n w i = 1 n w A w i 2     (5)

Microbe/drug functional similarity

The microbe functional similarity is determined by leveraging the Kamneva tool (Kamneva, 2017), which is grounded in the analysis of microbial gene families. The process begins with the construction of a microbial protein–protein functional association network using the comprehensive STRING (Szklarczyk et al., 2019) dataset, which provides a rich collection of gene functional networks related to microbes. In this network, nodes represent gene families encoded by the genome, and edges signify genetic neighborhood scores. To evaluate the functional similarities between microbes, a matrix W F R n w × n w is crafted using the Kamneva tool, which calculates the similarity by comparing the score of the edges between two microbes to the sum of all link scores corresponding to their microbial gene families.

Furthermore, the SIMCOMP (Hattori et al., 2010) tool harnesses the chemical structures and molecular formulas of drugs to quantify their structural similarity. The core of this method is to realize the automated matching of nodes and edges across two chemical structure diagrams by software algorithms. By identifying the most extensive common substructure, this method can assess and calculate the similarities between different drug frameworks. Based on this method, a drug functional similarity matrix Y F R n y × n y can be constructed.

Microbe/drug integrated similarities

It is essential to acknowledge that not all microbes can be effectively compared in terms of functional similarity. To address this, we have utilized both the structural similarity and the Gaussian kernel similarity of microbes. By combining these metrics, we have successfully created a novel matrix W R n w × n w by using Equation 6. This integrated matrix provides a more comprehensive and nuanced assessment of microbe similarities, offering valuable insights into their complex relationships.

W = { W G I P + W F / 2 , if W F 0 W G I P , if W F = 0     (6)

Similarly, the drug matrix can be obtained as Equation 7:

Y = { Y G I P + Y F / 2 , if Y F 0 Y G I P , if Y F = 0     (7)

Constructing the heterogeneous network H

By integrating the microbe–drug adjacency matrix with the drug functional similarity matrix and the microbe functional similarity matrix, we have constructed a unified matrix H R n y + n w × n y + n w .

H = Y A A T W     (8)

where A T represents A s transposition. As Equation 8 shows.

Predicting potential microbe–drug associations by BANs

Bilinear attention networks (BANs) are composed of a model proposed by Kim et al. (2018). The central component of BANs is the bilinear attention mechanism, which was initially designed to learn the distribution of attention by taking into account the bilinear interactions between the input channels.

In BANs, two pivotal technologies are used to enhance the interaction of features and manage intricate data relationships: bilinear transformation and attention mechanism. The bilinear transformation uses a weight matrix and an additive bias to process input features. It excels at revealing the nuanced relationships within complex datasets, providing a robust framework for analyzing interactions. The attention mechanism is a fundamental technique in neural networks, designed to improve the model’s focus on specific aspects of the input data. In the context of BANs, this focus is achieved through the application of bilinear transformations. These transformations provide a more adaptable way to adjust the weights associated with different features, thereby enhancing the model’s ability to prioritize relevant information within the data. Its formula can be expressed as:

z = h T Wh + b     (9)

In the above Equation 9, h is the input vector of BANs, W is a trainable weight matrix, b is the bias term, and z is the output vector of BANs.

The forward propagation process of BANs is as Equation 10:

x = W 1 h + b 1 x Relu = Relu z = x Relu T W x Relu + b q = W 2 z + b 2     (10)

where W 1 is the weight matrix of the first fully connected layer, W 2 is the weight matrix of the classification layer, b 2 is the weight matrix of the classification layer, q is the final output of the model, Relu is the activation function, defined as shown in Equation 11 and x Relu is the feature vector processed by the Relu activation function.

Relu h = { h , h > 0 0 , otherwise     (11)

Incorporating the BANs into predictive models enables a more nuanced capture of both the local features and the overarching structure of the data. This enhanced understanding, in turn, bolsters the model’s capacity for representation and elevates its predictive accuracy.

Obviously, after inputting H into the BANs, a low-dimensional matrix Q = Q y Q w R n y + n w × l can be derived, in which, the indices Q y and Q w represent the drug nodes and microbial nodes, respectively.

Thereafter, by integrating the drug matrix Q y , Q w with W F , and A separately inspired by Xuan et al., 2020, it is easy to see that we can construct a new drug feature matrix R y and a new microbe feature matrix R w as Equations 12, 13:

R y = Q y Y F A     (12)
R w = Q w W F A T     (13)

Finally, based on R y and R y , we can obtain predicted scores for any given microbe w j and drug y i as follows:

M i j = Relu R y y i . R w w j T     (14)

Hence, based on the above Equation 14, we can obtain a novel matrix M1 = [ M i j ].

Predicting potential microbe–drug associations by NNM

The kernel norm, alternatively referred to as the Schatten p-norm, is a matrix norm characterized by its reliance on the singular values of the matrix in question (Recht et al., 2010). This concept is pivotal in the field of optimization, particularly in the context of kernel norm minimization (Candès and Recht, 2012). The essence of this technique lies in reducing the kernel norm of a matrix, which is essentially the aggregate of its singular values. By doing so, it becomes feasible to approximate solutions for matrices that exhibit low-rank properties.

In BANNMDA, we define the kernel norm of the prediction matrix E as Equation 15:

E = i = 1 min m n σ i E     (15)

where E denotes the nuclear norm of the matrix E , σ i E is the i-th largest singular value of the matrix E , and m and n are the number of rows and columns of the matrix E , respectively.

The objective of minimizing the nuclear norm is to identify a matrix E that achieves the lowest possible nuclear norm value, subject to fulfilling specific constraints. The optimization problem can be mathematically formulated as Equation 16:

min E E subject to E i j = A i j , i j Ω     (16)

Consider Ω as a set that encompasses the known positions of the elements. To ensure that the prediction results fall within the range of 0 to 1 and to enhance the model’s robustness against noise in the data, we impose the following constraints on the model, as Equation 17 shows:

min E E subject to σ Ω E σ Ω H < ς     (17)

In this context, ς denotes the measurement noise, which accounts for the random variations or inaccuracies in the data. Meanwhile, σ Ω signifies an orthogonal mapping that is applied to Ω . Subsequently, we replace the inequality-constrained models with regularized ones.

min E E + ω 2 σ Ω E σ Ω H F 2     (18)

where ω is the regularization parameter. Inspired by Huttner et al. (2020), we use enhanced Lagrangian functions and the alternating direction method of multipliers (ADMMs) to address optimization problems that incorporate equality constraints. Equation 18 can be rewritten into the following form:

Ι γ E Y Z = min E E + ω 2 σ Ω E σ Ω H F 2 + γ 2 E Z F 2 + Y , E ZKAY     (19)

where Y is an introduced auxiliary variable, Z is the Lagrange multiplier matrix, and γ > 0 is the penalty parameter. The ADMM algorithm can solve E , Y , and Z iteratively, and in each round of iteration, there are the following three steps:

E k + 1 = argmin ω 2 σ Ω E σ Ω H F 2 + γ 2 E Z F 2 + Y , E Z     (20)
Y k + 1 = argmin E + γ 2 Y E k + 1 + 1 γ Y k F 2     (21)
Z k + 1 = Z k + γ E k + 1 Y k + 1     (22)

Obviously, based on the above Equations 2022, after k rounds of iteration, we can finally obtain a convergent matrix E , in which, the unknown values in A have been completed.

Calculating the final predicted scores of potential microbe–drug associations

In this section, we will use a weighted average approach to amalgamate the outcomes of the two prediction models. This method assigns different weights to each prediction, reflecting their relative importance or reliability. By doing so, we can create a composite forecast that leverages the strengths of both models while potentially mitigating the weaknesses of either.

The final microbe–drug associations prediction score matrix M is calculated as follows:

M = α M 1 + 1 α E     (23)

where 0 α 1 is the weight value.

Model evaluation method

To enhance the model’s generalization capability and robustness, and to ensure the stability and reliability of performance evaluation, we implemented a five-fold cross-validation to assess the model’s predictive performance. Initially, we randomly selected 80% of the recognized and unrecognized associations from the dataset as the training dataset, while the remaining 20% was the independent testing dataset. Subsequently, we further randomly divided the training dataset, which was derived from the full dataset, into five equally sized subsets to facilitate the five-fold cross-validation. By utilizing the MDAD dataset, we performed five separate cross-validations, while ensuring that each trial was conducted independently. Upon the completion of the five-fold cross-validation, the model’s performance was assessed across various subsets of the training set. Ultimately, we used the pre-allocated independent test set to evaluate the model’s final performance.

Experiments and results

In this section, we first conducted a sensitivity analysis of key parameters to optimize the model’s performance. Then, we selected six leading-edge methods for comparison with BANNMDA. To further validate the reliability of our model, we specifically chose two representative microbes and drugs for testing.

Parameter sensitivity analysis

Considering the actual conditions of the model, we identified and analyzed four parameters that significantly impact the final predictive outcomes. In this context, within the BANs, dimension l emerges as a pivotal parameter. Within the NNM, parameters ω and γ specified in Equation 19 hold significant importance. In Equation 23, parameter α stands out as another crucial element. In this part, we aimed to identify optimal settings and maintain the separation of our training and testing datasets. In BANs, we resolved to modify the dimensionality parameter l , which was initially derived from the set 4 8 16 32 64 . Subsequently, using a five-fold cross-validation (CV) approach, we assessed the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (AUPR) for the parameter configuration. The results are presented in Figure 2A.

Figure 2
www.frontiersin.org

Figure 2. AUC and AUPR values on different parameter sensitivity analysis.

In NNM, we opted to perform comprehensive tests by adjusting parameters ω and γ , derived from 1 10 100 1000 , and carried out integrated experiments. The results are presented in Figure 2B.

Ultimately, the outcomes are presented in Figure 2C, which illustrates the influence of parameter α in Equation 23 after its modification from 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 within the context of a five-fold CV on the MDAD dataset. The parameter analysis is depicted in Figure 2. As illustrated by the data in Figure 2, the optimal model performance is attained when the parameters are configured as follows: ω  = 10, γ  = 1, l  = 8, and α  = 0.75.

Comparison with advanced methods

To enhance the validation of BANNMDA’s predictive capabilities, this section presents a comparative evaluation against six notable and competitive methods. During experiments, we adopted the original parameters of each competing method and executed all competitive methods using the same five-fold cross-validation approach on the MDAD dataset to ensure a fair and consistent comparison.

• HMDAKATZ (Zhu et al., 2019): The method harnesses the KATZ algorithm as its foundation to predict associations between microbes and drugs.

• SCSMDA (Tian et al., 2023): This approach uses a structure-enhanced contrastive learning technique coupled with a self-paced negative sampling strategy to forecast associations between microbes and drugs.

• GSAMDA (Tan et al., 2022): This model utilizes graph attention networks and sparse autoencoders to provide a new approach for predicting potential microbial drug interactions.

• GACNNMDA (Ma et al., 2023): Incorporating graph attention networks alongside CNN binary classifiers, this model pioneers a novel predictive framework for identifying potential microbial drug interactions

• GARFMDA (Kuang et al., 2024): This model deduces potential associations between microbes and drugs through an integration of graph attention networks and a dual-layer random forest architecture.

• MDASAE (Fan et al., 2023): This model uses a stacked autoencoder along with a multi-head attention mechanism to extract and understand the complex association system between microbes and drugs.

We performed an assessment of these techniques with their default parameters and measured their performance via a five-fold CV process. The efficacy of the introduced BANNMDA model was evaluated using the AUC, AUPR, accuracy, and F1-score metrics, utilizing the MDAD dataset. The findings are detailed in Table 2 and Figure 3, showcasing the BANNMDA model’s exceptional predictive accuracy, surpassing the other evaluated approaches.

Table 2
www.frontiersin.org

Table 2. Results of the compared methods.

Figure 3
www.frontiersin.org

Figure 3. AUC and AUPR curves of six competitive methods based on the MDAD dataset.

As shown in Table 2, our model excelled in three of the four assessment criteria, with only a slight lag behind the SCSMDA model in the AUPR metric. The lower AUPR value compared to the SCSMDA method may be attributed to the SCSMDA method’s use of a self-paced negative sampling strategy, which adeptly selects negative samples that are richest in information content for training purposes. This approach is particularly effective in addressing imbalanced datasets and consequently elevates the AUPR values. Consequently, BANNMDA stands out as a highly effective predictive tool.

Case study

To rigorously evaluate the predictive capabilities of the BANNMDA model, we selected two renowned drugs—amoxicillin and ceftazidime—as well as two prevalent microbes—Bacillus cereus and influenza A virus—for our case studies.

Amoxicillin (Huttner et al., 2020), classified within the penicillin family of antimicrobials, has been the subject of numerous studies that have demonstrated its association with the activity against Bacillus subtilis (Matei-Lațiu et al., 2023), Clostridium perfringens (Sárvári et al., 2022), and Listeria monocytogenes (Sixt et al., 2024). Based on the predictive scores, the microbes related to amoxicillin were ranked in descending order of their scores. After excluding the three associations already present in the MDAD dataset, the top 10 microbes were selected for further validation. As shown in Table 3, of the top 10 predicted microbes associated with amoxicillin, nine have been confirmed by existing research indexed in PubMed. For instance, Dewachter et al. (2022) confirms that amoxicillin has antibacterial effects against Streptococcus pneumoniae, while Gómez-Sánchez et al. (2023) establishes the association between amoxicillin and Staphylococcus aureus.

Table 3
www.frontiersin.org

Table 3. Top 10 amoxicillin-associated candidate microbes on MDAD.

Cefotaxime is a potent aminothiazolyl cephalosporin antibiotic, renowned for its efficacy against a spectrum of Gram-negative bacteria (Gentry, 1985). Multiple research studies have highlighted the effectiveness of cefotaxime, showing its association with combating infections caused by Pseudomonas aeruginosa (Wang et al., 2023), Escherichia coli (Feng et al., 2021), Streptococcus pneumoniae (Ataee et al., 2014), and various other pathogens. As detailed in Table 4, following the exclusion of seven known associations recorded in the MDAD dataset, we identified nine microbes from the top 10 predicted cefotaxime-associated microbes that have been substantiated by PubMed-indexed literature. For instance, Awad et al. (2014) examined the relationship between cefotaxime and Staphylococcus aureus.

Table 4
www.frontiersin.org

Table 4. Top 10 cefotaxime-associated candidate microbes on MDAD.

Bacillus cereus, a Gram-positive bacterium characterized by its rod-shaped structure and beta-hemolytic activity, is frequently detected in soil and food products. This organism is notorious for its role in foodborne illnesses, particularly the “fried rice syndrome,” a form of food poisoning (Leong et al., 2023). Based on the pertinent literature, there is confirmation of associations between Bacillus cereus and various substances, including copper sulfate (Arokiyaraj et al., 2019) and silver nitrate (Babu et al., 2011). Upon the exclusion of three known associations recorded in the MDAD dataset, an analysis of the top 10 predicted drugs associated with Bacillus cereus identified 8 that have been confirmed by studies indexed in PubMed, as presented in Table 5. Park et al. (2022) elucidates the association between Bacillus cereus and rifampicin through an investigation into the prevalence and traits of toxin-producing Bacillus cereus strains isolated from low-moisture foods.

Table 5
www.frontiersin.org

Table 5. Top 10 Bacillus cereus-associated candidate drugs on MDAD.

Influenza A virus is a member of the Orthomyxoviridae family, renowned for its significant pathogenic potential in humans (Nypaver et al., 2021). Existing scholarly studies have documented associations between the influenza A virus and a range of pharmaceuticals, including ribavirin (Ayari et al., 2021), zanamivir (Lee et al., 2022), oseltamivir (Ormond et al., 2017), and others, highlighting their potential roles in treatment strategies. Upon the exclusion of five known associations from the MDAD dataset, Table 6 reveals that nine out of the top 10 candidate drugs identified were correlated with the influenza A virus, underscoring a significant connection. For instance Li et al. (2022) highlights the significant role of curcumin in inhibiting the influenza A virus.

Table 6
www.frontiersin.org

Table 6. Top 10 influenza A virus-associated candidate drugs on MDAD.

In summary, these pairs of case studies provide additional evidence of the BANNMDA model’s capability to predict potential associations between microbes and drugs.

Discussion

The linkage between drugs and microbes is of pivotal significance in the therapeutic realm of disease management, as emphasized by biomedical inquiries. Therefore, the advent of a sophisticated computational model for predictive models can significantly bolster the discovery of novel microbe–drug associations, optimizing treatment modalities for a spectrum of diseases.

In this study, we introduced a novel model BANNMDA by integrating the BANs and NNM to detect potential associations between microbes and drugs. The BANNMDA model was initiated by amalgamating the drug similarity network with the extant microbe–drug associations, alongside the similarity and association data between the nodes, to construct a novel heterogeneous network for microbes and drugs. Subsequently, the model leveraged both the BANs and the NNM to prognosticate the correlation scores between these microbes and drugs. To derive the predictive outcomes, these two forecasted scores were averaged with assigned weights. The empirical results demonstrated that BANNMDA surpassed contemporary methodologies and yielded satisfactory results in case study evaluations.

Although the BANNMDA model offered commendable predictive performance, there was still room for improvement. Notably, the BAN component of the model, while proficient in assimilating diverse information across heterogeneous networks, has demonstrated limitations in capturing the subtleties of local neighborhood information. This limitation is crucial as local neighborhood information is pivotal for understanding the intricate relationships within complex networks. The BAN model’s limitation in this area may be due to its inability to fully explore the nuanced interactions between nodes and their immediate surroundings, which is required for accurate predictions in network-based tasks. To address this, integrating BANs with graph convolutional networks (GCNs) could be a strategic approach. GCNs are particularly adept at leveraging local neighborhood information by aggregating features from neighboring nodes, which can significantly enhance the model’s representational capabilities. This fusion would allow for a more comprehensive understanding of the network’s structure and the relationships between nodes, leading to improved predictive performance.

Furthermore, to elevate the precision of the model’s forecasts, the incorporation of an expanded array of biological data was suggested. This enrichment would involve incorporating comprehensive data on drug side effects, elucidating the ties between bacterial strains and diseases, and exploring the linkages between pharmaceuticals and disease pathology. By doing so, the model gains a more intricate and detailed understanding of drugs and microorganisms, thereby improving the accuracy of its predictions.

Conclusion

In conclusion, the BANNMDA model presents a significant advancement in the field of computational prediction of microbe–drug associations. It has demonstrated superior performance compared to existing methods, as evidenced by its successful application in case study evaluations. However, the model’s predictive capabilities can be further enhanced by integrating graph convolutional networks (GCNs) to better capture local neighborhood information and by expanding the scope of biological data considered. This would provide a more nuanced understanding of the complex interactions between drugs and microbes, ultimately leading to more accurate predictions and a deeper insight into disease management. The future incorporation of these enhancements is anticipated to propel the BANNMDA model to new heights in its predictive accuracy and applicability in therapeutic strategies.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding authors.

Author contributions

ML: Funding acquisition, Writing – original draft, Writing – review & editing. XL: Funding acquisition, Writing – original draft, Writing – review & editing. JuL: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. QC: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. BZ: Writing – review & editing. ZW: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. JiL: Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. LW: Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was partly sponsored by the National Natural Science Foundation of China (No. 62272064), the Natural Science Foundation of Hunan Province (No. 2023JJ60185), Scientific Research Project of Hunan Provincial Department of Education (Nos. 23C0543 and 23C0544).

Acknowledgments

The authors thank the referees for suggestions that helped improve the paper substantially.

Conflict of interest

The authors declare that the research was conducted without any commercial or financial relationships that could be construed as potential conflicts of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Arokiyaraj, S., Varghese, R., Ali, A. B., Duraipandiyan, V., and Al-Dhabi, N. (2019). Optimizing the fermentation conditions and enhanced production of keratinase from Bacillus cereus isolated from halophilic environment. Saudi. J. Biol. Sci. 26, 378–381. doi: 10.1016/j.sjbs.2018.10.011

PubMed Abstract | Crossref Full Text | Google Scholar

Ataee, R. A., Habibian, S., Mehrabi-Tavana, A., Ahmadi, Z., Jonaidi, N., and Salesi, M. (2014). Determination of vancomycin minimum inhibitory concentration for ceftazidime resistant Streptococcus pneumoniae in Iran. Ann. Clin. Microbiol. Antimicrob. 13:53. doi: 10.1186/s12941-014-0053-1

PubMed Abstract | Crossref Full Text | Google Scholar

Awad, S. S., Rodriguez, A. H., Chuang, Y. C., Marjanek, Z., Pareigis, A. J., Reis, G., et al. (2014). A phase 3 randomized double-blind comparison of ceftobiprole medocaril versus ceftazidime plus linezolid for the treatment of hospital-acquired pneumonia. Clin. Infect. Dis. 59, 51–61. doi: 10.1093/cid/ciu219

PubMed Abstract | Crossref Full Text | Google Scholar

Ayari, M., Favetta, P., Warszycki, D., Vasseur, V., Hervé, V., Degardin, P., et al. (2021). Molecularly imprinted hydrogels selective to ribavirin as new drug delivery systems to improve efficiency of antiviral nucleoside analogue: a proof-of-concept study with influenza A virus. Macromol. Biosci. 22:e2100291. doi: 10.1002/mabi.202100291

PubMed Abstract | Crossref Full Text | Google Scholar

Babu, M. M., Sridhar, J., and Gunasekaran, P. (2011). Global transcriptome analysis of Bacillus cereus ATCC 14579 in response to silver nitrate stress. J. Nanobiotechnology 9:49. doi: 10.1186/1477-3155-9-49

PubMed Abstract | Crossref Full Text | Google Scholar

Bai, P., Miljković, F., John, B., and Lu, H. (2023). Interpretable bilinear attention network with domain adaptation improves drug-target prediction. Nat. Mach. Intell. 5, 126–136. doi: 10.1038/s42256-022-00605-1

PubMed Abstract | Crossref Full Text | Google Scholar

Candès, E., and Recht, B. (2012). Simple bounds for recovering low-complexity models. Math. Program. 141, 577–589. doi: 10.1007/s10107-012-0540-0

PubMed Abstract | Crossref Full Text | Google Scholar

Cheng, L., Qi, C., Zhuang, H., Fu, T., and Zhang, X. (2020). gutMDisorder: a comprehensive database for dysbiosis of the gut microbiota in disorders and interventions. Nucleic Acids Res. 48, D554–D560. doi: 10.1093/nar/gkz843

PubMed Abstract | Crossref Full Text | Google Scholar

Deng, L., Huang, Y., Liu, X., and Hui, L. (2022). Graph2MDA: a multi-modal variational graph embedding model for predicting microbe–drug associations. Bioinformatics 38, 1118–1125. doi: 10.1093/bioinformatics/btab792

PubMed Abstract | Crossref Full Text | Google Scholar

Dewachter, L., Dénéréaz, J., Liu, X., de Bakker, V., Costa, C., Baldry, M., et al. (2022). Amoxicillin-resistant Streptococcus pneumoniae can be resensitized by targeting the mevalonate pathway as indicated by sCRilecs-seq. eLife 11:e75607. doi: 10.7554/eLife.75607

PubMed Abstract | Crossref Full Text | Google Scholar

Durack, J., and Lynch, S. V. (2019). The gut microbiome: relationships with disease and opportunities for therapy. J Exp Med 216, 20–40. doi: 10.1084/jem.20180448

PubMed Abstract | Crossref Full Text | Google Scholar

Fan, L., Wang, L., and Zhu, X. (2023). A novel microbe–drug association prediction model based on stacked autoencoder with multi-head attention mechanism. Sci. Rep. 13:7396. doi: 10.1038/s41598-023-34438-8

PubMed Abstract | Crossref Full Text | Google Scholar

Feng, W., Zhang, L., Yuan, Q., Wang, Y., Yao, P., Xia, P., et al. (2021). Effect of sub-minimal inhibitory concentration ceftazidime on the pathogenicity of uropathogenic Escherichia coli. Microb. Pathog. 151:104748. doi: 10.1016/j.micpath.2021.104748

PubMed Abstract | Crossref Full Text | Google Scholar

Gentry, L. (1985). Antimicrobial activity, pharmacokinetics, therapeutic indications and adverse reactions of ceftazidime. Pharmacotherapy 5, 254–267. doi: 10.1002/j.1875-9114.1985.tb03424.x

PubMed Abstract | Crossref Full Text | Google Scholar

Gill, S. R., Pop, M., DeBoy, R. T., Eckburg, P. B., Turnbaugh, P. J., Samuel, B. S., et al. (2006). Metagenomic analysis of the human distal gut microbiome. Science 312, 1355–1359. doi: 10.1126/science.1124234

PubMed Abstract | Crossref Full Text | Google Scholar

Gómez-Sánchez, E., Franco-de la Torre, L., Hernández-Gómez, A., Alonso-Castro, Á. J., Serafín-Higuera, N., Terán-Rosales, F., et al. (2023). Antagonistic, synergistic, and additive antibacterial interaction between ciprofloxacin and amoxicillin against Staphylococcus aureus. Fundam. Clin. Pharmacol. 37, 174–181. doi: 10.1111/fcp.12832

PubMed Abstract | Crossref Full Text | Google Scholar

Hattori, M., Tanaka, N., Kanehisa, M., and Goto, S. (2010). SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res. 38, W652–W656. doi: 10.1093/nar/gkq367

PubMed Abstract | Crossref Full Text | Google Scholar

Human Microbiome Project Consortium (2012). Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214. doi: 10.1038/nature11234

PubMed Abstract | Crossref Full Text | Google Scholar

Huttner, A., Bielicki, J., Clements, M. N., Frimodt-Møller, N., Muller, A. E., Paccaud, J. P., et al. (2020). Oral amoxicillin and amoxicillin-clavulanic acid: properties, indications and usage. Clin. Microbiol. Infect. 26, 871–879. doi: 10.1016/j.cmi.2019.11.028

PubMed Abstract | Crossref Full Text | Google Scholar

Kamneva, O. (2017). Genome composition and phylogeny of microbes predict their co-occurrence in the environment. PLoS Comput. Biol. 13:e1005366. doi: 10.1371/journal.pcbi.1005366

PubMed Abstract | Crossref Full Text | Google Scholar

Kim, J.-H., Jun, J., and Zhang, B.-T. (2018). Bilinear attention networks. NIPS’18: Proceedings of the 32nd International Conference on Neural Information Processing Systems

Google Scholar

Kuang, H., Zhang, Z., Zeng, B., Liu, X., Zuo, H., Xu, X., et al. (2024). A novel microbe–drug association prediction model based on graph attention networks and bilayer random forest. BMC Bioinformatics 25:78. doi: 10.1186/s12859-024-05687-9

PubMed Abstract | Crossref Full Text | Google Scholar

Lee, D., Jo, H., Jang, Y., Bae, S., Agura, T., Kang, D., et al. (2022). Alloferon and zanamivir show effective antiviral activity against influenza A virus (H1N1) infection in vitro and in vivo. Int. J. Mol. Sci. 24:678. doi: 10.3390/ijms24010678

PubMed Abstract | Crossref Full Text | Google Scholar

Leong, S., Korel, F., and King, J. (2023). Bacillus cereus: a review of “fried rice syndrome” causative agents. Microb. Pathog. 185:106418. doi: 10.1016/j.micpath.2023.106418

PubMed Abstract | Crossref Full Text | Google Scholar

Ley, R., Turnbaugh, P., Klein, S., and Gordon, J. I. (2006). Human gut microbes associated with obesity. Nature 444, 1022–1023. doi: 10.1038/4441022a

PubMed Abstract | Crossref Full Text | Google Scholar

Li, C. Z., Chang, H. M., Hsu, W. L., Venkatesan, P., Lin, M. H., and Lai, P. S. (2022). Curcumin-loaded oil-free self-assembled micelles inhibit the influenza a virus activity and the solidification of curcumin-loaded micelles for pharmaceutical applications. Pharmaceutics 14:2422. doi: 10.3390/pharmaceutics14112422

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, H., Bing, P., Zhang, M., Tian, G., Ma, J., Li, H., et al. (2023). MNNMDA: predicting human microbe-disease association via a method to minimize matrix nuclear norm. Comput. Struct. Biotechnol. J. 21, 1414–1423. doi: 10.1016/j.csbj.2022.12.053

PubMed Abstract | Crossref Full Text | Google Scholar

Ma, Q., Tan, Y., and Wang, L. (2023). GACNNMDA: a computational model for predicting potential human microbe–drug associations based on graph attention network and CNN-based classifier. BMC Bioinformatics 24:35. doi: 10.1186/s12859-023-05158-7

PubMed Abstract | Crossref Full Text | Google Scholar

Matei-Lațiu, M., Gal, A. F., Rus, V., Buza, V., Martonos, C., Lațiu, C., et al. (2023). Intestinal dysbiosis in rats: interaction between amoxicillin and probiotics, a histological and immunohistochemical evaluation. Nutrients 15:15. doi: 10.3390/nu15051105

PubMed Abstract | Crossref Full Text | Google Scholar

McCoubrey, L. E., Gaisford, S., Orlu, M., and Basit, A. W. (2022). Predicting drug-microbiome interactions with machine learning. Biotechnol. Adv. 54:107797. doi: 10.1016/j.biotechadv.2021.107797

PubMed Abstract | Crossref Full Text | Google Scholar

Nypaver, C., Dehlinger, C., and Carter, C. (2021). Influenza and influenza vaccine: a review. J. Midwifery Womens Health 66, 45–53. doi: 10.1111/jmwh.13203

PubMed Abstract | Crossref Full Text | Google Scholar

Ormond, L., Liu, P., Matuszewski, S., Renzette, N., Bank, C., Zeldovich, K., et al. (2017). The combined effect of oseltamivir and favipiravir on influenza A virus evolution. Genome Biol. Evol. 9, 1913–1924. doi: 10.1093/gbe/evx138

PubMed Abstract | Crossref Full Text | Google Scholar

Park, K. M., Kim, A. Y., Kim, H. J., Cho, Y. S., and Koo, M. (2022). Prevalence and characterization of toxigenic Bacillus cereus group isolated from low-moisture food products. Food Sci. Biotechnol. 31, 1615–1629. doi: 10.1007/s10068-022-01144-6

PubMed Abstract | Crossref Full Text | Google Scholar

Rajput, A., Thakur, A., Sharma, S., and Kumar, M. (2018). aBioflm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res. 46, D894–D900. doi: 10.1093/nar/gkx1157

PubMed Abstract | Crossref Full Text | Google Scholar

Recht, B., Fazel, M., and Parrilo, P. (2010). Guaranteed minimum-rank solutions of linear matrix equations via nuclear norm minimization. SIAM Rev. 52, 471–501. doi: 10.1137/070697835

PubMed Abstract | Crossref Full Text | Google Scholar

Sárvári, K. P., Rácz, N. B., and Burián, K. (2022). Epidemiology and antibiotic susceptibility in anaerobic bacteraemia: a 15-year retrospective study in south-eastern Hungary. Infect. Dis. 54, 16–25. doi: 10.1080/23744235.2021.1963469

PubMed Abstract | Crossref Full Text | Google Scholar

Schwabe, R. F., and Jobin, C. (2013). The microbiome and cancer. Nat. Rev. Cancer 13, 800–812. doi: 10.1038/nrc3610

PubMed Abstract | Crossref Full Text | Google Scholar

Sixt, T., Moretto, F., Das Neves, S., Amoureux, L., Neuwirth, C., Piroth, L., et al. (2024). Amoxicillin and ceftriaxone: a synergistic association against Listeria monocytogenes. Open Forum Infect. Dis. 11:ofae295. doi: 10.1093/ofid/ofae295

PubMed Abstract | Crossref Full Text | Google Scholar

Sommer, F., and Bäckhed, F. (2013). The gut microbiota-masters of host development and physiology. Nat. Rev. Microbiol. 11, 227–238. doi: 10.1038/nrmicro2974

PubMed Abstract | Crossref Full Text | Google Scholar

Sun, Y., Zhang, D. H., Cai, S. B., Ming, Z., Li, J. Q., and Chen, X. (2018). MDAD: a special resource for microbe–drug associations. Front. Cell. Infect. Microbiol. 8:424. doi: 10.3389/fcimb.2018.00424

PubMed Abstract | Crossref Full Text | Google Scholar

Szklarczyk, D., Gable, A., Lyon, D., Junge, A., Wyder, S., Huerta-Cepas, J., et al. (2019). STRING v11. Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613. doi: 10.1093/nar/gky1131

PubMed Abstract | Crossref Full Text | Google Scholar

Tan, Y., Zou, J., Kuang, L., Wang, X., Zeng, B., Zhang, Z., et al. (2022). GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder. BMC Bioinformatics 23:492. doi: 10.1186/s12859-022-05053-7

PubMed Abstract | Crossref Full Text | Google Scholar

Tian, Z., Yu, Y., Fang, H., Xie, W., and Guo, M. (2023). Predicting microbe–drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief. Bioinform. 24:bbac634. doi: 10.1093/bib/bbac634

PubMed Abstract | Crossref Full Text | Google Scholar

Ventura, M., O’Flaherty, S., Claesson, M. J., Turroni, F., Klaenhammer, T. R., van Sinderen, D., et al. (2009). Genome-scale analyses of health-promoting bacteria: probiogenomics. Nat. Rev. Microbiol. 7, 61–71. doi: 10.1038/nrmicro2047

PubMed Abstract | Crossref Full Text | Google Scholar

Wang, L., Tan, Y., Yang, X., Kuang, L., and Ping, P. Y. (2022). Review on predicting pairwise relationships between human microbes, drugs and diseases: from biological data to computational models. Brief. Bioinform. 23:bbac080. doi: 10.1093/bib/bbac080

Crossref Full Text | Google Scholar

Wang, L., Zhang, X., Zhou, X., Yang, F., Guo, Q., and Wang, M. (2023). Comparison of in vitro activity of ceftazidime-avibactam and imipenem-relebactam against clinical isolates of Pseudomonas aeruginosa. Microbiol. Spectr. 11:e0093223. doi: 10.1128/spectrum.00932-23

PubMed Abstract | Crossref Full Text | Google Scholar

Xuan, P., Gao, L., Sheng, N., Tiangang, Z., and Toshiya, N. (2020). Graph convolutional autoencoder and fully-connected autoencoder with attention mechanism based method for predicting drug-disease associations. IEEE J. Biomed. Health Inform. 25, 1793–1804. doi: 10.1109/JBHI.2020.3039502

PubMed Abstract | Crossref Full Text | Google Scholar

Yang, H., Ding, Y., Tang, J., and Gao, F. (2022). Inferring human microbe–drug associations via multiple kernel fusion on graph neural network. Knowl. Based Syst. 238:107888. doi: 10.1016/j.knosys.2021.107888

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Z., Zhu, Y., Pei, H., Wang, X., and Wang, L. (2023). An iterative model for identifying essential proteins based on the whole process network of protein evolution. Curr. Bioinform. 18, 359–373. doi: 10.2174/1574893618666230315154807

Crossref Full Text | Google Scholar

Zhu, L., Duan, G., Yan, C., and Wang, J. (2019). Prediction of microbe–drug associations based on KATZ measure. 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

Google Scholar

Zhu, B., Xu, Y., Zhao, P., Yiu, S. M., Yu, H., and Shi, J. Y. (2022). NNAN: nearest neighbor attention network to predict drug-microbe associations. Front. Microbiol. 13:846915. doi: 10.3389/fmicb.2022.846915

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: computational model, microbe–drug associations, bilinear attention networks, nuclear norm minimization, prediction

Citation: Liang M, Liu X, Li J, Chen Q, Zeng B, Wang Z, Li J and Wang L (2025) BANNMDA: a computational model for predicting potential microbe–drug associations based on bilinear attention networks and nuclear norm minimization. Front. Microbiol. 15:1497886. doi: 10.3389/fmicb.2024.1497886

Received: 18 September 2024; Accepted: 31 December 2024;
Published: 22 January 2025.

Edited by:

Kunal R. Jain, Sardar Patel University, India

Reviewed by:

Ravi Raghavbhai Sonani, University of Virginia, United States
Wei Ma, Peking University, China

Copyright © 2025 Liang, Liu, Li, Chen, Zeng, Wang, Li and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Juncai Li, MTgyNTM0NTk3N0BxcS5jb20=; Qijia Chen, NzUyMzE5MzgzQHFxLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.